Tag: Big data
-
Testing Machine Learning Algorithms with K-Fold Cross Validation
Posted on May 17, 2017, Level intermediate Resource Length medium
Norbert Krupa wrote blog post on choosing a machine learning algorithm, then using a validation technique. He uses Talend Studio without hand coding.
Tags machine-learning big-data
-
The Algorithms Behind Probabilistic Programming
Posted on February 1, 2017, Level beginner Resource Length medium
This post by Mike gives a feel for the content in our report on probabilistic programming by introducing the algorithms and technology that make probabilistic programming possible.
Tags programming big-data
-
Data Exploration with Python, Part 1
Posted on January 26, 2017, Level intermediate Resource Length long
Tony Ojeda witnessed the lack of structure in conventional approaches in Exploratory data analysis, so he decided to document his own process in an attempt to come up with a framework for data exploration.
Tags big-data data-science
-
Analyzing Big Data with Twitter
Posted on January 25, 2017, Level intermediate Resource Length 15h+
UC Berkeley published their Course Lectures: Analyzing Big Data With Twitter. Bit older but still very good - published and available for free. Over 15+ hours of video lectures. These lecture notes simply summarized the course at a high level.
Tags big-data
-
[Podcast] Using AI to build a comprehensive database of knowledge
Posted on January 13, 2017, Level beginner Resource Length 40 mins
O'Reilly Data Show - Mike Tung, founder and CEO of Diffbot - talks about extracting structured information from semi-structured or unstructured data sources ("dark data"). Diffbot is dedicated to building large-scale knowledge databases.
Tags big-data
-
Learn How To Import And Explore Data In R
Posted on January 9, 2017, Level beginner Resource Length medium
Sabeer Shaikh from Eduonix demonstrates data exploration and data management in R.
Tags big-data analytics