How is Data Science related to Machine learning?



I went through this comparison of analytic disciplines and this perspective on machine learning, but I could not find answers to the following:

  1. How is Data Science related to Machine learning?
  2. How is it not related to Machine Learning?

Subham Tripathi

Posted 2015-05-21T17:57:35.003

Reputation: 189

Probably this question should exist as community wiki. – Shagun Sodhani – 2015-05-25T14:54:08.057



Data science is a much broader concept than machine learning. It starts with simple data visualization and descriptive statistics to gain insights, and it includes manipulations such as cleansing to prepare the data, all before you can apply any ML algorithms.

Basically, large areas such as big data, visualization and data preprocessing fall outside the scope of machine learning, yet they are all integral parts of "Data Science".
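To make the "before you can use ML" part concrete, here is a minimal sketch of cleansing plus descriptive statistics using only the Python standard library. The readings in `raw` and the helper names `cleanse` and `describe` are made up for illustration; a real project would more likely use a dataframe library.

```python
import statistics

# Hypothetical raw readings; None and the string "n/a" stand in for missing values.
raw = [12.0, 15.5, None, 14.2, "n/a", 13.8, 15.1]

def cleanse(values):
    """Keep only entries that are real numbers (drops None and strings)."""
    return [float(v) for v in values if isinstance(v, (int, float))]

def describe(values):
    """Return a few descriptive statistics for a list of numbers."""
    return {
        "count": len(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),
    }

clean = cleanse(raw)
summary = describe(clean)
print(summary)
```

No model is trained here at all, which is the point: this kind of preparation and summarizing is data science work that sits outside machine learning proper.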

Large resolution image:


Posted 2015-05-21T17:57:35.003

Reputation: 4 894

where is that image to be found in original size? – Walter Tross – 2015-05-24T13:41:44.707

@WalterTross, here – IharS – 2015-05-24T14:33:45.417


Machine Learning tries to create systems that can learn from data. As such it can be used in a wide variety of settings, for example to make robots learn to walk or train virtual agents to play video games.

Data science concerns itself with the extraction of knowledge from data. To do so, it draws on a range of techniques from different disciplines. Machine learning contributes some techniques that can be very useful to a data scientist, such as deep learning, decision trees and various clustering algorithms. However, machine learning has more to offer than data science uses, and data science does not rely solely on machine learning.
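As one illustration of a clustering technique a data scientist might borrow, here is a rough sketch of k-means (Lloyd's algorithm) on one-dimensional data. The function name `kmeans_1d`, the sample points and the starting centers are assumptions chosen for the example; in practice you would use an existing library implementation.

```python
def kmeans_1d(points, centers, iterations=10):
    """Lloyd's algorithm on 1-D data with caller-supplied starting centers."""
    centers = list(centers)
    clusters = [[] for _ in centers]
    for _ in range(iterations):
        # Assignment step: each point joins its nearest center.
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its cluster.
        # An empty cluster keeps its previous center.
        centers = [
            sum(c) / len(c) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
    return centers, clusters

# Hypothetical measurements that form two obvious groups.
points = [1.0, 1.2, 0.8, 9.0, 9.5, 10.1]
centers, clusters = kmeans_1d(points, centers=[0.0, 5.0])
print(centers, clusters)
```

The algorithm "learns" the two group centers from the data alone; deciding what the groups mean and what to do about them is the data science part.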


Posted 2015-05-21T17:57:35.003

Reputation: 181


Data science is much broader. It's sort of a catch-all term that right now honestly doesn't have a very clear definition. But data science includes all of the skills and techniques required to make sense of data that has high velocity (it's coming at you quickly), volume (there's a lot of it), or variety (it's messy, like natural-language text). This means that it certainly includes machine learning and AI, but it's also about the tools one might use in a real-world situation, such as SQL, Hadoop or Spark (and related knowledge such as parallel programming). Depending on who you ask, data science may also include the communication side, such as making good graphs and using Excel.

Basically, Data Science is ML+.
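For the SQL side of that "plus", here is a small sketch using Python's built-in `sqlite3` module with an in-memory database. The `events` table, its columns and the rows are invented for the example.

```python
import sqlite3

# In-memory database, so nothing is written to disk.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (customer TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO events VALUES (?, ?)",
    [("ann", 10.0), ("bob", 4.0), ("ann", 6.0), ("bob", 1.0)],
)

# Aggregate spend per customer: a typical data science query with no ML involved.
rows = conn.execute(
    "SELECT customer, SUM(amount) FROM events GROUP BY customer ORDER BY customer"
).fetchall()
conn.close()
print(rows)
```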


Posted 2015-05-21T17:57:35.003

Reputation: 323


Data Science is, as others have noted, a much broader term than machine learning. Applying machine learning techniques is one aspect of data science. Data Science, more generally, is the science of deriving knowledge from data. The term was coined back in 1960 and has kept evolving to describe the flow and interplay of problem definition, data collection, data transformation, data modeling/analysis, and decision making. So to answer your question specifically:

  1. Machine learning aids data science by providing a suite of algorithms for data modeling/analysis (through training of machine learning algorithms), decision making (through streaming, online learning and real-time testing, all of which are topics within machine learning), and even data preparation (machine learning algorithms can automatically detect anomalies in the data).
  2. Data Science stitches together ideas and algorithms drawn from machine learning to create a solution, and in doing so also borrows heavily from traditional statistics, domain expertise and basic mathematics. In this sense, data science is the process of solving a use case and delivering a solution, whereas machine learning is an important cog in that solution.
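The flow described above (collection, transformation, modeling, decision) can be sketched end to end in a few lines. Everything here is hypothetical: the `records` data, the `fit_line` helper, and the decision threshold of 12.0 are assumptions for illustration, and ordinary least squares stands in for whatever model a real project would use.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# Data collection (hypothetical): ad spend vs. sales, with one incomplete record.
records = [(1.0, 3.0), (2.0, 5.0), (3.0, None), (4.0, 9.0), (5.0, 11.0)]

# Data transformation: drop incomplete records.
complete = [(x, y) for x, y in records if y is not None]
xs, ys = zip(*complete)

# Data modeling: the machine learning step, one cog in the whole process.
slope, intercept = fit_line(xs, ys)

# Decision making: act on the model's prediction for a spend of 6.0.
predicted = slope * 6.0 + intercept
decision = "increase spend" if predicted > 12.0 else "hold"
print(slope, intercept, predicted, decision)
```

Only the `fit_line` call is machine learning in the narrow sense; the surrounding steps are what the broader term covers.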


Posted 2015-05-21T17:57:35.003

Reputation: 1 515