0% found this document useful (0 votes)

125 views7 pages

Computer Graphics Lab Manual

difference between machine learning , AI and data science

Uploaded by

durgeshcse8002

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

125 views7 pages

Computer Graphics Lab Manual

difference between machine learning , AI and data science

Uploaded by

durgeshcse8002

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Search Data Science Centra Search

HOME DATAVIZ HADOOP BIG DATA ANALYTICS WEBINARS DEEP LEARNING AI STATISTICS JOBS MEMBERSHIP FORUMS SEARCH CLASSIFIEDS

CONTACT

Subscribe to DSC Newsletter

All Blog Posts

My Blog
Add

Difference between Machine Learning, Data Science, AI, Deep Learning, and Statistics
Posted by Vincent Granville on January 2, 2017 at 8:30pm
View Blog

In this article, I clarify the various roles of the data scientist, and how data science compares and overlaps with related fields such as machine learning, deep learning, AI,
statistics, IoT, operations research, and applied mathematics. As data science is a broad discipline, I start by describing the different types of data scientists that one may
encounter in any business setting: you might even discover that you are a data scientist yourself, without knowing it. As in any scientific discipline, data scientists may
borrow techniques from related disciplines, though we have developed our own arsenal, especially techniques and algorithms to handle very large unstructured data sets
in automated ways, even without human interactions, to perform transactions in real-time or to make predictions.

1. Different Types of Data Scientists

To get started and gain some historical perspective, you can read my article about 9 types of data scientists, published in 2014, or my article where I compare data
science with 16 analytic disciplines, also published in 2014.

The following articles, published during the same time period, are still useful:

Data Scientist versus Data Architect

Data Scientist versus Data Engineer
Data Scientist versus Statistician
Data Scientist versus Business Analyst
How To Structure A Modern DataOps Team ✕
More recently (August 2016) Ajit Jaokar discussed Type A (Analytics) versus Type B (Builder) data scientist:
In this latest Data Science Central webinar join guest speakers, Forrester Principal Analyst Michele Goetz and Trifacta Director of Product Marketing Will Davis as they focus on the
Register today
The Type A Data Scientist can code well enough to work with data but is not necessarily an expert. The Type A data scientist may be an expert in experimental
roles and responsibilities required for today’s modern dataops teams to be successful.
design, forecasting, modelling, statistical inference, or other things typically taught in statistics departments. Generally speaking though, the work product of a data
scientist is not "p-values and confidence intervals" as academic statistics sometimes seems to suggest (and as it sometimes is for traditional statisticians working in
the pharmaceutical industry, for example). At Google, Type A Data Scientists are known variously as Statistician, Quantitative Analyst, Decision Support
Engineering Analyst, or Data Scientist, and probably a few more.

Type B Data Scientist: The B is for Building. Type B Data Scientists share some statistical background with Type A, but they are also very strong coders and may
be trained software engineers. The Type B Data Scientist is mainly interested in using data "in production." They build models which interact with users, often
serving recommendations (products, people you may know, ads, movies, search results). Source: click here.

I also wrote about the ABCD's of business processes optimization where D stands for data science, C for computer science, B for business science, and A for analytics
science. Data science may or may not involve coding or mathematical practice, as you can read in my article on low-level versus high-level data science. In a startup, data
scientists generally wear several hats, such as executive, data miner, data engineer or architect, researcher, statistician, modeler (as in predictive modeling) or developer.

While the data scientist is generally portrayed as a coder experienced in R, Python, SQL, Hadoop and statistics, this is just the tip of the iceberg, made popular by data
camps focusing on teaching some elements of data science. But just like a lab technician can call herself a physicist, the real physicist is much more than that, and her
domains of expertise are varied: astronomy, mathematical physics, nuclear physics (which is borderline chemistry), mechanics, electrical engineering, signal processing
(also a sub-field of data science) and many more. The same can be said about data scientists: fields are as varied as bioinformatics, information technology, simulations
and quality control, computational finance, epidemiology, industrial engineering, and even number theory.

In my case, over the last 10 years, I specialized in machine-to-machine and device-to-device communications, developing systems to automatically process large data
sets, to perform automated transactions: for instance, purchasing Internet traffic or automatically generating content. It implies developing algorithms that work with
unstructured data, and it is at the intersection of AI (artificial intelligence,) IoT (Internet of things,) and data science. This is referred to as deep data science. It is relatively
math-free, and it involves relatively little coding (mostly API's), but it is quite data-intensive (including building data systems) and based on brand new statistical technology
designed specifically for this context.

Prior to that, I worked on credit card fraud detection in real time. Earlier in my career (circa 1990) I worked on image remote sensing technology, among other things to
identify patterns (or shapes or features, for instance lakes) in satellite images and to perform image segmentation: at that time my research was labeled as computational
statistics, but the people doing the exact same thing in the computer science department next door in my home university, called their research artificial intelligence.
Today, it would be called data science or artificial intelligence, the sub-domains being signal processing, computer vision or IoT.

Also, data scientists can be found anywhere in the lifecycle of data science projects, at the data gathering stage, or the data exploratory stage, all the way up to statistical
modeling and maintaining existing systems.

2. Machine Learning versus Deep Learning

Before digging deeper into the link between data science and machine learning, let's briefly discuss machine learning and deep learning. Machine learning is a set of
algorithms that train on a data set to make predictions or take actions in order to optimize some systems. For instance, supervised classification algorithms are used to
classify potential clients into good or bad prospects, for loan purposes, based on historical data. The techniques involved, for a given task (e.g. supervised clustering), are
varied: naive Bayes, SVM, neural nets, ensembles, association rules, decision trees, logistic regression, or a combination of many. For a detailed list of algorithms, click
here. For a list of machine learning problems, click here.

All of this is a subset of data science. When these algorithms are automated, as in automated piloting or driver-less cars, it is called AI, and more specifically, deep
learning. Click here for another article comparing machine learning with deep learning. If the data collected comes from sensors and if it is transmitted via the Internet,
then it is machine learning or data science or deep learning applied to IoT.

Some people have a different definition for deep learning. They consider deep learning as neural networks (a machine learning technique) with a deeper layer. The
question was asked on Quora recently, and below is a more detailed explanation (source: Quora)

AI (Artificial intelligence) is a subfield of computer science, that was created in the 1960s, and it was (is) concerned with solving tasks that are easy for humans, but
hard for computers. In particular, a so-called Strong AI would be a system that can do anything a human can (perhaps without purely physical things). This is fairly
generic, and includes all kinds of tasks, such as planning, moving around in the world, recognizing objects and sounds, speaking, translating, performing social or
business transactions, creative work (making art or poetry), etc.

NLP (Natural language processing) is simply the part of AI that has to do with language (usually written).

Machine learning is concerned with one aspect of this: given some AI problem that can be described in discrete terms (e.g. out of a particular set of actions, which
one is the right one), and given a lot of information about the world, figure out what is the “correct” action, without having the programmer program it in. Typically
some outside process is needed to judge whether the action was correct or not. In mathematical terms, it’s a function: you feed in some input, and you want it to to
produce the right output, so the whole problem is simply to build a model of this mathematical function in some automatic way. To draw a distinction with AI, if I can
write a very clever program that has human-like behavior, it can be AI, but unless its parameters are automatically learned from data, it’s not machine learning.

Deep learning is one kind of machine learning that’s very popular now. It involves a particular kind of mathematical model that can be thought of as a composition of
simple blocks (function composition) of a certain type, and where some of these blocks can be adjusted to better predict the final outcome.

What is the difference between machine learning and statistics?

This article tries to answer the question. The author writes that statistics is machine learning with confidence intervals for the quantities being predicted or estimated. I tend
to disagree, as I have built engineer-friendly confidence intervals that don't require any mathematical or statistical knowledge.

3. Data Science versus Machine Learning

Machine learning and statistics are part of data science. The word learning in machine learning means that the algorithms depend on some data, used as a training set, to
fine-tune some model or algorithm parameters. This encompasses many techniques such as regression, naive Bayes or supervised clustering. But not all techniques fit in
this category. For instance, unsupervised clustering - a statistical and data science technique - aims at detecting clusters and cluster structures without any a-priori
knowledge or training set to help the classification algorithm. A human being is needed to label the clusters found. Some techniques are hybrid, such as semi-supervised
classification. Some pattern detection or density estimation techniques fit in this category.

Data science is much more than machine learning though. Data, in data science, may or may not come from a machine or mechanical process (survey data could be
manually collected, clinical trials involve a specific type of small data) and it might have nothing to do with learning as I have just discussed. But the main difference is the
fact that data science covers the whole spectrum of data processing, not just the algorithmic or statistical aspects. In particular, data science also covers

data integration
distributed architecture
automating machine learning
data visualization
dashboards and BI
data engineering
deployment in production mode
How To Structure
automated, A Modern
data-driven DataOps
decisions Team ✕
InOf
thiscourse,
latest Data Scienceorganisations,
in many Central webinardata
join guest speakers,
scientists Forrester
focus Principal
on only Analyst
one part Michele
of this Goetz and
process. Trifacta
To read Director
about some of Product Marketing
of my original Will Davis as to
contributions they focus
data on the click here.
science, Register today
roles and responsibilities required for today’s modern dataops teams to be successful.
Follow @analyticbridge 191K followers

Top DSC Resources

Article: What is Data Science? 24 Fundamental Articles Answering This Question

Article: Hitchhiker's Guide to Data Science, Machine Learning, R, Python
Tutorial: Data Science Cheat Sheet
Tutorial: How to Become a Data Scientist - On Your Own
Categories: Data Science - Machine Learning - AI - IoT - Deep Learning
Tools: Hadoop - DataViZ - Python - R - SQL - Excel
Techniques: Clustering - Regression - SVM - Neural Nets - Ensembles - Decision Trees
Links: Cheat Sheets - Books - Events - Webinars - Tutorials - Training - News - Jobs
Links: Announcements - Salary Surveys - Data Sets - Certification - RSS Feeds - About Us
Newsletter: Sign-up - Past Editions - Members-Only Section - Content Search - For Bloggers
DSC on: Ning - Twitter - LinkedIn - Facebook - GooglePlus

Follow us on Twitter: @DataScienceCtrl | @AnalyticBridge

Like
96 members like this

Tweet
Share Facebook

Like 1.6K

< Previous Post

Next Post >

Comment

You need to be a member of Data Science Central to add comments!

Join Data Science Central

Comment by Pouya Esmailian on August 1, 2018 at 9:10pm

Great article!

Comment by johan boie on July 9, 2018 at 10:24am

Thanks for that much information !!

Comment by Analytics Path on December 25, 2017 at 2:00am

Hi,
Thanks for sharing such a great article with us On Data Science
Data Science Training In Hyderabad

Comment by sravanthi kakarla on December 5, 2017 at 4:13am

Thanks a lot , much appreciated. This gives an insight to those who are digging deep to know AI, IoT and Data science in the present day situation where their
importance is growing rapidly

Comment by Azman Ali on November 29, 2017 at 8:16pm

Good Sharing

Comment by Hariprasad on November 22, 2017 at 11:34pm

Hi,

Thanks for sharing the great information about data science, statistics,… Its useful and helpful information…Keep Sharing.

Thanks

Hari

How To Structure A Modern DataOps Team ✕

Comment by Parker LAU on October 15, 2017 at 5:17pm
In this latest Data Science Central webinar join guest speakers, Forrester Principal Analyst Michele Goetz and Trifacta Director of Product Marketing Will Davis as they focus on the
Register today
Very clear and
roles and responsibilities master
required forpiece
today’sindeed.
modern dataops teams to be successful.
Comment by Joana Oliveira on July 17, 2017 at 12:04am

Thank you for sharing this !

Comment by Yogesh Maney on July 5, 2017 at 8:53pm

Thank you for sharing this Vincent

Comment by Pedro Deloso Recio on April 23, 2017 at 10:47pm

Talking about Machine Learning, I think the main challenge is to make a decision between options that are contradictory between them. If there are not
contradictions, the problem would be solved using deterministic logic, however complex. Making a decision between contradictions is what an emotional system is
used for.

‹ Previous
1
2
Next ›
Page 2 Go

RSS

Welcome to
Data Science Central

Or sign in with:

How To Structure A Modern DataOps Team ✕

@DataScienceCtrl | RSS Feeds

Three Predictive Analytics Traps to Avoid

BI to AI: Harnessing the power of Data

Forrester Insights: Analysis-Ready Data

Northwestern’s MS in Data Science

Manipulating Data in Apache Spark™ [eBook]

Advance Your Career at NYU Stern

Get a ticket while you still can to Tableau Conference 2018

Learn Top 3 Must-Haves to Become AI-Driven

How
UnlockTo
theStructure
Power of BigAData
Modern
AnalyticsDataOps Team ✕
InMS
thisin
latest Data Science
Business CentralDrive
Analytics: webinar joinCareer
Your guest speakers,
Forward Forrester Principal Analyst Michele Goetz and Trifacta Director of Product Marketing Will Davis as they focus on the
Register today
roles and responsibilities required for today’s modern dataops teams to be successful.
VIDEOS

DSC Webinar Series: Deep Learning - Training your Neural Network

Added by Tim Matteson 0 Comments 0 Likes

DSC Webinar Series: The Essentials of Training Data for Machine Learning
Added by Tim Matteson 0 Comments 2 Likes

DSC Webinar Series: 4 Ways to Tackle Common Data Prep Issues

Added by Tim Matteson 0 Comments 0 Likes

Add Videos
View All

RESOURCES

Migrating an Excel Spreadsheet to MySQL and to Spark 2.0.1 (Part 1)

Introduction to Programming in Stata
Benchmarking 20 Machine Learning Models Accuracy and Speed
Stata Cheat Sheet
Selection of best articles from our past weekly digests
Statistical Analysis Advisor Chart
Selection of best articles from our past weekly digests
Free Online Book: Forecasting, Principles and Practice
38 Seminal Articles Every Data Scientist Should Read
Black-box Confidence Intervals: Excel and Perl Implementation

TOP CATEGORIES

Machine Learning

R Programming

Python for Data Science

Visualization, Dashboards

NoSQL and NewSQL

Big Data

Cheat Sheets

Internet of Things

Excel

Badges | Report an Issue | Privacy Policy | Terms of Service

How To Structure A Modern DataOps Team ✕

In this latest Data Science Central webinar join guest speakers, Forrester Principal Analyst Michele Goetz and Trifacta Director of Product Marketing Will Davis as they focus on the
Register today
roles and responsibilities required for today’s modern dataops teams to be successful.
How To Structure A Modern DataOps Team ✕
In this latest Data Science Central webinar join guest speakers, Forrester Principal Analyst Michele Goetz and Trifacta Director of Product Marketing Will Davis as they focus on the
Register today
roles and responsibilities required for today’s modern dataops teams to be successful.

Datascience
80% (10)
Datascience
28 pages
Fundamentals of Data Science Course Overview
No ratings yet
Fundamentals of Data Science Course Overview
65 pages
Lecture 2-Quick Overview of Data Science
No ratings yet
Lecture 2-Quick Overview of Data Science
18 pages
Data Science
No ratings yet
Data Science
85 pages
Session 2-1
No ratings yet
Session 2-1
32 pages
INTRODUCTION and M1-CH-1
No ratings yet
INTRODUCTION and M1-CH-1
63 pages
Data Science & Machine Learning Insights
No ratings yet
Data Science & Machine Learning Insights
29 pages
DCIT414 Session 2
No ratings yet
DCIT414 Session 2
32 pages
DS Unit 1 - ABM
No ratings yet
DS Unit 1 - ABM
103 pages
Module 1 Introduction Ds
No ratings yet
Module 1 Introduction Ds
18 pages
Assignament
No ratings yet
Assignament
4 pages
Data Science Unit 1
No ratings yet
Data Science Unit 1
70 pages
Fintech J Chap 6
No ratings yet
Fintech J Chap 6
18 pages
Data Science Presentation Enhanced
No ratings yet
Data Science Presentation Enhanced
34 pages
Intro To Career in Data Science: Md. Rabiul Islam
100% (1)
Intro To Career in Data Science: Md. Rabiul Islam
62 pages
Beginner's Guide to Data Science
No ratings yet
Beginner's Guide to Data Science
26 pages
Data Science Applications by Rajesh - 91
No ratings yet
Data Science Applications by Rajesh - 91
46 pages
Data Science Class Lecture
No ratings yet
Data Science Class Lecture
22 pages
Dia 1
No ratings yet
Dia 1
88 pages
01 Introduction
No ratings yet
01 Introduction
37 pages
Introduction To Datasciecne
No ratings yet
Introduction To Datasciecne
50 pages
Question 3
No ratings yet
Question 3
6 pages
Data Science 101 Overview
No ratings yet
Data Science 101 Overview
23 pages
Intro To Data Science - LVC1 With Markings
No ratings yet
Intro To Data Science - LVC1 With Markings
22 pages
Intro To Data Science - LVC1
No ratings yet
Intro To Data Science - LVC1
22 pages
Aall
No ratings yet
Aall
41 pages
Ids Unit-I
No ratings yet
Ids Unit-I
34 pages
Big Data CH01
No ratings yet
Big Data CH01
12 pages
Data Sci
No ratings yet
Data Sci
67 pages
Lecture 1 Introduction
No ratings yet
Lecture 1 Introduction
16 pages
Research On Data Science, Data Analytics and Big Data Rahul Reddy Nadikattu
No ratings yet
Research On Data Science, Data Analytics and Big Data Rahul Reddy Nadikattu
7 pages
CSIC 221: Machine Learning & Data Analytics: Mayank Dave Professor Dept. of Computer Engineering
No ratings yet
CSIC 221: Machine Learning & Data Analytics: Mayank Dave Professor Dept. of Computer Engineering
23 pages
DS-BDS (Unit 1) Technical
No ratings yet
DS-BDS (Unit 1) Technical
22 pages
Data-Science - Introduction
No ratings yet
Data-Science - Introduction
35 pages
Data Science Chacha
No ratings yet
Data Science Chacha
150 pages
What Is Data Science - IBM
No ratings yet
What Is Data Science - IBM
10 pages
Data Science Unit 1
No ratings yet
Data Science Unit 1
30 pages
Big Data and Data Science Overview
No ratings yet
Big Data and Data Science Overview
31 pages
Unit 1
No ratings yet
Unit 1
8 pages
Introduction To Data Science What Is Data Science?
No ratings yet
Introduction To Data Science What Is Data Science?
11 pages
Data Analytics vs. Data Science Guide
No ratings yet
Data Analytics vs. Data Science Guide
16 pages
Ids Unit 1 Final
No ratings yet
Ids Unit 1 Final
30 pages
Ch7-Overview of Data Science-Part 1
No ratings yet
Ch7-Overview of Data Science-Part 1
37 pages
DSC Unit 1
No ratings yet
DSC Unit 1
59 pages
Understanding Data Science Essentials
No ratings yet
Understanding Data Science Essentials
6 pages
Data Science Presentation Final
No ratings yet
Data Science Presentation Final
34 pages
Seminar On Data Science
100% (7)
Seminar On Data Science
25 pages
Introduction To Data Science UNIT 1
No ratings yet
Introduction To Data Science UNIT 1
44 pages
M 1 FDS Notes
No ratings yet
M 1 FDS Notes
19 pages
Data Scientist or Machine Learning Engineer 1670079855
No ratings yet
Data Scientist or Machine Learning Engineer 1670079855
16 pages
Ids (R22) U1 PPT 03092024
No ratings yet
Ids (R22) U1 PPT 03092024
87 pages
Unit-3 Intr Data Science
No ratings yet
Unit-3 Intr Data Science
150 pages
Unit I-Introduction of Data Science & R Programming: What Is Data Science? What Is Data Science?
No ratings yet
Unit I-Introduction of Data Science & R Programming: What Is Data Science? What Is Data Science?
30 pages
CME538 Lecture 1 Slide 1
No ratings yet
CME538 Lecture 1 Slide 1
122 pages
Introduction To Data Science - Ii-I
No ratings yet
Introduction To Data Science - Ii-I
128 pages
Question 1
No ratings yet
Question 1
5 pages
Unit 3 Part 1
No ratings yet
Unit 3 Part 1
43 pages
Self Made Problems - D. K. Olukoya
100% (2)
Self Made Problems - D. K. Olukoya
17 pages
Gabi Resume
No ratings yet
Gabi Resume
2 pages
Info Privacy & Security Guide
No ratings yet
Info Privacy & Security Guide
14 pages
Quadratic Equation
No ratings yet
Quadratic Equation
5 pages
Engineering & Environmental Ethics
No ratings yet
Engineering & Environmental Ethics
5 pages
Class 10 History Nationalism in India
No ratings yet
Class 10 History Nationalism in India
3 pages
Wednesday Lessonss
No ratings yet
Wednesday Lessonss
16 pages
Architectural Research Quarterly
No ratings yet
Architectural Research Quarterly
9 pages
Anthropology Research Methods Explained
No ratings yet
Anthropology Research Methods Explained
7 pages
MML Commands
100% (1)
MML Commands
375 pages
Alan Bates Case: Third Witness Statement
No ratings yet
Alan Bates Case: Third Witness Statement
6 pages
Patient-Focused Drug Development Guide
No ratings yet
Patient-Focused Drug Development Guide
49 pages
SpeakOut Answers ProgressTests1-2A Key
50% (6)
SpeakOut Answers ProgressTests1-2A Key
1 page
Hum 101 Instructor Course Guide
No ratings yet
Hum 101 Instructor Course Guide
3 pages
Kristian Tlangau - April, 2011
No ratings yet
Kristian Tlangau - April, 2011
36 pages
Land Acquisition Act 2034 Overview
No ratings yet
Land Acquisition Act 2034 Overview
5 pages
PDP Study Guide 1
No ratings yet
PDP Study Guide 1
33 pages
IIM Indore Annual Highlights
No ratings yet
IIM Indore Annual Highlights
98 pages
KWL Chart
No ratings yet
KWL Chart
1 page
GUDUCHI
No ratings yet
GUDUCHI
14 pages
Meth Trafficking Appeal Ruling
No ratings yet
Meth Trafficking Appeal Ruling
17 pages
Hidden Meaning of Kurma Avtaar Explained
No ratings yet
Hidden Meaning of Kurma Avtaar Explained
212 pages
Hutedghjoo
No ratings yet
Hutedghjoo
7 pages
Primary Health Care Expectations and Reality of Bangladesh1
No ratings yet
Primary Health Care Expectations and Reality of Bangladesh1
17 pages
Us B.ing
No ratings yet
Us B.ing
11 pages
Judicial Affidavit Agri
No ratings yet
Judicial Affidavit Agri
5 pages
Bài Tập Biển Báohs
No ratings yet
Bài Tập Biển Báohs
12 pages
ADXR - Volume 42 - No 600 - October 2024
No ratings yet
ADXR - Volume 42 - No 600 - October 2024
26 pages
Chapter II Marketing Aspect
No ratings yet
Chapter II Marketing Aspect
15 pages
JAMES A. HALL - Accounting Information System ch05
No ratings yet
JAMES A. HALL - Accounting Information System ch05
41 pages