Unit 3 - Data Science, Machine Learning

The document discusses the history and evolution of machine learning and artificial intelligence. It covers early concepts from ancient philosophers through the development of computers and the modern era of deep learning and big data. Key algorithms and approaches are explained, including neural networks, reinforcement learning, and natural language processing.

Uploaded by

badaltanwarr

Section C

Data Science. Machine learning: history and evolution. AI evolution.
Statistics vs. data mining vs. data analytics vs. data science.
Supervised and unsupervised learning.
Important Definitions
• Data mining is a process that uses statistical, mathematical, and artificial
intelligence techniques to extract and identify useful information and
subsequent knowledge (or patterns) from large sets of data.
• A more formal definition describes data mining as “the nontrivial process of
identifying valid, novel, potentially useful, and ultimately understandable
patterns in data stored in structured databases,” where the data are organized
in records structured by categorical, ordinal, and continuous variables.
• Machine learning: The process by which a computer learns from
experience (e.g., using programs that can learn from historical cases).
Machine learning - History and Evolution
Early Concepts (Ancient to 1940s):
• Ancient philosophers and mathematicians like Aristotle and Pythagoras laid the
foundation for logical reasoning and mathematical principles, which are
fundamental to machine learning.
• In the 17th century, philosophers and mathematicians such as René Descartes and
Gottfried Wilhelm Leibniz considered whether reasoning could be mechanized;
Leibniz proposed a formal calculus of reasoning.
• In the 19th century, George Boole's work on Boolean algebra provided a
mathematical framework for logic, which became essential for machine learning
algorithms.
• Alan Turing's 1936 paper on the Turing machine introduced the concept of a
universal machine that could simulate any other such machine. This idea is
fundamental to modern computing and machine learning.
Early Computational Models (1940s-1950s):
• The development of electronic computers in the mid-20th century allowed
researchers to experiment with computational models of learning and artificial
intelligence (AI).
• In 1950, Alan Turing published a paper titled "Computing Machinery and
Intelligence," which introduced the Turing Test as a measure of a machine's ability
to exhibit intelligent behavior.

Rule-Based Systems (1950s-1960s):
• Early AI research focused on symbolic, rule-based programs, which used
predefined rules to make decisions or solve problems. These were forerunners of
the expert systems that followed.
• The General Problem Solver (GPS), developed by Allen Newell, Herbert A. Simon,
and J. C. Shaw in the late 1950s, was one of the first computer programs designed
to solve a wide range of formally stated problems.

Symbolic AI and Knowledge-Based Systems (1960s-1980s):

• Symbolic AI, also known as "good old-fashioned AI" (GOFAI), dominated this era.
Researchers used formal logic and symbols to represent knowledge and make
inferences.
• Expert systems, such as MYCIN (used for medical diagnosis) and Dendral (used for
chemistry), gained popularity during this time.

Connectionism and Neural Networks (1980s-1990s):

• The field of artificial neural networks gained attention, inspired by the human
brain's neural structure.
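• Backpropagation, a key algorithm for training neural networks, was popularized
in 1986 by Rumelhart, Hinton, and Williams.
• However, neural networks fell out of favor in the 1990s, held back by limited
data and computing power and overshadowed by other approaches such as support
vector machines.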

Reinforcement Learning and Support Vector Machines (1990s-2000s):

• Reinforcement learning gained prominence, with algorithms like Q-learning and
the use of reinforcement learning in game-playing agents.
• Support vector machines (SVMs) became popular for classification tasks.

Big Data and Deep Learning (2010s-Present):

• The advent of big data and powerful hardware led to a resurgence of neural
networks, particularly deep learning.
• Convolutional neural networks (CNNs) revolutionized image recognition, and
recurrent neural networks (RNNs) improved sequence modeling.
• Advances in deep learning contributed to significant breakthroughs in speech
recognition, natural language processing, and computer vision.
• Reinforcement learning saw notable successes, with deep reinforcement learning
methods achieving superhuman performance in games like Go and Dota 2.
Data Science
• Data science is commonly defined as a methodology by which
actionable insights can be inferred from data.
• In general, data science allows us to adopt four different
strategies to explore the world using data:
1. Probing reality: Data can be gathered by passive or by active
methods. In the latter case, data represents the response of
the world to our actions.
• Analysis of those responses can be extremely valuable when
it comes to making decisions about our subsequent actions.
For example:
• Use of A/B testing for web development: What is the best
button size and color?
• The best answer can only be found by probing the world.
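One common way to read the result of such an A/B test is a two-proportion z-test. The sketch below is only an illustration under stated assumptions: the function name `ab_test` and the click and visitor counts are hypothetical, and the notes do not prescribe this particular test.

```python
import math

def ab_test(clicks_a, visitors_a, clicks_b, visitors_b):
    """Two-sided two-proportion z-test for a difference in click-through rate."""
    rate_a = clicks_a / visitors_a
    rate_b = clicks_b / visitors_b
    # Pooled rate under the null hypothesis that both variants perform equally.
    pooled = (clicks_a + clicks_b) / (visitors_a + visitors_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / visitors_a + 1 / visitors_b))
    z = (rate_b - rate_a) / std_err
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return rate_a, rate_b, z, p_value

# Hypothetical data: variant A (small button) vs. variant B (large button).
rate_a, rate_b, z, p_value = ab_test(120, 2400, 156, 2400)
print(f"A: {rate_a:.3f}  B: {rate_b:.3f}  z = {z:.2f}  p = {p_value:.4f}")
```

A small p-value suggests the difference between the two buttons is unlikely to be due to chance alone; the threshold for "small" is a choice the experimenter makes before running the test.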
2. Pattern discovery : Divide and conquer is an old heuristic used to
solve complex problems.
• Datified problems can be analyzed automatically to discover useful
patterns and natural clusters that can greatly simplify their solutions.
• The use of this technique to profile users is a critical ingredient
today in such important fields as programmatic advertising or digital
marketing.
3. Predicting future events: Predictive analytics allows decisions to be
made in anticipation of future events, not only reactively.
• The identification of predictable events represents valuable
knowledge.
• For example, predictive analytics can be used to optimize the tasks
planned for retail store staff during the following week, by
analyzing data such as weather, historic sales, traffic conditions, etc.
4. Understanding people and the world: The development of
deep learning methods for natural language understanding and
for visual object recognition is a good example of this kind of
research.
Artificial Intelligence
• An artificial intelligence is a system that can learn how to learn.
• AI is the subfield of computer science concerned with symbolic reasoning and
problem-solving.
• It can also be described as a series of instructions (an algorithm) that allows
computers to write their own algorithms without being explicitly programmed to
do so.
• Artificial Intelligence (AI) is a broad and interdisciplinary field of computer
science that focuses on creating intelligent agents or systems capable of
simulating human-like cognitive processes.
• These systems aim to perform tasks that typically require human intelligence,
such as understanding natural language, recognizing patterns, making
decisions, and learning from experience.
• AI is an interdisciplinary field that covers (and requires) the study of manifold
sub-disciplines, such as natural language processing and computer vision, as well
as the Internet of Things and robotics.
Artificial Intelligence - Some key aspects and concepts
• Machine Learning: Machine learning is a subset of AI that focuses on developing
algorithms and models that allow computers to learn from data and improve
their performance on a specific task over time. Common machine-learning
techniques include supervised learning, unsupervised learning, and
reinforcement learning.
• Deep Learning: Deep learning is a subfield of machine learning that involves
neural networks with multiple layers (deep neural networks). It has been highly
successful in tasks such as image and speech recognition and natural language
processing.
• Natural Language Processing (NLP): NLP is a branch of AI that deals with the
interaction between computers and human language. It enables computers to
understand, interpret, and generate human language, facilitating applications
like chatbots, translation, and sentiment analysis.
• Computer Vision: Computer vision involves teaching computers to interpret and
understand visual information from the world, such as images and videos. It has
applications in facial recognition, object detection, and autonomous vehicles.
• Robotics: AI plays a crucial role in robotics, enabling robots to perceive their
environment, make decisions, and perform tasks autonomously. This has
applications in industries like manufacturing, healthcare, and space exploration.

• Reinforcement Learning: In reinforcement learning, agents learn to make decisions
by interacting with an environment. They receive feedback in the form of rewards
or penalties and aim to maximize their cumulative reward over time. This approach
is used in autonomous systems, game playing, and robotics.
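The reward-driven loop described above can be sketched with tabular Q-learning, the algorithm named earlier in these notes. Everything specific here is an assumption made for illustration: the five-state corridor environment, the `step` and `train` helpers, and the values of `ALPHA`, `GAMMA`, and `EPSILON` are hypothetical.

```python
import random

N_STATES = 5          # states 0..4; reaching state 4 ends the episode
ACTIONS = (-1, +1)    # index 0 = move left, index 1 = move right
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.1  # learning rate, discount, exploration

def step(state, action):
    """Hypothetical environment: reward 1 for reaching the last state, else 0."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done

def train(episodes=500, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action_index]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy: explore sometimes (and on ties), otherwise exploit.
            if rng.random() < EPSILON or q[state][0] == q[state][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, ACTIONS[a])
            # Q-learning update: move the estimate toward
            # reward + discounted best value of the next state.
            target = reward + (0.0 if done else GAMMA * max(q[next_state]))
            q[state][a] += ALPHA * (target - q[state][a])
            state = next_state
    return q

q_table = train()
# Greedy policy for the non-terminal states.
policy = ["left" if row[0] > row[1] else "right" for row in q_table[:-1]]
print(policy)
```

The agent is never told that moving right is correct; it infers this from the reward received at the end of the corridor, which is the sense in which it maximizes cumulative reward over time.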
Data Mining Vs Statistics
Data Mining | Statistics
Exploratory: dig into the data first, discover novel patterns, and then form theories. | Confirmatory: state a theory first, then test it using various statistical tools.
Involves data cleaning. | Statistical methods are applied to clean data.
Usually involves working with large datasets. | Usually involves working with small datasets.
Makes generous use of heuristic thinking. | Leaves no scope for heuristic thinking.
Inductive process. | Deductive process (in this framing, it tests a stated hypothesis rather than making predictions).
Numeric and non-numeric data. | Numeric data.
Less concerned with data collection. | More concerned with data collection.
Popular data mining methods include estimation, classification, neural networks, clustering, association, and visualization. | Popular statistical methods include inferential and descriptive statistics.
Data Science Vs Data Analytics:
Data science is an umbrella term that encompasses data analytics, data mining, machine learning,
and several other related disciplines. While a data scientist is expected to forecast the future based
on past patterns, data analysts extract meaningful insights from various data sources. A data
scientist creates questions, while a data analyst finds answers to the existing set of questions.
Scope
• Data Science (macro): Data science encompasses a broader range of activities, including data collection, data cleaning, data transformation, machine learning, statistical analysis, and data visualization.
• Data Analytics (micro): Data analytics is more focused on processing and analyzing structured data using various techniques such as descriptive statistics, data mining, and business intelligence.

Objective
• Data Science: Data science is a multidisciplinary field that aims to extract insights, knowledge, and predictions from complex and unstructured data. It often involves asking open-ended questions and exploring data to discover new patterns and trends.
• Data Analytics: Data analytics focuses on examining historical data to identify trends, draw conclusions, and support decision-making. Its primary goal is to provide answers to specific questions and solve well-defined problems.

Techniques
• Data Science: Data scientists use a wide variety of techniques, including machine learning algorithms, statistical modeling, and deep learning, to develop predictive models and gain a deep understanding of data.
• Data Analytics: Data analysts primarily use descriptive and diagnostic analytics techniques to summarize data, identify trends, and gain insights. While some basic predictive analytics might be involved, the focus is less on building complex predictive models.

Role
• Data Science: Data scientists are responsible for developing complex models, creating algorithms, and designing experiments to solve business problems. They have a strong background in computer science, mathematics, and domain expertise.
• Data Analytics: Data analysts play a key role in generating reports, dashboards, and visualizations to support operational and tactical decisions within an organization. They often work closely with business stakeholders.

Tools
• Data Science: Data scientists use programming languages like Python and R extensively, along with tools like Jupyter notebooks and libraries such as TensorFlow and scikit-learn.
• Data Analytics: Data analysts commonly use tools like Excel, Tableau, Power BI, and SQL for data analysis and reporting. They may not need extensive programming or machine learning expertise.

Output
• Data Science: The primary output of data science includes predictive models, data-driven recommendations, and actionable insights that drive decision-making at a strategic level within an organization.
• Data Analytics: Data analytics produces reports, charts, and dashboards that provide a clear picture of historical performance, enabling businesses to make informed decisions for day-to-day operations and short-term planning.
Supervised and Unsupervised Learning
Objective
• Supervised Learning: In supervised learning, the algorithm learns to map input data to a known target or output variable. The primary goal is to make predictions or classify data based on labeled examples.
• Unsupervised Learning: Unsupervised learning, in contrast, deals with unlabeled data and seeks to discover patterns, structures, or relationships within the data without the guidance of predefined target variables.

Labeled Data
• Supervised Learning: Supervised learning requires a labeled dataset, where each data point has associated target values or class labels. These labels serve as the ground truth for the learning algorithm.
• Unsupervised Learning: Unsupervised learning algorithms work with data that lacks explicit labels or categories. The goal is to find hidden structures or groupings within the data.

Training Process
• Supervised Learning: During training, the algorithm adjusts its model parameters to minimize the difference between its predictions and the true labels. Common supervised learning tasks include regression (predicting a continuous value) and classification (assigning data points to predefined classes).
• Unsupervised Learning: Unsupervised learning has no true labels to compare against. It is often described as closer to true artificial intelligence because it learns much as a child learns everyday routines from experience.

Examples
• Supervised Learning: Some common examples of supervised learning tasks include predicting house prices based on features like square footage and location (regression) or classifying emails as spam or not spam (classification).
• Unsupervised Learning: Clustering customer data to identify distinct customer segments, reducing the dimensionality of image data for feature extraction, and topic modeling for text data are examples of unsupervised learning applications.

Evaluation
• Supervised Learning: Supervised learning models are evaluated based on their ability to accurately predict or classify new, unseen data. Common evaluation metrics include accuracy, precision, recall, and mean squared error, among others.
• Unsupervised Learning: Unsupervised learning models are typically evaluated differently from supervised models. Evaluation often involves measuring the quality of the discovered patterns or structures. However, evaluation can be more subjective and context-dependent in unsupervised learning.
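The supervised evaluation metrics named above can be computed directly from a list of true labels and a list of predictions. The following is a minimal sketch using the standard definitions; the function names and the small spam and house-price datasets are hypothetical and chosen only for illustration.

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Return (accuracy, precision, recall) for a binary classification task."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)  # true positives
    fp = sum(t != positive and p == positive for t, p in pairs)  # false positives
    fn = sum(t == positive and p != positive for t, p in pairs)  # false negatives
    accuracy = sum(t == p for t, p in pairs) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return accuracy, precision, recall

def mean_squared_error(y_true, y_pred):
    """Average squared difference between true and predicted values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

# Hypothetical spam classifier output (1 = spam, 0 = not spam).
labels      = [1, 0, 1, 1, 0, 0, 1, 0]
predictions = [1, 0, 0, 1, 0, 1, 1, 1]
accuracy, precision, recall = classification_metrics(labels, predictions)
print(accuracy, precision, recall)

# Hypothetical house prices (in thousands) for a regression model.
mse = mean_squared_error([200, 250, 300], [210, 240, 330])
print(round(mse, 2))
```

Precision asks how many of the emails flagged as spam really were spam, while recall asks how many of the real spam emails were flagged; the two can differ substantially even when accuracy looks acceptable.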
In summary, the key difference between supervised
and unsupervised learning lies in the presence or
absence of labeled data and the primary objectives.
Supervised learning focuses on making predictions
or classifications based on labeled data, while
unsupervised learning aims to discover hidden
patterns or structures in unlabeled data. Each
approach has its own set of algorithms, techniques,
and applications suited to specific problem
domains.
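To make the contrast concrete, the sketch below fits a line to labeled house-price data (supervised) and groups unlabeled customer spending into two clusters (unsupervised). It assumes least-squares regression and a one-dimensional k-means as representative techniques; the function names and all data values are hypothetical.

```python
def fit_line(xs, ys):
    """Supervised: least-squares fit of y = slope * x + intercept to labeled pairs."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    return slope, mean_y - slope * mean_x

def kmeans_1d(values, k=2, iterations=20):
    """Unsupervised: group unlabeled numbers into k clusters (Lloyd's algorithm)."""
    ordered = sorted(values)
    # Deterministic start: spread the initial centers across the sorted data.
    centers = [ordered[i * (len(ordered) - 1) // max(k - 1, 1)] for i in range(k)]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for v in values:
            # Assign each value to its nearest center.
            nearest = min(range(k), key=lambda i: abs(v - centers[i]))
            clusters[nearest].append(v)
        # Move each center to the mean of its cluster (keep it if the cluster is empty).
        centers = [sum(c) / len(c) if c else centers[i] for i, c in enumerate(clusters)]
    return centers, clusters

# Supervised: each input (size in square meters) comes with a label (price).
sizes = [50, 80, 100, 120]
prices = [150, 240, 300, 360]
slope, intercept = fit_line(sizes, prices)
print("predicted price for size 90:", slope * 90 + intercept)

# Unsupervised: monthly spend per customer, with no labels at all.
spend = [12, 15, 14, 90, 95, 100]
centers, clusters = kmeans_1d(spend)
print(centers, clusters)
```

The regression needs the `prices` labels to learn anything, whereas the clustering step receives only `spend` and still separates low spenders from high spenders.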
Important Questions
• Differentiate between Supervised and Unsupervised learning.
• "What is the primary difference between supervised and unsupervised learning,
and how does it impact the way each approach is used in machine learning
tasks?"
• "What are the fundamental distinctions between data science and data analytics,
and how do these differences impact the roles and responsibilities associated with
each field?"
• Differentiate Between Data Science and data analytics.
• "Can you provide an in-depth discussion of a specific aspect of artificial
intelligence, such as natural language processing, computer vision, reinforcement
learning, or any other area of your expertise or interest?"
• Discuss some aspect of artificial intelligence
• Discuss data science and outline the strategies and techniques that data scientists
employ to extract insights from data.
• What is the concept of data science, and how do data scientists apply various
strategies to analyze and interpret data?
• Explain data science and discuss the different methodologies that data scientists
employ.
