Module2 ML 22 01 2024 WM
Module2 ML 22 01 2024 WM
Savitha Nagaraju
Assistant Professor
AIML Department, ATME College
Machine Learning , Data Science, Data Mining and Data Analytics
Data Science is an “Umbrella” term that encompasses many fields. Machine Learning starts with data.
Data Science and Machine Learning are interlinked. Data Science deals with gathering of data for analysis.
Data
Analytics
Data
Mining
Big Data
Data
Mining
Machine Learning , Data Science, Data Mining and Data Analytics
Big Data
Data Science concerns about collection of data. Big Data is a field of data science that deals with
data’s following characteristics:
Volume: Huge amount of data is generated by big companies
like Facebook, Twitter, You Tube.
Machine Learning , Data Science, Data Mining and Data Analytics
Big Data
Data Science concerns about collection of data. Big Data is a field of data science that deals with
data’s following characteristics:
Variety: Data is available in variety of forms like images,
videos and in different formats.
Machine Learning , Data Science, Data Mining and Data Analytics
Big Data
Data Science concerns about collection of data. Big Data is a field of data science that deals with
data’s following characteristics:
Velocity: It refers to the speed at which the data is generated and processed.
Machine Learning , Data Science, Data Mining and Data Analytics
Data Mining
Data mining aims to extract the hidden patterns that are in the data,
whereas, machine learning aims to use it for prediction.
Machine Learning , Data Science, Data Mining and Data Analytics
Data Analytics
Data Data
Mining Analytics
Machine
Learning
Pattern Big
Recognition Data
Machine Learning , Data Science, Data Mining and Data Analytics
Statistics is a branch of mathematics that has solid theoretical foundation regarding statistical learning.
Like machine learning(ML), it can learn from data.
Machine
Learning
Classification Models
(based on implementation methods)
Classification
Models
Generative Discriminative
models models
Machine Learning , Data Science, Data Mining and Data Analytics
Types of Machine Learning
Generative Models
Deals with process of data generation and its distribution. Probabilistic models are examples of generative models.
Discriminative Models
Discriminative models do not care about the generation of data. Instead, they simply concentrate
on classifying the given data
Machine Learning , Data Science, Data Mining and Data Analytics
Types of Machine Learning
The program is supplied with objects, but no labels are defined. The algorithm
itself observes the patterns based on the principles of grouping. Grouping is done in ways
that similar objects form the same group
The reward can be positive or negative(Punishment). When the rewards are more, the behaviour gets reinforced and
Learning becomes possible.
Goal
Grid Game
Danger
Machine Learning , Data Science, Data Mining and Data Analytics
Challenges of Machine Learning
Input(x1,x2) Output(y)
1,1 1
2,1 2
3,1 3
4,1 4
5,1 5
Machine Learning , Data Science, Data Mining and Data Analytics
Challenges of Machine Learning
• Huge Data
This is primary requirement of machine learning. Availability of a quality data is a challenge.
Input(x1,x2)
A quality data means Output(y)
it should be large and should not have data problems such as missing data or incorrect data.
• High Computation power
With the availability of big data, the computational resource requirement is also increased. Systems with GPU and TPU
Are required to execute machine learning algorithms.
• Complexity of Algorithms
The selection of algorithms, describing the algorithms application of algorithms to solve machine learning task,
and comparison of algorithms have become necessary for machine learning or data scientist now.
Machine Learning , Data Science, Data Mining and Data Analytics
Challenges of Machine Learning
• Bias/Variance
Variance is the error of the model. This leads to bias and variance trade-off.
A model that fitsInput(x1,x2)
the training data correctly butOutput(y)
fails for test data, is called overfitting.
The reverse problem is underfitting, where the model fails for training data but has good generalization.
Machine Learning , Data Science, Data Mining and Data Analytics
Machine Learning Process
Data
preprocessing
Modelling
Model Model
evaluation deployment
Machine Learning , Data Science, Data Mining and Data Analytics
Machine Learning Applications
Machine Learning , Data Science, Data Mining and Data Analytics
Machine Learning Applications