
Machine Learning

Unit - 1
Dr. R.Seeta Sireesha
Associate Professor,
Department of Computer Science and Engineering
GVPCE (A), Madhurawada.
Machine Learning Basics
⮚The Need for Machine Learning.
⮚Understanding Machine Learning,
⮚Computer Science, Data Science, Artificial Intelligence,
⮚Natural Language Processing, Deep Learning,
⮚Machine Learning Methods, Semi-Supervised Learning,
Reinforcement Learning,
⮚Model Based Learning,
⮚The CRISP-DM Process Model,
⮚Building Machine Intelligence, and Real-World Case Study
The Need for Machine Learning
1. Making data-driven decisions:
⮚ Making data-driven decisions means using your data effectively to gain
valuable insights and make improved choices.
⮚ Fields like operations research, statistics, and management information systems
have existed for decades and attempt to bring efficiency to any business or
organization by using data and analytics to make data-driven decisions.
⮚ Solutions to problems that cannot be programmed inherently need a different
approach where we use the data itself to drive decisions instead of using
programmable logic, rules, or code to make these decisions
2. Efficiency and Scale:
⮚ While getting insights and making decisions driven by data are of paramount
importance, it also needs to be done with efficiency and at scale.
⮚ The key idea of using techniques from Machine Learning or artificial
intelligence is to automate processes or tasks by learning specific patterns
from the data.
⮚ We all want computers or machines to tell us “when a stock might rise or fall”,
“whether an image is of a computer or a television”, “whether our product
placement and offers are the best”, “determine shopping price trends” etc.

Scale:
A unit is said to be scale efficient when its size of operations is optimal, so
that any change in its size would render the unit less efficient.
3. Traditional Programming Paradigm:
⮚ Traditional programming paradigms basically require the user or programmer
to write a set of instructions or operations, using code, that makes the computer
perform specific computations on data to give the desired results.
Why Machine Learning?
⚫The traditional programming paradigm is quite good, and human intelligence
and domain expertise are definitely important factors in making data-driven
decisions, but we need Machine Learning to make faster and better decisions.
⚫The Machine Learning paradigm tries to take into account data and expected
outputs or results if any and uses the computer to build the program, which is
also known as a model.
⚫This program or model can then be used in the future to make necessary
decisions and give expected outputs from new inputs.
Understanding Machine Learning
Why Make Machines Learn?
There are several scenarios when it might be beneficial to make machines learn and
some of them are:
1. Lack of sufficient human expertise in a domain (e.g., simulating
navigations in unknown territories or even spatial planets).
2. Scenarios and behavior can keep changing over time (e.g., availability of
infrastructure in an organization, network connectivity, and so on).
3. Humans have sufficient expertise in the domain but it is extremely
difficult to formally explain or translate this expertise into computational tasks (e.g.,
speech recognition, translation, scene recognition, cognitive tasks, and so on).
4. Addressing domain specific problems at scale with huge volumes of data
with too many complex conditions and constraints.
Formal Definition
“A computer program is said to learn from experience E with respect to some
class of tasks T and performance measure P, if its performance at tasks in T, as
measured by P, improves with experience E.”
Defining the Task, T
⚫Classification or categorization: A simple example would be classifying
animal images into dogs, cats, and zebras.
⚫Regression: stock predictions, House price predictions
⚫Translation: Translating from one language to another
⚫Clustering or grouping: Examples would be grouping similar products,
events and entities.
⚫Transcriptions: Examples include speech to text, optical character
recognition, images to text, and so on
Defining the Experience, E
⚫The process of consuming a dataset that consists of data samples or data
points such that a learning algorithm or model learns inherent patterns is
defined as the experience, E which is gained by the learning algorithm.
⚫Any experience that the algorithm gains is from data samples or data points
and this can be at any point of time.
⚫You can feed it data samples in one go using historical data or even supply
fresh data samples whenever they are acquired.
Defining the Performance, P
⚫The performance, P, is usually a quantitative measure or metric that’s used to
see how well the algorithm or model is performing the task, T, with
experience, E.
⚫Typical performance measures include accuracy, precision, recall, F1 score,
sensitivity, specificity, error rate, misclassification rate, and many more.
⚫Performance measures are usually evaluated on training data samples as well
as on data samples the model has not seen or learned from before, which are usually
known as validation and test data samples (a small sketch of computing such measures follows).
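For illustration, here is a minimal sketch (assuming scikit-learn is available; the label vectors are made up) of computing a few of these measures on held-out test samples:

```python
# Minimal sketch: computing common performance measures, P, with scikit-learn.
# The label vectors below are made up purely for illustration.
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

y_true = [1, 0, 1, 1, 0, 1, 0, 0]  # actual labels of unseen test samples
y_pred = [1, 0, 1, 0, 0, 1, 1, 0]  # labels predicted by the trained model

print("Accuracy :", accuracy_score(y_true, y_pred))
print("Precision:", precision_score(y_true, y_pred))
print("Recall   :", recall_score(y_true, y_pred))
print("F1 score :", f1_score(y_true, y_pred))
```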
A Multi-Disciplinary Field
Computer Science
⚫The field of computer science (CS) can be defined as the study of the
science of understanding computers.
⚫This involves study, research, development, engineering, and
experimentation of areas dealing with understanding, designing, building,
and using computers.
⚫This also involves extensive design and development of algorithms and
programs that can be used to make the computer perform computations and
tasks as desired.
⚫There are mainly two major areas or fields under computer science, as
follows
⚫Theoretical computer science
⚫Applied or practical computer science

Theoretical computer science:


⚫Theoretical computer science is the study of theory and logic that tries to
explain the principles and processes behind computation.
⚫This involves understanding the theory of computation which talks about how
computation can be used efficiently to solve problems.
⚫Theory of computation includes the study of formal languages, automata, and
understanding complexities involved in computations and algorithms
⚫data structures and algorithms are the two fundamental pillars of theoretical
CS used extensively in computational programs and functions.
Applied or practical computer science:
⚫Practical computer science also known as applied computer science is more
about tools, methodologies, and processes that deal with applying concepts and
principles from computer science in the real world to solve practical day-to-
day problems.
⚫This includes emerging sub-fields like artificial intelligence, Machine
Learning, computer vision, Deep Learning, natural language processing, data
mining, and robotics and they try to solve complex real-world problems based
on multiple constraints and parameters and try to emulate tasks that require
considerable human intelligence and experience
⚫Besides these, we also have well established fields, including computer
architecture, operating systems, digital logic and design, distributed computing,
computer networks, security, databases, and software engineering.
Data Science
⚫Data Science basically deals with principles, methodologies, processes,
tools, and techniques to gather knowledge or information from data
Data Science
Basically there are three major components and Data Science sits at the
intersection of them.
⚫Math and statistics knowledge is all about applying various computational
and quantitative math and statistical based techniques to extract insights from
data.
⚫Hacking skills basically indicate the capability of handling, processing,
manipulating and wrangling data into easy to understand and analyzable
formats.
⚫Substantive expertise is basically the actual real-world domain expertise
which is extremely important when you are solving a problem because you
need to know about various factors, attributes, constraints, and knowledge
related to the domain besides your expertise in data and algorithms.
Artificial Intelligence
⚫Artificial Intelligence is composed of two words, “Artificial” and
“Intelligence”, where Artificial means “man-made” and Intelligence
means “thinking power”; hence AI means “a man-made thinking power”.

Definition:
"It is a branch of computer science by which we can create intelligent
machines which can behave like a human, think like humans, and able to make
decisions”.
Machine Learning Applications

● Sales forecasting for different products


● Fraud analysis in banking
● Product recommendations
● Stock price prediction
Deep Learning Applications

● Cancer tumor detection


● Captionbot for captioning an image
● Music generation
● Image coloring
● Object detection
Artificial Intelligence (AI) vs Machine Learning (ML) vs Deep Learning (DL)

• Definition: AI simulates human intelligence to perform tasks and make decisions. ML is a subset of AI that uses algorithms to learn patterns from data. DL is a subset of ML that employs artificial neural networks for complex tasks.
• Data requirements: AI may or may not require large datasets; it can use predefined rules. ML heavily relies on labeled data for training and making predictions. DL requires extensive labeled data and performs exceptionally with big datasets.
• Human intervention: AI can be rule-based, requiring human programming and intervention. ML automates learning from data and requires less manual intervention. DL automates feature extraction, reducing the need for manual engineering.
• Scope of tasks: AI can handle various tasks, from simple to complex, across domains. ML specializes in data-driven tasks like classification, regression, etc. DL excels at complex tasks like image recognition, natural language processing, and more.
• Algorithms: AI algorithms can be simple or complex, depending on the application. ML employs various algorithms like decision trees, SVM, and random forests. DL relies on deep neural networks, which can have numerous hidden layers for complex learning.
• Training time and resources: AI may require less training time and resources for rule-based systems. ML training time varies with the algorithm complexity and dataset size. DL training demands substantial computational resources and time for deep networks.
• Interpretability: AI systems may offer interpretable results based on human rules. ML models can be interpretable or less interpretable based on the algorithm. DL models are often considered less interpretable due to complex network architectures.
• Applications: AI is used in virtual assistants, recommendation systems, and more. ML is applied in image recognition, spam filtering, and other data tasks. DL is utilized in autonomous vehicles, speech recognition, and advanced AI applications.
Importance of AI?
There are a few main reasons to learn AI:
• With the help of AI, we can create software that can solve real-world
problems, such as health issues, marketing, and traffic issues, easily and
with accuracy.
• We can also create our personal virtual Assistant, such as Cortana, Google
Assistant, Siri, etc.
• We can build Robots which can work in an environment where survival of
humans can be at risk.
Goals of AI:
1. Replicate human intelligence
2. Solve Knowledge-intensive tasks
3. An intelligent connection of perception and action
4. Building a machine which can perform tasks that require human
intelligence, such as:
1. Proving a theorem
2. Playing chess
3. Plan some surgical operation
4. Driving a car in traffic
5. Creating some system which can exhibit intelligent behavior, learn new
things by itself, demonstrate, explain, and advise its user.
What Comprises Artificial Intelligence?
⚫To create AI, we should first know how intelligence is composed. Intelligence
is an intangible faculty of our brain which is a combination of reasoning,
learning, problem-solving, language, understanding, etc.
⚫Achieving the above factors for an AI machine requires the following
disciplines:
Advantages of AI:
⚫High Accuracy with less errors
⚫High speed and reliability
⚫Useful for risky areas
⚫Digital Assistant
⚫Useful as public utility
Disadvantages of AI:
⚫High cost
⚫Can’t think out of box
⚫No feelings and emotions
⚫Increase dependency on machines
⚫No original creativity
Types of AI:
⚫Artificial Intelligence can be divided into two types, based on capabilities
and functionalities.
AI type 1: Based on Capability
1. Weak or Narrow AI:
-> only trained for one specific task
-> It can fail in unpredictable ways if it goes beyond its limits.
Example: Chatbots, Apple Siri
2. General AI:
-> could perform any intellectual task with efficiency like a human.
-> Researchers worldwide are now focused on developing
machines with General AI.
Example: Robot
AI type 1: Based on Capability
3. Super AI:
-> Machines can perform any task better than humans, with cognitive
properties.
Example: Terminator movie.
AI type 2: Based on functionality
1. Reactive Machines:
⚫Reactive machines do not store memories or past experiences for future
actions.
⚫They focus only on the current scenario and react to it with the best possible action.
⚫Example: IBM’s Deep Blue, AlphaGo

2. Limited Memory:
⚫Limited memory machines can store past experiences or some data for a short
period of time.
⚫Example: Self-driving cars. These cars can store recent speed of nearby cars,
the distance of other cars, speed limit, and other information to navigate the
road
AI type 2: Based on functionality
3. Theory of Mind:
⚫Should understand human emotions, people, and beliefs, and be able to interact
socially like humans.
⚫This type of AI machine has not been developed yet, but researchers are making lots
of efforts.

4. Self – Awareness:
⚫These machines will be super intelligent, and will have their own consciousness,
sentiments, and self-awareness
⚫Self-aware AI does not yet exist in reality.
Natural Language Processing
⚫Natural language processing (NLP) is the ability of a computer program to
understand human language as it is spoken and written -- referred to as
natural language. It is a component of artificial intelligence
How does natural language processing work?
⚫NLP enables computers to understand natural language as humans do.
⚫Whether the language is spoken or written, natural language processing
uses artificial intelligence to take real-world input, process it, and make
sense of it in a way a computer can understand.
⚫Just as humans have different sensors -- such as ears to hear and eyes to
see -- computers have programs to read and microphones to collect audio
Natural Language Processing
There are two main phases to NLP
1. Data preprocessing: Data preprocessing involves preparing and
"cleaning" text data so that machines are able to analyze it. There are
several ways this can be done (see the sketch after this list), including:
-> Tokenization: text is broken down into smaller units
-> Stop word removal: common words are removed from the text
-> Part-of-speech tagging: words are marked based on their part of
speech
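As a rough sketch of these preprocessing steps, here is an illustrative example using the NLTK library (the library choice and sample sentence are assumptions, not part of the original material; the listed NLTK corpora must be downloaded once):

```python
# Minimal sketch of NLP data preprocessing with NLTK: tokenization,
# stop word removal, and part-of-speech tagging.
# One-time downloads: nltk.download("punkt"), nltk.download("stopwords"),
# nltk.download("averaged_perceptron_tagger")
import nltk
from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize

text = "Natural language processing helps computers understand human language."

tokens = word_tokenize(text)                                    # tokenization
stop_words = set(stopwords.words("english"))
filtered = [t for t in tokens if t.lower() not in stop_words]   # stop word removal
tagged = nltk.pos_tag(filtered)                                 # part-of-speech tagging

print(tokens)
print(filtered)
print(tagged)
```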
Natural Language Processing
2. Algorithm Development:
-> Rules-based system
-> Machine learning-based system
Applications of NLP
Some of the major applications of NLP are mentioned as follows.
• Machine translation
• Speech recognition
• Question answering systems
• Context recognition and resolution
• Text summarization
• Text categorization
• Information extraction
• Sentiment and emotion analysis
• Topic segmentation
Parser in NLP
Deep Learning
⚫Deep learning is a subdomain of machine
learning.
⚫With accelerated computational power and
large data sets, deep learning algorithms are able
to self-learn hidden patterns within data to make
predictions.
⚫Deep learning architecture contains a
computational unit, called a perceptron, that
allows modeling of nonlinear functions.
Deep learning and human brain
⚫Just as a "neuron" in the human brain transmits electrical pulses
throughout our nervous system, the perceptron receives a list of input signals
and transforms them into output signals.
⚫The perceptron aims to understand data representation by stacking together
many layers, where each layer is responsible for understanding some part of
the input.
⚫A network of these perceptrons mimics how neurons in the brain form a
network, so the architecture is called a neural network (or artificial neural
network).
Purpose of DL
Features of Deep Learning algorithms
The following points describe the salient features of most Deep Learning algorithms
⚫ Hierarchical layered representation of concepts. These concepts are also called features in Machine
Learning terminology (data attributes).
⚫ Distributed representational learning of the data happens through a multi-layered architecture
(unsupervised learning).
⚫ More complex and high-level features and concepts are derived from simpler, low-level features.
⚫ A “deep” neural network is usually considered to have more than one hidden layer besides the
input and output layers; usually it consists of a minimum of three to four hidden layers.
⚫ Deep architectures have a multi-layered architecture where each layer consists of multiple non-linear
processing units. Each layer’s input is the previous layer in the architecture. The first layer is usually
the input and the last layer is the output.
⚫ Can perform automated feature extraction, classification, anomaly detection, and many other Machine
Learning tasks.
DL considers high-level features; high-level features increase accuracy in prediction.
What are Neural Networks?
⚫A neural network is a system modeled on the human brain, consisting of an
input layer, multiple hidden layers, and an output layer.
⚫Data is fed as input to the neurons. The information is transferred to the next
layer using appropriate weights and biases.
⚫The output is the final value predicted by the artificial neuron.
What are Neural Networks?
Each neuron in a neural network performs the following operations:
• The product of each input and the weight of the channel it is passed over is
found
• The sum of the weighted products is computed, which is called the weighted
sum
• A bias value of the neuron is added to the weighted sum
• The final sum is then subjected to a particular function known as the activation
function (a minimal sketch of these operations follows).
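A minimal sketch of these neuron operations, assuming NumPy and a sigmoid activation with made-up inputs, weights, and bias:

```python
# Minimal sketch of a single artificial neuron's forward pass using NumPy.
# Inputs, weights, and bias values here are arbitrary illustrations.
import numpy as np

def sigmoid(z):
    # activation function: squashes the weighted sum into the range (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

x = np.array([0.5, 0.3, 0.2])   # inputs
w = np.array([0.4, 0.7, -0.2])  # weights of the input channels
b = 0.1                         # bias of the neuron

weighted_sum = np.dot(x, w) + b   # weighted sum plus bias
output = sigmoid(weighted_sum)    # activation function applied
print(output)
```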
Activation function
⚫The activation function decides whether a neuron should be activated or not by
calculating the weighted sum and further adding the bias to it.
⚫In a neural network, we would update the weights and biases of the neurons
on the basis of the error at the output. This process is known as back-
propagation.
⚫Activation functions make the back-propagation possible since the gradients
are supplied along with the error to update the weights and biases.
Back Propagation
⚫1. Propagation
⚫a. The input data sample vectors are propagated forward through the neural
network to generate the output values from the output layer.
⚫b. Compare the generated output vector with the actual/desired output vector for
that input data vector.
⚫c. Compute difference in error at the output units.
⚫d. Backpropagate error values to generate deltas at each node/neuron.
⚫ 2. Weight Update
⚫a. Compute weight gradients by multiplying the output delta (error) and input
activation.
⚫b. Use the learning rate to determine the fraction of the gradient to be subtracted
from the original weight and update the weights of the nodes (a minimal sketch of one such cycle follows).
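A minimal sketch of one propagation and weight-update cycle for a single sigmoid neuron, assuming a squared-error loss and made-up numbers:

```python
# Minimal sketch of one backpropagation cycle for a single sigmoid neuron,
# assuming a squared-error loss. All numbers are illustrative.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

x = np.array([0.5, 0.3, 0.2])   # input vector
w = np.array([0.4, 0.7, -0.2])  # current weights
b = 0.1                         # current bias
target = 1.0                    # desired output
lr = 0.5                        # learning rate

# 1. Propagation: forward pass and error at the output
out = sigmoid(np.dot(x, w) + b)
error = out - target

# delta combines the error with the derivative of the activation
delta = error * out * (1.0 - out)

# 2. Weight update: gradient = delta * input activation
w -= lr * delta * x
b -= lr * delta
print(out, w, b)
```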
Machine Learning Methods
⚫Machine learning methods are classified under some categories such
as
1. Methods based on the amount of human
supervision in the learning process
1. Supervised Learning
⚫Supervised learning is used to identify the relationship between the input
and output variables and then use it to map new unlabelled data.
⚫Types:
1. Classification: used to predict a categorical or nominal variable
Algorithms: KNN, Logistic Regression, SVM, Decision Trees, Random
Forest, Naïve Bayes etc.
2. Regression: used to predict real – valued or continuous variable.
Algorithms: Linear regression, SVR, Decision Trees, Random Forest etc.
Classification:
⚫Classification is a task that requires the use of machine learning algorithms
that learn how to assign a class label to examples from the problem domain. An
easy to understand example is classifying emails as “spam” or “not spam.”
⚫In this classification we have many other types some of them are:
⚫Binary classification
⚫Multi – class classification
Supervised learning: binary classification for weather
prediction
Binary classification
⚫In machine learning, binary classification is a supervised learning algorithm that
categorizes new observations into one of two classes.
⚫The following are a few binary classification applications, where the 0 and 1
columns are two possible classes for each observation:

⚫Popular algorithms that can be used for binary classification include:


⚫Logistic Regression, k-Nearest Neighbors, Decision Trees, Support Vector Machine, Naive
Bayes
Evaluation of binary classifiers
⚫Consider a classifier that diagnoses a disease. If the model correctly predicts a diseased
patient as positive, this case is called a True Positive (TP).
⚫If the model correctly predicts a healthy patient as negative, this is called a True Negative
(TN).
⚫The binary classifier may misdiagnose some patients as well. If a diseased patient is
classified as healthy by a negative test result, this error is called False Negative (FN).
⚫Similarly, If a healthy patient is classified as diseased by a positive test result, this
error is called False Positive(FP).
⚫After obtaining these values, we can compute the accuracy score of the binary
classifier as follows: Accuracy = (TP + TN) / (TP + TN + FP + FN).
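A minimal sketch (assuming scikit-learn and made-up diagnosis labels) of deriving TP, TN, FP, FN and the accuracy score:

```python
# Minimal sketch: deriving TP, TN, FP, FN and the accuracy score with scikit-learn.
# The diagnosis labels below are made up (1 = diseased, 0 = healthy).
from sklearn.metrics import confusion_matrix, accuracy_score

y_true = [1, 1, 0, 1, 0, 0, 1, 0]  # actual patient condition
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]  # test / model result

tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print("TP:", tp, "TN:", tn, "FP:", fp, "FN:", fn)

accuracy = (tp + tn) / (tp + tn + fp + fn)
print("Accuracy:", accuracy, "==", accuracy_score(y_true, y_pred))
```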
Multi-class classification
⚫Multi-class classification is the task of classifying elements into one of three or more classes.

Examples include:
• Face classification, Plant species classification, Optical character recognition.
⚫Popular algorithms that can be used for multi-class classification include:
k-Nearest Neighbors, Decision Trees, Naive Bayes, Random Forest,
Gradient Boosting.
Real Life examples for supervised ML
• Text categorization
• Face Detection
• Signature recognition
• Customer discovery
• Spam detection
• Weather forecasting
• Predicting housing prices based on the prevailing market price
• Stock price predictions, among others
Regression
⚫Regression is a supervised learning algorithm which helps in finding the
correlation between variables and enables us to predict the continuous
output variable based on one or more predictor variables.
⚫It is mainly used for prediction, forecasting, time series modeling, and
determining the causal-effect relationship between variables.
⚫Some examples are:
Prediction of rain using temperature and other factors,
Determining Market trends,
Prediction of road accidents due to rash driving.
Types of Linear Regression
Linear regression can be further divided into two types of the algorithm:
• Simple Linear Regression:
If a single independent variable is used to predict the value of a
numerical dependent variable, then such a Linear Regression algorithm is
called Simple Linear Regression.
• Multiple Linear regression:
If more than one independent variable is used to predict the value of a
numerical dependent variable, then such a Linear Regression algorithm is
called Multiple Linear Regression.
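A minimal sketch of simple and multiple linear regression with scikit-learn, using tiny made-up housing-style data:

```python
# Minimal sketch of simple and multiple linear regression with scikit-learn,
# using small made-up data purely for illustration.
import numpy as np
from sklearn.linear_model import LinearRegression

# Simple linear regression: one independent variable (area) -> price
area = np.array([[500], [750], [1000], [1250], [1500]])
price = np.array([50, 72, 100, 123, 150])
simple = LinearRegression().fit(area, price)
print(simple.predict([[1100]]))

# Multiple linear regression: two independent variables (area, bedrooms) -> price
X = np.array([[500, 1], [750, 2], [1000, 2], [1250, 3], [1500, 3]])
multiple = LinearRegression().fit(X, price)
print(multiple.predict([[1100, 2]]))
```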
Linear Regression
2. Unsupervised Learning
⚫In unsupervised learning we only have input data to feed to the model, with no corresponding
output data. There are 2 categories:
1. Clustering: group or organize similar objects together.
Algorithms: K-means, DBSCAN, Mean-shift algorithm, etc.
2. Association: the association between data elements is identified.
Algorithms: Apriori and FP-growth
Clustering
⚫Clustering is a way of grouping data points into different clusters
consisting of similar data points.
⚫Objects with possible similarities remain in a group that has few or
no similarities with another group.
Clustering cont..
There are various types of clustering methods that can be classified under the
following major approaches.
⚫ Centroid based methods such as K-means and K-medoids
⚫ Hierarchical clustering methods such as agglomerative and divisive
(Ward’s, affinity propagation)
⚫ Distribution based clustering methods such as Gaussian mixture models
⚫ Density based methods such as DBSCAN and OPTICS.
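A minimal sketch of centroid-based clustering (K-means) with scikit-learn on a few made-up 2-D points:

```python
# Minimal sketch of centroid-based clustering (K-means) with scikit-learn
# on a few made-up 2-D points.
import numpy as np
from sklearn.cluster import KMeans

X = np.array([[1, 2], [1, 4], [1, 0],
              [10, 2], [10, 4], [10, 0]])

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(kmeans.labels_)                     # cluster assigned to each point
print(kmeans.cluster_centers_)            # learned centroids
print(kmeans.predict([[0, 0], [12, 3]]))  # clusters for new points
```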
Dimensionality Reduction
Dimensionality reduction techniques can be classified in two major
approaches as follows.
• Feature Selection methods: Specific features are selected for each data
sample from the original list of features and other features are discarded. No
new features are generated in this process.
• Feature Extraction methods: We engineer or extract new features from the
original list of features in the data. Thus the reduced subset of features will
contain newly generated features that were not part of the original feature set.
PCA falls under this category.
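A minimal sketch of feature extraction with PCA in scikit-learn, reducing four original features to two derived features (random data used purely for illustration):

```python
# Minimal sketch of feature extraction via PCA with scikit-learn:
# four original features are reduced to two new, derived features.
import numpy as np
from sklearn.decomposition import PCA

X = np.random.RandomState(0).rand(10, 4)  # 10 samples, 4 original features

pca = PCA(n_components=2)
X_reduced = pca.fit_transform(X)

print(X_reduced.shape)                 # (10, 2)
print(pca.explained_variance_ratio_)   # variance captured by each new feature
```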
Anomaly Detection
Anomaly detection based methods are extremely popular in real-world
scenarios like detection of security attacks or breaches, credit card fraud,
manufacturing anomalies, network issues etc.
Association – Market Basket Analysis
⚫Market basket analysis is a data mining technique used by retailers to increase
sales by better understanding customer purchasing patterns. It involves analyzing
large data sets, such as purchase history, to reveal product groupings and products
that are likely to be purchased together.
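A minimal sketch of market basket analysis with the Apriori algorithm, assuming the mlxtend library and made-up transactions (library choice and thresholds are illustrative assumptions):

```python
# Minimal sketch of market basket analysis: frequent itemsets and association
# rules with the Apriori algorithm from mlxtend, on made-up transactions.
import pandas as pd
from mlxtend.preprocessing import TransactionEncoder
from mlxtend.frequent_patterns import apriori, association_rules

transactions = [["bread", "milk"],
                ["bread", "diapers", "beer"],
                ["milk", "diapers", "beer"],
                ["bread", "milk", "diapers", "beer"],
                ["bread", "milk", "diapers"]]

# one-hot encode the transactions into a boolean DataFrame
te = TransactionEncoder()
onehot = pd.DataFrame(te.fit(transactions).transform(transactions),
                      columns=te.columns_)

frequent_itemsets = apriori(onehot, min_support=0.4, use_colnames=True)
rules = association_rules(frequent_itemsets, metric="confidence", min_threshold=0.7)
print(rules[["antecedents", "consequents", "support", "confidence"]])
```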
Examples for market basket analysis
Real Life examples for Unsupervised ML
• Audience segmentation.
• Customer personality investigation.
• Anomaly detection (for example, to detect bot activity)
• Pattern recognition (grouping images, transcribing audio)
• Inventory management (by conversion activity or by availability)
3. Semi supervised learning
⚫The most basic disadvantage of any Supervised Learning algorithm is that
the dataset has to be hand-labeled either by a Machine Learning Engineer or a
Data Scientist.
⚫This is a very costly process, especially when dealing with large volumes of
data. The most basic disadvantage of any Unsupervised Learning is that
its application spectrum is limited.
⚫To counter these disadvantages, the concept of Semi-Supervised
Learning was introduced. In this type of learning, the algorithm is trained
upon a combination of labeled and unlabeled data.
3. Semi supervised learning (cont..)
⚫Typically, this combination will contain a very small amount of labeled data
and a very large amount of unlabeled data.
⚫The basic procedure involved is that first, the programmer will cluster
similar data using an unsupervised learning algorithm and then use the
existing labeled data to label the rest of the unlabeled data.
⚫The typical use cases of this type of algorithm have a common property
among them: the acquisition of unlabeled data is relatively cheap while
labeling that data is very expensive (a minimal sketch follows).
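A minimal sketch of semi-supervised learning, assuming scikit-learn's SelfTrainingClassifier, where unlabeled samples are marked with -1 (the data and base estimator are illustrative assumptions):

```python
# Minimal sketch of semi-supervised learning with scikit-learn's
# SelfTrainingClassifier: unlabeled samples are marked with -1.
import numpy as np
from sklearn.semi_supervised import SelfTrainingClassifier
from sklearn.svm import SVC

X = np.array([[1.0], [1.2], [0.9], [8.0], [8.3], [7.9], [1.1], [8.1]])
y = np.array([0,     0,     -1,    1,     -1,    1,     -1,    -1])  # -1 = unlabeled

base = SVC(probability=True, gamma="auto")      # base estimator must give probabilities
model = SelfTrainingClassifier(base).fit(X, y)  # pseudo-labels the unlabeled points

print(model.predict([[1.05], [8.2]]))  # labels inferred for new samples
```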
3. Semi supervised learning (cont..)
Intuitively, one may imagine the three types of learning algorithms as:
⚫Supervised learning where a student is under the supervision of a teacher at
both home and school,
⚫Unsupervised learning where a student has to figure out a concept himself
and
⚫Semi-Supervised learning where a teacher teaches a few concepts in class
and gives questions as homework which are based on similar concepts.
4. Reinforcement Learning
• Reinforcement Learning is a feedback-based Machine learning technique in
which an agent learns to behave in an environment by performing the
actions and seeing the results of actions. For each good action, the agent
gets positive feedback, and for each bad action, the agent gets negative
feedback or penalty.
⚫Algorithms: Q-learning, SARSA
Terms used in Reinforcement Learning
• Agent(): An entity that can perceive/explore the environment and act upon it.
• Environment(): A situation in which an agent is present or surrounded by. In RL,
we assume the stochastic environment, which means it is random in nature.
• Action(): Actions are the moves taken by an agent within the environment.
• State(): State is a situation returned by the environment after each action taken by
the agent.
• Reward(): A feedback returned to the agent from the environment to evaluate the
action of the agent.
• Policy(): Policy is a strategy applied by the agent for the next action based on the
current state.
• Value(): It is the expected long-term return with the discount factor, as opposed to
the short-term reward.
Steps of Reinforcement Learning
The main steps of a reinforcement learning method are mentioned as follows.
⚫ 1. Prepare agent with set of initial policies and strategy
⚫ 2. Observe environment and current state
⚫ 3. Select optimal policy and perform action
⚫ 4. Get corresponding reward (or penalty)
⚫ 5. Update policies if needed
⚫ 6. Repeat Steps 2 - 5 iteratively until the agent learns the optimal
policies
Reinforcement Learning Example
How does Reinforcement Learning Work?
⚫To understand the working process of the RL, we need to consider two main
things:
• Environment: It can be anything such as a room, maze, football ground, etc.
• Agent: An intelligent agent such as AI robot.
⚫Let's take an example of a maze environment that the agent needs to explore.
Consider an image:
⚫The agent is at the very first block of the maze. The maze consists of
an S6 block, which is a wall, an S8 block, which is a fire pit, and an S4 block, which is a diamond.
⚫The agent cannot cross the S6 block, as it is a solid wall. If the agent
reaches the S4 block, it gets the +1 reward; if it reaches the fire pit, it
gets a -1 reward point. It can take four actions: move up, move down,
move left, and move right.
⚫The agent can take any path to reach the final point, but it needs to
do so in as few steps as possible. Suppose the agent takes the path S9-
S5-S1-S2-S3; it will then get the +1 reward point.
⚫The agent will try to remember the preceding steps that it has taken to
reach the final step. To memorize the steps, it assigns a value of 1 to each
previous step. Consider the below step:
⚫Now, the agent has successfully stored the previous steps by assigning the value 1
to each previous block.
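A minimal tabular Q-learning sketch on a tiny one-dimensional grid world (a simplified stand-in for the maze above, not the exact S1-S9 layout; all parameters are illustrative assumptions):

```python
# Minimal tabular Q-learning sketch on a tiny 1-D grid world: state 0 is a
# fire pit (-1 reward), state 4 is the diamond (+1 reward), the agent starts
# in the middle. Learning rate, discount, and exploration rate are arbitrary.
import random

n_states, actions = 5, [-1, +1]        # actions: move left / move right
Q = {(s, a): 0.0 for s in range(n_states) for a in actions}
alpha, gamma, epsilon = 0.5, 0.9, 0.2  # learning rate, discount, exploration

def step(state, action):
    nxt = max(0, min(n_states - 1, state + action))
    if nxt == 0:  return nxt, -1.0, True   # fire pit: penalty, episode ends
    if nxt == 4:  return nxt, +1.0, True   # diamond: reward, episode ends
    return nxt, 0.0, False

for episode in range(500):
    state, done = 2, False
    while not done:
        # epsilon-greedy action selection (explore vs exploit current policy)
        if random.random() < epsilon:
            action = random.choice(actions)
        else:
            action = max(actions, key=lambda a: Q[(state, a)])
        nxt, reward, done = step(state, action)
        # Q-learning update rule
        best_next = max(Q[(nxt, a)] for a in actions)
        Q[(state, action)] += alpha * (reward + gamma * best_next - Q[(state, action)])
        state = nxt

# learned policy for the non-terminal states (expected: always move right, +1)
print({s: max(actions, key=lambda a: Q[(s, a)]) for s in range(1, 4)})
```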
Applications of Reinforcement Learning
• Robotics for industrial automation.
• Business strategy planning
• Machine learning and data processing
• It helps you to create training systems that provide custom instruction and
materials according to the requirement of students.
• Aircraft control and robot motion control
Examples in Reinforcement Learning
⚫Personalized product recommendation system: Personalize / customize
what products need to be shown to individual users to realize maximum sale;
This would be something ecommerce portals would love to implement to
realize maximum click-through rates on any given product and related sales, on
any given day
⚫Customized action in video games based on reinforcement learning; AI
agents use reinforcement learning to coordinate actions and react appropriately
to new situations through a series of rewards
Examples in Reinforcement Learning
⚫RL in healthcare can be used to recommend different treatment options.
While supervised learning models can be used to predict whether a person is
suffering from a disease or not, RL can be used to predict treatment options
given a person is suffering from a particular disease.
⚫RL can be used for NLP use cases such as text summarization, question &
answers, machine translation.
⚫AI-powered stock buying/selling: While supervised learning algorithms can
be used to predict the stock prices, it is reinforcement learning which can be
used to decide whether to buy, sell or hold the stock at a given predicted price.
2. Methods based on the ability to learn from incremental
data samples
1. Batch or Offline learning:
⚫Offline learning refers to situations where the program is not operating and
taking in new information in real-time
⚫So the model doesn’t keep learning over a period of time continuously with
the new data. Once the training is complete the model stops learning.
⚫We can always train the model on new data but then we would have to add
new data samples along with the older historical training data and again re-
build the model using this new batch of data.
⚫Used in applications where data patterns remain constant and don’t have
sudden concept drifts (e.g., Netflix recommendation system)
2. Online Learning:
⚫Online learning is ideal for machine learning systems that receive
data as a continuous flow and need to be able to adapt to rapidly
changing conditions.
⚫More computational power is required because of the continuous
feed of data that leads to continuous refinement.
⚫Harder to implement and control because the production model
changes in real-time according to its data feed.
⚫Used in applications where new data patterns are constantly
required (e.g., weather prediction tools)
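A minimal sketch of online learning with scikit-learn's SGDClassifier, which supports incremental updates via partial_fit (the streaming batches here are simulated with random data):

```python
# Minimal sketch of online (incremental) learning with scikit-learn's
# SGDClassifier: the model is updated batch by batch via partial_fit,
# without retraining on the full historical dataset.
import numpy as np
from sklearn.linear_model import SGDClassifier

model = SGDClassifier()
classes = np.array([0, 1])  # all classes must be declared on the first call

rng = np.random.RandomState(0)
for _ in range(10):                       # each iteration simulates a new batch
    X_batch = rng.rand(20, 3)
    y_batch = (X_batch[:, 0] > 0.5).astype(int)
    model.partial_fit(X_batch, y_batch, classes=classes)

print(model.predict(rng.rand(3, 3)))      # predictions from the continuously updated model
```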
3. Methods based on their approach to generalization
from data samples
1. Instance Based Learning:
⚫Instance-based learning systems learn the training
examples by heart and then generalize to new instances based on
some similarity measure.
⚫It is also known as memory-based learning or lazy-learning.
⚫Example: K-Nearest Neighbours.
⚫Some of the instance-based learning algorithms are :
1. K Nearest Neighbor (KNN)
2. Self-Organizing Map (SOM)
3. Learning Vector Quantization (LVQ)
4. Locally Weighted Learning (LWL)
2. Model Based Learning:
⚫The model based learning methods are a more traditional ML
approach toward generalizing based on training data.
⚫Typically an iterative process takes place where the input data is
used to extract features and models are built based on various
model parameters (known as hyperparameters).
⚫These hyperparameters are optimized based on various model
validation techniques to select the model that generalizes best on
the training data and some amount of validation and test data (split
from the initial dataset).
⚫ Finally, the best model is used to make predictions or decisions as
and when needed.
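A minimal sketch of model based learning with scikit-learn: a parametric model is fit, its hyperparameters are tuned by cross-validation, and the best model is evaluated on held-out test data (the dataset and parameter grid are illustrative choices):

```python
# Minimal sketch of model based learning: fit a parametric model, tune its
# hyperparameters by cross-validation, and evaluate on held-out test data.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

# candidate hyperparameter values to optimize
param_grid = {"C": [0.1, 1, 10], "kernel": ["linear", "rbf"]}
search = GridSearchCV(SVC(), param_grid, cv=5).fit(X_train, y_train)

print(search.best_params_)           # selected hyperparameters
print(search.score(X_test, y_test))  # generalization on unseen test data
```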
Model vs Instance based learning
• Data preparation: both model based and instance based learning prepare the data for model training.
• Training: model based learning trains a model from the training data to estimate model parameters, i.e., discover patterns; instance based learning does not train a model.
• Storing the model: model based learning stores the model in a suitable form; in instance based learning there is no model to store.
• Retaining training data: model based learning can throw away the input/training data after model training; instance based learning must keep the input/training data, since each query uses part or the full set of training observations.
• Storage: storing models generally requires less storage; storing training data generally requires more storage.
CRISP-DM Process Model

The CRoss Industry Standard Process for Data Mining (CRISP-DM) is a process model that serves as the base for a data science process.
CRISP-DM Process Model
⚫It has six sequential phases:
1.Business understanding – What does the business need?
2.Data understanding – What data do we have / need? Is it clean?
3.Data preparation – How do we organize the data for modeling?
4.Modeling – What modeling techniques should we apply?
5.Evaluation – Which model best meets the business objectives?
6.Deployment – How do stakeholders access the results?
Business Understanding
⚫The Business Understanding phase focuses on understanding the
objectives and requirements of the project.
1. Determine business objectives: You should first “thoroughly
understand, from a business perspective, what the customer really wants
to accomplish.”
2. Assess situation: Determine resources availability, project requirements,
assess risks and conduct a cost-benefit analysis.
3. Determine data mining goals: In addition to defining the business
objectives, you should also define what success looks like from a
technical data mining perspective.
4. Produce project plan: Select technologies and tools and define detailed
plans for each project phase
Data understanding
⚫It drives the focus to identify, collect, and analyze the data sets that can help
you accomplish the project goals.
1. Collect initial data: Acquire the necessary data and (if necessary) load it into
your analysis tool.
2. Describe data: Examine the data and document its surface properties like data
format, number of records, or field identities.
3. Explore data: Dig deeper into the data. Query it, visualize it, and identify
relationships among the data.
4. Verify data quality: How clean/dirty is the data? Document any quality
issues.
Data Preparation
1. Select data: Determine which data sets will be used and document reasons
for inclusion/exclusion.
2. Clean data: Often this is the lengthiest task. Without it, you’ll likely fall
victim to garbage-in, garbage-out. A common practice during this task is to
correct, impute, or remove erroneous values.
3. Construct data: Derive new attributes that will be helpful. For example,
derive someone’s body mass index from height and weight fields.
4. Integrate data: Create new data sets by combining data from multiple
sources.
5. Format data: Re-format data as necessary. For example, you might
convert string values that store numbers to numeric values so that you can
perform mathematical operations.
Modeling
1. Select modeling techniques: Determine which algorithms to try (e.g.
regression, neural net).
2. Generate test design: Depending on your modeling approach, you might need to
split the data into training, test, and validation sets.
3. Build model: As glamorous as this might sound, this might just be
executing a few lines of code like “reg = LinearRegression().fit(X, y)”.
4. Assess model: Generally, multiple models are competing against each
other, and the data scientist needs to interpret the model results based on
domain knowledge, the pre-defined success criteria, and the test design.
Evaluation
The Evaluation phase looks more broadly at which model best meets the business
objectives and what to do next. This phase has three tasks:
1. Evaluate results: Do the models meet the business success criteria? Which
one(s) should we approve for the business?
2. Review process: Review the work accomplished. Was anything
overlooked? Were all steps properly executed? Summarize findings and
correct anything if needed.
3. Determine next steps: Based on the previous three tasks, determine
whether to proceed to deployment, iterate further, or initiate new projects.
Deployment
⚫A model is not particularly useful unless the customer can access its
results.
1. Plan deployment: Develop and document a plan for deploying the model.
2. Plan monitoring and maintenance: Develop a thorough monitoring and
maintenance plan to avoid issues during the operational phase (or post-
project phase) of a model.
3. Produce final report: The project team documents a summary of the
project which might include a final presentation of data mining results.
4. Review project: Conduct a project retrospective about what went well,
what could have been better, and how to improve in the future.
Building Machine Intelligence
⚫Machine Learning Pipelines: A Machine Learning pipeline will mainly
consist of elements related to data retrieval and extraction, preparation,
modeling, evaluation, and deployment.
⚫Supervised Machine Learning Pipeline
⚫Unsupervised Machine Learning Pipeline:
Real-World Case Study: Predicting Student Grant
Recommendations

⚫Given that you have several students with multiple attributes like
grades, performance, and scores, can you build a model based on past
historical data to predict the chance of the student getting a
recommendation grant for a research project?
