Ai & ML Week-1

Uploaded by

ಹರಿ ಶಂ

Artificial Intelligence and Machine Learning Code: 20CS511

Artificial Intelligence and Machine Learning

• Fundamentals
• Machine learning types
• Machine learning workflow
• Machine learning applications
• Challenges in ML
• Building a model – steps involved

Machine Learning: Fundamentals

Machine learning (ML) is a type of artificial intelligence (AI) that allows software
applications to become more accurate at predicting outcomes without being explicitly
programmed to do so. Machine learning algorithms use historical data as input to predict
new output values.
(OR)
Machine learning is a subset of AI, which enables the machine to automatically learn
from data, improve performance from past experiences, and make predictions.
(OR)
Machine learning is the field of study that gives computers the capability to learn
without being explicitly programmed. ML is one of the most exciting technologies that
one could come across. As is evident from the name, it gives the computer the quality
that makes it more similar to humans: the ability to learn. Machine learning is actively
being used today, perhaps in many more places than one would expect.
Machine learning types
1. Supervised Machine Learning
• As its name suggests, supervised machine learning is based on supervision: we
train the machine using a "labeled" dataset, and based on that training, the
machine predicts the output.
• Supervised learning works on labeled data.
• Each input has a corresponding labeled output. The goal of supervised machine
learning is to learn a mapping from the input to the output. The input variables
are called attributes, features, or predictors.
• The output variable is also called the response variable or target variable.
• For example, consider the problem of building a utility for predicting the selling
price of a car. The dataset is shown below:


From the given dataset, the machine learning algorithm learns the mapping from input
variables to output variable. This learning is represented in the form of a model. When a
new instance is given to the model as shown below, it can predict its output value.

Some of the other examples of supervised learning:

• Given an email described by its collection of phrases (X), predict whether the
email is spam (Y).
• Given a medical brain scan image (X), predict whether the patient has tumors (Y).
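
The idea of learning a mapping from an input variable to a target variable can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the car ages and prices are made-up values (not the dataset from these notes), the helper names `fit_line` and `predict` are hypothetical, and a straight line fitted by ordinary least squares stands in for whatever model one would actually choose.

```python
# Hypothetical labeled data: car age in years (feature X) and selling price (target Y).
ages = [1, 2, 3, 4, 5]
prices = [9.0, 8.0, 7.0, 6.0, 5.0]  # made-up values in arbitrary currency units


def fit_line(xs, ys):
    """Learn y = slope * x + intercept from labeled examples (ordinary least squares)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(model, x):
    """Apply the learned mapping to a new, unseen instance."""
    slope, intercept = model
    return slope * x + intercept


model = fit_line(ages, prices)   # training: learn the mapping from the labeled data
print(predict(model, 6))         # prediction for a new 6-year-old car
```

Here the "model" is just the pair (slope, intercept); training produces it from labeled examples, and prediction applies it to an instance that was not in the training data.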

2. Unsupervised Machine Learning

An unsupervised learning algorithm groups or categorizes an unsorted dataset according
to similarities, patterns, and differences.
The machine is instructed to find hidden patterns in the input dataset.
Unsupervised machine learning has no explicitly defined output.
The idea is to discover knowledge or structure in the data.

For example, an online retailer will have data about all the items that its customers
purchased. Unsupervised learning algorithms can be applied to this data to group
customers based on their buying patterns.

Grouping news articles by topic, such as sports, politics, or business, is another
example of unsupervised learning.
This task of discovering inherent clusters or groups in the data is known as clustering.
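
As a rough sketch of clustering, the following Python code groups customers by a single number, their monthly spend, using the k-means idea (assign each value to its nearest centre, then move each centre to the mean of its group). The spend figures, the starting centroids, and the function name `k_means_1d` are assumptions chosen for illustration; no labels are supplied, so the groups emerge from the data alone.

```python
def k_means_1d(values, centroids, iterations=10):
    """Cluster 1-D values around the given starting centroids."""
    centroids = list(centroids)
    clusters = []
    for _ in range(iterations):
        # Assignment step: attach each value to its nearest centroid.
        clusters = [[] for _ in centroids]
        for v in values:
            nearest = min(range(len(centroids)), key=lambda i: abs(v - centroids[i]))
            clusters[nearest].append(v)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters


# Hypothetical monthly spend per customer; note that there are no labels.
monthly_spend = [10, 12, 11, 95, 100, 105]
centroids, clusters = k_means_1d(monthly_spend, centroids=[0, 50])
print(centroids)
print(clusters)
```

With this data the algorithm separates low spenders from high spenders without being told that such groups exist.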

3. Reinforcement Learning
Reinforcement learning works on a feedback-based process in which an AI agent (a
software component) automatically explores its surroundings by trial and error (hit and
trial): taking actions, learning from experience, and improving its performance.
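
The feedback loop can be sketched with a tiny Q-learning example. Everything here is an assumption made for illustration: a corridor of four positions, a reward of 1.0 for reaching the last one, and the names `step` and `train`. To keep the result reproducible, the agent tries every action in every state on each sweep; a real agent would usually explore at random instead.

```python
GOAL = 3                       # states are 0, 1, 2, 3; reaching state 3 ends an episode
ACTIONS = ("left", "right")


def step(state, action):
    """Environment: move along the corridor; reward 1.0 only on reaching the goal."""
    if action == "left":
        next_state = max(0, state - 1)
    else:
        next_state = min(GOAL, state + 1)
    reward = 1.0 if next_state == GOAL else 0.0
    return next_state, reward


def train(sweeps=100, alpha=0.5, gamma=0.9):
    """Learn action values from feedback using the Q-learning update rule."""
    q = {(s, a): 0.0 for s in range(GOAL) for a in ACTIONS}
    for _ in range(sweeps):
        for s in range(GOAL):
            for a in ACTIONS:
                next_state, reward = step(s, a)
                # The goal is terminal, so it contributes no future value.
                if next_state == GOAL:
                    future = 0.0
                else:
                    future = max(q[(next_state, b)] for b in ACTIONS)
                q[(s, a)] += alpha * (reward + gamma * future - q[(s, a)])
    return q


q_table = train()
# Greedy policy: in each state, pick the action with the highest learned value.
policy = [max(ACTIONS, key=lambda a: q_table[(s, a)]) for s in range(GOAL)]
print(policy)
```

The agent is never told that moving right is correct; it infers this from the rewards it receives.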


Machine Learning Process

The diagram above illustrates the Machine Learning process.

In this process, relevant data is first gathered and then cleaned and transformed
through a process called feature engineering.
Common feature engineering tasks include handling missing values, handling outliers,
and creating new features out of existing ones.
After feature engineering, the data is split into train data and test data. The train data is
used for training the machine learning model.
Once the model is built, it is validated against the test data for accuracy.
This accuracy helps us estimate the performance on previously unseen data.
If the model's performance on both train and test data is satisfactory, the model may be
deployed.
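
Two of the feature engineering tasks mentioned above, handling missing values and creating a new feature from existing ones, might look like this in plain Python. The records, column names, and helper functions are hypothetical, and filling gaps with the column mean is only one of several possible strategies.

```python
# Hypothetical raw records; None marks a missing value.
rows = [
    {"income": 4000.0, "household_size": 2},
    {"income": None,   "household_size": 4},
    {"income": 6000.0, "household_size": 3},
]


def impute_mean(rows, column):
    """Replace missing values in `column` with the mean of the observed values."""
    observed = [r[column] for r in rows if r[column] is not None]
    mean = sum(observed) / len(observed)
    return [{**r, column: mean if r[column] is None else r[column]} for r in rows]


def add_income_per_person(rows):
    """Create a new feature out of two existing ones."""
    return [{**r, "income_per_person": r["income"] / r["household_size"]} for r in rows]


features = add_income_per_person(impute_mean(rows, "income"))
for row in features:
    print(row)
```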


Once deployed, the model makes predictions on new data; these predictions/insights are
used to make business decisions.

Machine learning workflow

In order to execute and produce results successfully, a machine learning model must
automate some standard workflows.
These standard workflows can be automated with the help of Scikit-learn
Pipelines.
From a data scientist's perspective, a pipeline is a generalized but very important concept.
It allows data to flow from its raw format to some useful information.
The working of pipelines can be understood with the help of the following diagram −

The blocks of ML pipelines are as follows −

Data ingestion − As the name suggests, this is the process of importing the data for use in
the ML project. The data can be extracted in real time or in batches from single or multiple
systems. It is one of the most challenging steps because the quality of the data can affect the
whole ML model.
Data Preparation − After importing the data, we need to prepare it for use by our
ML model. Data preprocessing is one of the most important techniques of data preparation.
ML Model Training − The next step is to train our ML model. Various kinds of ML
algorithms (supervised, unsupervised, reinforcement) are available to extract features from
the data and make predictions.
Model Evaluation − Next, we need to evaluate the ML model. In the case of an AutoML
pipeline, the ML model can be evaluated with the help of various statistical methods and
business rules.
ML Model retraining − In the case of an AutoML pipeline, the first model is not
necessarily the best one. The first model is considered a baseline model, and we can
retrain it repeatedly to increase the model's accuracy.


Deployment − Finally, we need to deploy the model. This step involves applying and
migrating the model to business operations for their use.

Machine learning Applications


Challenges in ML
• Inadequate Training Data
• Poor quality of data
• Monitoring and maintenance
• Getting bad recommendations
• Lack of skilled resources
• Process Complexity of Machine Learning
• Data Bias
• Slow implementations and results

Building a model – steps involved

Step 1: Collect Data
Given the problem you want to solve, you will have to investigate and obtain the data that
you will use to train your model.
Step 2: Prepare the data

This is a good time to visualize your data and check whether there are correlations between
the different characteristics that you obtained.
You must also separate the data into two groups: one for training and the other for model
evaluation. A split of approximately 80/20 is common, but it can vary depending on the
case and the volume of data you have.

At this stage, you can also pre-process your data by normalizing it, eliminating duplicates,
and correcting errors.
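
The preparation tasks in this step can be sketched with the Python standard library alone. The helper names (`split_train_test`, `min_max_normalize`, `drop_duplicates`), the sample data, and the fixed seed are assumptions for illustration; min-max scaling is just one common way of normalizing.

```python
import random


def drop_duplicates(rows):
    """Remove repeated rows, keeping the first occurrence and the original order."""
    return list(dict.fromkeys(rows))


def min_max_normalize(values):
    """Rescale values linearly to the range [0, 1]."""
    low, high = min(values), max(values)
    return [(v - low) / (high - low) for v in values]


def split_train_test(rows, test_fraction=0.2, seed=0):
    """Shuffle a copy of the rows and split it into training and test lists."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical dataset of (feature, target) pairs with one duplicated row.
raw = [(i, i * 2) for i in range(10)] + [(0, 0)]
data = drop_duplicates(raw)
train_rows, test_rows = split_train_test(data)   # roughly 80/20

print(len(train_rows), len(test_rows))
print(min_max_normalize([10, 20, 30]))
```

Shuffling before splitting avoids a test set that contains only the last rows of the file, and the fixed seed makes the split repeatable.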

Step 3: Choose the model

You will use algorithms for classification, prediction, linear regression, clustering
(e.g. k-means), K-Nearest Neighbor, deep learning (e.g. neural networks), Bayesian
methods, etc.
Various models can be used depending on the data you are going to process,
such as images, sound, text, and numerical values.

In the following table, we will see some models and their applications that you can
apply in your projects:


Model                           Applications

Linear Regression               Price prediction
Fully connected networks        Classification
Convolutional Neural Networks   Image processing
Recurrent Neural Networks       Voice recognition
Random Forest                   Fraud detection
Reinforcement Learning          Learning by trial and error
Generative Models               Image creation
K-means                         Segmentation
k-Nearest Neighbors             Recommendation systems
Bayesian Classifiers            Spam and noise filtering


Step 4: Train your machine learning model

You will need to train the model on the training dataset and look for an incremental
improvement in the prediction rate.

Step 5: Evaluation
You will have to check your trained model against your evaluation dataset.
As a rough rule of thumb, if the accuracy is less than or equal to 50%, the model will not
be useful; on a balanced two-class problem, that is no better than guessing.
If you reach 90% or more, you can have good confidence in the results that the model
gives you, although what counts as acceptable depends on the problem.
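
Accuracy, as used in this step, is simply the fraction of evaluation examples the model predicts correctly. A minimal sketch, with made-up labels and a hypothetical `accuracy` helper:

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


# Hypothetical evaluation set: true labels and the model's predictions.
actual = ["spam", "ham", "ham", "spam", "ham"]
predicted = ["spam", "ham", "spam", "spam", "ham"]

print(accuracy(actual, predicted))  # 4 of the 5 predictions are correct
```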

Step 6: Parameter Tuning

If you did not obtain good predictions during the evaluation, you must change the
configuration of parameters in your model and then return to the training step.
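
One simple way to picture parameter tuning is to try several candidate values for a setting and keep the one that scores best on held-out data. In the sketch below the only tunable setting is a decision threshold; the scores, labels, candidate values, and the name `accuracy_at` are all assumptions made for illustration.

```python
def accuracy_at(threshold, scores, labels):
    """Accuracy when every score at or above `threshold` is predicted as class 1."""
    predictions = [1 if s >= threshold else 0 for s in scores]
    return sum(p == y for p, y in zip(predictions, labels)) / len(labels)


# Hypothetical validation data: one model score and one true label per example.
scores = [0.1, 0.35, 0.4, 0.6, 0.8, 0.9]
labels = [0, 0, 0, 1, 1, 1]

# Try each candidate configuration and keep the best-performing one.
candidates = [0.3, 0.5, 0.7]
best = max(candidates, key=lambda t: accuracy_at(t, scores, labels))
print(best, accuracy_at(best, scores, labels))
```

The same loop generalizes to any setting of a model: change the configuration, retrain or re-evaluate, and compare the results.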

Step 7: Prediction or Inference

You are now ready to use your machine learning model to infer results in real-life
scenarios.


Examples of ML in Daily life

Machine Learning

• Pipeline
  o Data engineering
  o Machine Learning
  o Deployment
• What is Data Science?
• How Data Science works?
• Data Science uses

Machine Learning Pipeline

A machine learning pipeline is a way to control and automate the workflow it
takes to produce a machine learning model. Machine learning pipelines consist of
multiple sequential steps that do everything from data extraction and preprocessing to
model training and deployment.

Machine learning pipelines are iterative, as every step is repeated to continuously
improve the accuracy of the model and achieve the end goal.

An example of an ML pipeline (O'Reilly)

• The term pipeline is used generally to describe an independent sequence of
steps that are arranged together to achieve a task.
• This task could be machine learning or not.
• Machine learning pipelines are very common, but they are not the only type of
pipeline that exists.
• Data orchestration pipelines are another example.


According to Microsoft docs, there are three scenarios:

Deployment
The deployment of machine learning models (or pipelines) is the process of making
models available in production, where web applications, enterprise software (ERPs), and
APIs can consume the trained model by providing new data points and getting
predictions.

In short, deployment in machine learning is the method by which you integrate a
machine learning model into an existing production environment to make practical
business decisions based on data. It is the last stage in the machine learning lifecycle.

Normally, the term machine learning model deployment is used to describe
deployment of the entire machine learning pipeline, in which the model itself is only
one component of the pipeline.
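
At its simplest, deployment means that the trained model is saved once and later loaded by a separate application that feeds it new data points. The sketch below assumes a toy model stored as two numbers in a JSON file; the file name, the parameter values, and the functions `save_model`, `load_model`, and `predict` are hypothetical, and production systems typically add an API layer, versioning, and monitoring on top.

```python
import json
import os
import tempfile


def save_model(model, path):
    """Write the trained model's parameters to a JSON file."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(model, f)


def load_model(path):
    """Read the model parameters back, as a production application would."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def predict(model, x):
    """Score a new data point with the loaded model."""
    return model["slope"] * x + model["intercept"]


# Hypothetical parameters produced by an earlier training step.
trained = {"slope": -1.0, "intercept": 10.0}

with tempfile.TemporaryDirectory() as folder:
    path = os.path.join(folder, "model.json")
    save_model(trained, path)      # done once, after training
    served = load_model(path)      # done by the consuming application

print(predict(served, 6))
```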


An example of a machine learning pipeline built using sklearn

As you can see in the above example, this pipeline contains a
Logistic Regression model. There are several steps in the pipeline that
have to be executed before training can begin, such as imputation
of missing values, one-hot encoding, scaling, and Principal
Component Analysis (PCA).
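
A pipeline with these steps could be written with scikit-learn roughly as follows. This is a sketch, not a reproduction of the figure from the notes: the data, the column layout, the step names, and the way the steps are arranged (imputation and scaling for numeric columns, one-hot encoding for the categorical column, then PCA and logistic regression) are all assumptions.

```python
import numpy as np
from sklearn.compose import ColumnTransformer
from sklearn.decomposition import PCA
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Hypothetical data: columns are age, income (with gaps), and a city code (0, 1 or 2).
X = np.array([
    [25, 30.0, 0], [32, np.nan, 1], [47, 80.0, 2], [51, 90.0, 2],
    [23, 28.0, 0], [45, 85.0, 1], [30, 35.0, 0], [55, np.nan, 2],
])
y = np.array([0, 0, 1, 1, 0, 1, 0, 1])

# Numeric columns: fill missing values with the column mean, then scale.
numeric_steps = Pipeline([
    ("impute", SimpleImputer(strategy="mean")),
    ("scale", StandardScaler()),
])

preprocess = ColumnTransformer(
    [
        ("numeric", numeric_steps, [0, 1]),
        ("category", OneHotEncoder(handle_unknown="ignore"), [2]),
    ],
    sparse_threshold=0,  # always return a dense array for the PCA step
)

pipeline = Pipeline([
    ("preprocess", preprocess),
    ("pca", PCA(n_components=2)),
    ("model", LogisticRegression()),
])

pipeline.fit(X, y)                   # runs every preprocessing step, then trains
predictions = pipeline.predict(X)    # the same steps are reapplied before predicting
print(predictions)
```

Because the preprocessing lives inside the pipeline object, exactly the same transformations are applied at training time and at prediction time.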

Data engineering
Data engineering is the process of designing and building systems that
let people collect and analyze raw data from multiple sources and
formats.

These systems empower people to find practical applications of the
data, which businesses can use to thrive.

What Do Data Engineers Do?

Data engineering is a skill that is in increasing demand. Data engineers
are the people who design the system that unifies data and can help you
navigate it. Data engineers perform many different tasks including:

Acquisition: Finding all the different data sets around the business

Cleansing: Finding and cleaning any errors in the data

Conversion: Giving all the data a common format

Disambiguation: Interpreting data that could be interpreted in multiple ways

Deduplication: Removing duplicate copies of data
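
Several of these tasks can be illustrated on a handful of records. The customer records, field names, and helper functions below are hypothetical; the sketch assumes that slash-separated dates are written day first, which is itself a small example of a disambiguation decision.

```python
from datetime import datetime

# Hypothetical customer records collected from two source systems.
records = [
    {"name": "Asha ", "joined": "2023-01-15"},
    {"name": "asha", "joined": "15/01/2023"},
    {"name": "Ravi", "joined": "2022-11-02"},
]


def to_iso_date(text):
    """Conversion: accept YYYY-MM-DD or DD/MM/YYYY and return YYYY-MM-DD."""
    for fmt in ("%Y-%m-%d", "%d/%m/%Y"):
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    raise ValueError(f"unrecognized date format: {text!r}")


def clean(record):
    """Cleansing: trim whitespace and normalize letter case in the name."""
    return {
        "name": record["name"].strip().title(),
        "joined": to_iso_date(record["joined"]),
    }


def deduplicate(records):
    """Deduplication: keep the first copy of each (name, joined) pair."""
    seen, unique = set(), []
    for r in records:
        key = (r["name"], r["joined"])
        if key not in seen:
            seen.add(key)
            unique.append(r)
    return unique


cleaned = deduplicate([clean(r) for r in records])
print(cleaned)
```

The first two records only turn out to be duplicates after cleansing and conversion, which is why deduplication runs last.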


Once this is done, data may be stored in a central repository such as a
data lake or data lakehouse.
Data engineers may also copy and move subsets of data into a data
warehouse.

Machine learning pipeline

A machine learning pipeline is a way to codify and automate the
workflow it takes to produce a machine learning model.

Machine learning pipelines consist of multiple sequential steps that do
everything from data extraction and preprocessing to model training
and deployment.

What is Data Science?

Data science is the domain of study that deals with vast volumes of
data, using modern tools and techniques to find unseen patterns,
derive meaningful information, and make business decisions.

For example, finance companies can use a customer's banking and bill-
paying history to assess creditworthiness and loan risk.
How Data Science works?
Data science uses techniques such as machine learning and artificial
intelligence to extract meaningful information and to predict future
patterns and behaviours. Advances in technology, the internet, and social
media have all increased access to big data.

Data Science uses

Data science is used in marketing, finance, human resources,
healthcare, government programmes, and any other industry that
generates data.
Marketing departments use data science to determine which product is
most likely to sell.
