Machine Learning Spark ML

The document provides an introduction to machine learning, including defining machine learning, examples of machine learning in real life, supervised vs unsupervised learning, and machine learning as a process involving data preparation, feature engineering, model building, evaluation and deployment.

Uploaded by

Perike Chandra Sekhar

Introduction to Machine Learning

Perike Chandra Sekhar

Machine Learning (ML)
• ML is a branch of artificial intelligence:
• Uses computer-based systems to make sense of data
• Extracting patterns, fitting data to functions, classifying data, etc.
• ML systems can learn and improve
• With historical data, time and experience
• Bridges theoretical computer science and real, noisy data.

ML in real-life

Supervised and Unsupervised Learning
• Unsupervised Learning
• There is no predefined, known set of outcomes
• Looks for hidden patterns and relations in the data
• A typical example: Clustering
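As a rough illustration of clustering, the sketch below runs a tiny k-means (Lloyd's algorithm) on one-dimensional points. The function name `kmeans_1d`, the sample points and the starting centroids are all made up for this example; they do not come from the slides.

```python
def kmeans_1d(points, centroids, iterations=10):
    """Cluster 1-D points around the given starting centroids."""
    centroids = list(centroids)
    clusters = []
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters


# Hypothetical data: no labels are given, only the raw values.
points = [1.0, 1.2, 0.8, 9.0, 9.5, 10.1]
centroids, clusters = kmeans_1d(points, centroids=[0.0, 5.0])
print(centroids)
print(clusters)
```

No outcome is supplied for any point; the two groups emerge only from the distances between values.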

Supervised and Unsupervised Learning
• Supervised Learning
• For every example in the data there is always a predefined outcome
• Models the relations between a set of descriptive features and a
target (Fits data to a function)
• 2 groups of problems:
• Classification
• Regression

Supervised Learning
• Classification
• Predicts which class a given sample of data (sample of descriptive
features) is part of (discrete value).

• Regression
• Predicts continuous values.
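The difference between the two problem groups can be sketched with two very small models: a least-squares line (regression, continuous output) and a 1-nearest-neighbour rule (classification, discrete output). Both models, the helper names and the data are illustrative assumptions, not methods named in the slides.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept


def nearest_neighbor_class(train, x):
    """Return the label of the training example whose feature is closest to x."""
    return min(train, key=lambda pair: abs(pair[0] - x))[1]


# Regression: the target is a continuous value (hypothetical sizes and prices).
sizes = [50.0, 80.0, 100.0]
prices = [100.0, 160.0, 200.0]
slope, intercept = fit_line(sizes, prices)
predicted_price = slope * 70.0 + intercept

# Classification: the target is one of a fixed set of classes.
labelled = [(1.0, "small"), (2.0, "small"), (8.0, "large"), (9.0, "large")]
predicted_class = nearest_neighbor_class(labelled, 7.0)

print(predicted_price, predicted_class)
```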

Machine Learning as a Process
Define Objectives
- Define measurable and quantifiable goals
- Use this stage to learn about the problem
Data Preparation
- Normalization
- Transformation
- Missing Values
- Outliers
Model Building
- Data Splitting
- Features Engineering
- Estimating Performance
- Evaluation and Model Selection
Model Evaluation
- Study the model's accuracy
- Does it work better than the naïve approach or previous system?
- Do the results make sense in the context of the problem?
Model Deployment
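One way to check the evaluation question "does the model work better than the naïve approach" is to compare its accuracy with a majority-class baseline. The sketch below assumes a classification task; the labels and the `model_predictions` list are hypothetical stand-ins for a real model's output.

```python
from collections import Counter


def accuracy(predictions, labels):
    """Fraction of predictions that match the true labels."""
    return sum(p == y for p, y in zip(predictions, labels)) / len(labels)


def majority_baseline(train_labels, n):
    """Naive approach: always predict the most frequent training label."""
    most_common = Counter(train_labels).most_common(1)[0][0]
    return [most_common] * n


train_labels = ["spam", "ham", "ham", "ham"]
test_labels = ["ham", "spam", "ham", "spam"]
model_predictions = ["ham", "spam", "ham", "ham"]  # hypothetical model output

baseline_acc = accuracy(majority_baseline(train_labels, len(test_labels)), test_labels)
model_acc = accuracy(model_predictions, test_labels)
print(baseline_acc, model_acc)
```

A model that cannot beat this kind of baseline on held-out data is usually not worth deploying.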

ML as a Process: Data Preparation
• Needed for several reasons
• Some models have strict data requirements
• Scale of the data, data point intervals, etc.
• Some characteristics of the data may dramatically affect model performance
• Time spent on data preparation should not be underestimated
Raw Data
• Missing Values
• Error Values
• Different Scales
• Dimensionality
• Types Problems
• Many others
Data Transformation
• Scaling
• Centering
• Skewness
• Outliers
• Missing Values
• Errors
Data Ready
Modeling phase
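Two of the transformations listed above, filling missing values and centering plus scaling, can be sketched with the standard library alone. Mean imputation and standardization to unit variance are choices made for this example; the slides do not prescribe a specific method, and the data is invented.

```python
from statistics import mean, pstdev


def impute_mean(values):
    """Replace missing entries (None) with the mean of the known values."""
    known = [v for v in values if v is not None]
    fill = mean(known)
    return [fill if v is None else v for v in values]


def standardize(values):
    """Center on the mean and scale to unit (population) standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        raise ValueError("cannot scale a constant column")
    return [(v - mu) / sigma for v in values]


raw = [2.0, None, 4.0, 6.0]   # hypothetical raw column with one missing value
filled = impute_mean(raw)
scaled = standardize(filled)
print(filled)
print(scaled)
```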

ML as a Process: Feature engineering
• Determining the predictors (features) to be used is one of the most critical questions
• Sometimes we need to add predictors
• Reduce Number:
• Fewer predictors give a more interpretable and less costly model
• Most models are affected by high dimensionality, especially by non-informative predictors
Wrappers
• Algorithms that use models as input and performance as output
• Multiple models adding and removing parameters
• Genetic Algorithms
Filters
• Evaluate the relevance of the predictor
• Based normally on correlations

• Binning predictors
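A filter of the correlation-based kind can be sketched as follows: score each predictor by the absolute Pearson correlation with the target and keep those above a cutoff. The `threshold` value, the feature names and the data are assumptions made for illustration only.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return covariance / (spread_x * spread_y)


def filter_features(features, target, threshold):
    """Keep the predictors whose |correlation| with the target reaches the threshold."""
    scores = {name: abs(pearson(column, target)) for name, column in features.items()}
    return [name for name, score in scores.items() if score >= threshold]


# Hypothetical predictors: "size" tracks the target, "noise" mostly does not.
features = {
    "size": [1.0, 2.0, 3.0, 4.0],
    "noise": [2.0, 1.0, 2.0, 1.0],
}
target = [2.1, 3.9, 6.2, 7.8]
selected = filter_features(features, target, threshold=0.5)
print(selected)
```

Unlike a wrapper, this filter never trains a model; it only looks at each predictor's relation to the target.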

ML as a Process: Model Building
• Data Splitting
• Allocate data to different tasks
• model training
• performance evaluation
• Define Training, Validation and Test sets
• Feature Selection (Review the decision made previously)
• Estimating Performance
• Visualization of results: discover interesting areas of the problem space
• Statistics and performance measures
• Evaluation and Model selection
• The ‘no free lunch’ theorem: no a priori assumptions can be made
• Avoid relying on favorite models; try others when needed
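The data splitting step can be sketched as a seeded shuffle followed by slicing into training, validation and test sets. The 60/20/20 proportions and the `split_data` helper are assumptions for this example; the slides do not specify ratios.

```python
import random


def split_data(rows, train_frac=0.6, val_frac=0.2, seed=0):
    """Shuffle the rows reproducibly and cut them into train/validation/test."""
    rows = list(rows)
    random.Random(seed).shuffle(rows)
    n_train = round(len(rows) * train_frac)
    n_val = round(len(rows) * val_frac)
    train = rows[:n_train]
    validation = rows[n_train:n_train + n_val]
    test = rows[n_train + n_val:]   # whatever remains is held out for testing
    return train, validation, test


train, validation, test = split_data(range(10))
print(len(train), len(validation), len(test))
```

The training set is used to fit models, the validation set to compare and select them, and the test set only once for the final performance estimate.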
