0% found this document useful (0 votes)

42 views

Chap1 - Introduction To Machine Learning

This document provides an introduction to machine learning. It discusses the growth of large-scale data collection across many domains like cybersecurity, e-commerce, social media, and more. Machine learning can help analyze this data to gain useful insights. Specifically, it allows scientists to automatically analyze massive datasets and form hypotheses. It also provides opportunities to improve productivity and solve major societal problems in areas like healthcare, climate change, agriculture and more. The document then discusses data mining and the knowledge discovery in databases (KDD) process for applying machine learning techniques to extract patterns and knowledge from large datasets.

Uploaded by

mesemo tadiwos

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

42 views

Chap1 - Introduction To Machine Learning

Uploaded by

mesemo tadiwos

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 40

Machine Learning: Introduction

Introduction to Machine Learning

Introduction to Machine Learning By Mr.

Gebreyes G.
Large-scale Data is Everywhere!
 There has been enormous data
growth in both commercial and
scientific databases due to advances
in data generation and collection
technologies Cyber Security
E-Commerce

 New mantra
 Gather whatever data you can
whenever and wherever
possible.
 Expectations Traffic Patterns Social Networking:
Twitter
 Gathered data will have value
either for the purpose collected
or for a purpose not envisioned.

Sensor Networks Computational Simula

Introduction to Machine Learning By Mr.
Gebreyes G.
Why ML? Commercial Viewpoint
 Lots of data is being collected and warehoused
– Web data
 Google has Peta Bytes of web data
 Facebook has billions of active users
– purchases at department/
grocery stores, e-commerce
 Amazon handles millions of visits/day
– Bank/Credit Card transactions
 Computers have become cheaper and more powerful
 Competitive Pressure is Strong
– Provide better, customized services for an edge (e.g. in
Customer Relationship Management)

Introduction to Machine Learning By Mr.

Gebreyes G.
Why ML? Scientific Viewpoint
Data collected and stored at enormous speeds
• remote sensors on a satellite
 NASA EOSDIS archives over
petabytes of earth science data / year
– telescopes scanning the skies fMRI Data from Brain Sky Survey Data
 Sky survey data

– High-throughput biological data

– scientific simulations
 terabytes of data generated in a few hours

MLhelps scientists Gene Expression Data

– in automated analysis of massive datasets

– In hypothesis formation

Introduction to Machine Learning By Mr. Surface Temperature of Earth

Gebreyes G.
Great opportunities to improve productivity in all walks of life

Introduction to Machine Learning By Mr.

Gebreyes G.
Great Opportunities to Solve Society’s Major Problems

Improving health care and reducing costs Predicting the impact of climate change

Reducing hunger and poverty by

Finding alternative/ green energy sources
increasing agriculture production
Introduction to Machine Learning By Mr.
Gebreyes G.
Data Mining and KDD process

 Process of automatically discovering useful information in

large data repositories.
Why data mining techniques are deployed?
 To scour large databases in order to find novel and useful
patterns that might otherwise remain unknown.
 To provide capabilities to predict the outcome of a future
observation, such as predicting whether a newly arrived.

Introduction to Machine Learning By Mr.

Gebreyes G.
What is Data Mining?
 Many Definitions
– Non-trivial extraction of implicit, previously unknown and
potentially useful information from data
– Exploration & analysis, by automatic or semi-automatic
means, of large quantities of data in order to discover
meaningful patterns

Introduction to Machine Learning By Mr.

Gebreyes G.
IR

Not all information discovery tasks are considered to be

data mining.
 For example, Looking up individual records using a
database management system
 finding particular Web pages via a query to an Internet
search engine are tasks related to the area of
information retrieval.

Introduction to Machine Learning By Mr.

Gebreyes G.
Origins of Data Mining
 Draws ideas from machine learning/AI, pattern recognition,
statistics, and database systems
 Traditional techniques may be unsuitable due to data that is
– Large-scale
– High dimensional
– Heterogeneous
– Complex
– Distributed

 A key component of the emerging field of data science and data-driven

discovery
Introduction to Machine Learning By Mr.
Gebreyes G.
KDD ( Knowledge Discovery in Databases)

 process of collecting data and methodically refining it.

 It refers to the broad procedure of discovering
knowledge in data and emphasizes the high-level
applications of specific Data Mining techniques.
The main Objective of KDD:
 To extract information from data in the context of large
databases.
How?
 By using Data Mining algorithms to identify what is
deemed knowledge.
Introduction to Machine Learning By Mr.
Gebreyes G.
Steps in KDD

Introduction to Machine Learning By Mr.

Gebreyes G.
The input data can be stored in a variety of format:
 flat files,
 spreadsheets,
 relational tables and
 may reside in a centralized data repository or be
distributed across multiple sites.
preprocessing
 To transform the raw input data into an appropriate
format for subsequent analysis.

Introduction to Machine Learning By Mr.

Gebreyes G.
The steps involved in data preprocessing include:
 fusing data from multiple sources,
 cleaning data to remove noise and duplicate observations,
 selecting records and features that are relevant to the data
mining task.
 Because of the many ways data can be collected and
stored, data preprocessing is perhaps the most laborious
and time-consuming step in the overall knowledge
discovery process.

Introduction to Machine Learning By Mr.

Gebreyes G.
 Postprocessing
 step that ensures that only valid and useful results are
incorporated into the decision support system
 Integrating data mining results into decision support
systems.
 For example, in business applications, the insights
offered by data mining results can be integrated with
campaign management tools so that effective marketing
promotions can be conducted and tested.

Introduction to Machine Learning By Mr.

Gebreyes G.
DM vs KDD
The Knowledge Discovery in Databases
 is considered as a programmed, exploratory analysis
and modeling of vast data repositories.
 is the organized procedure of recognizing valid, useful,
and understandable patterns from huge and complex
data sets.
Data Mining
 is the root of the KDD procedure, including the
inferring of algorithms that investigate the data, develop
the model, and find previously unknown patterns.
 The model is used for extracting the knowledge from
the data, analyzeIntroduction
the data, and predict the data.
to Machine Learning By Mr.
Gebreyes G.
KDD Approaches

Two most common Approaches

SEMMA(Sample, Explore, Modify, Model, Assess)
The data mining method can be used to solve a wide range of
business problems, including fraud identification, customer
retention and turnover, database marketing, customer loyalty,
bankruptcy forecasting, market segmentation, as well as risk,
affinity, and portfolio analysis.
CRoss Industry Standard Process for Data Mining (CRISP-
DM)
is a academic process model that serves as the base for a
data science process .
Introduction to Machine Learning By Mr.
Gebreyes G.
SEMMA

1. Sample: identify variables or factors (both dependent and

independent) influencing the process.
2. Explore: to study interconnected relationships between data
elements and to identify gaps in the data using univariate and
multi-variate analysis.
3. Modify: the data is parsed and cleaned, being then passed
onto the modeling stage.
4. Model: applying a variety of data mining techniques in order
to produce a projected model of how this data achieves the
final, desired outcome of the process.
5. Assess: evaluated for how useful and reliable it is for the
studied topic. The data can now be tested and used to estimate
the efficacy of its performance.
Introduction to Machine Learning By Mr.
Gebreyes G.
CRISP

1. Business understanding – What does the business need?

2. Data understanding – What data do we have / need? Is it
clean?
3. Data preparation – How do we organize the data for
modeling?
4. Modeling – What modeling techniques should we apply?
5. Evaluation – Which model best meets the business
objectives?
6. Deployment – How do stakeholders access the results?
For more information click on link provided below
https://medium.com/@gebreyesis56/data-mining-and-kdd-
88d43235d681 Introduction to Machine Learning By Mr.
Gebreyes G.
1. How you can analyze the relationship between
variables? Univariate and mult-variate?
2. Which Approach is the best and Popular one???

Introduction to Machine Learning By Mr.

Gebreyes G.
Data Mining Tasks

 Prediction Methods
 to predict the value of a particular attribute
based on the values of other attributes
 The attribute to be predicted is commonly
known as the target or dependent variable,
 while the attributes used for making the
prediction are known as the explanatory or
independent variables.
From [Fayyad, et.al.] Advances in Knowledge Discovery and Data Mining, 1996

Introduction to Machine Learning By Mr.

Gebreyes G.
 Description Methods
 Find human-interpretable patterns that
describe the data.
 to derive patterns (correlations, trends, clusters,
trajectories, and anomalies) that summarize the
underlying relationships in data.
 Descriptive data mining tasks are often exploratory
in nature and frequently require postprocessing
techniques to validate and explain the results.

Introduction to Machine Learning By Mr.

Gebreyes G.
Data Mining Tasks …

Clu
ste Data
ri ng
Tid Refund Marital
Status
Taxable
Income Cheat
l i ng
1 Yes Single 125K No
ode
2 No Married 100K No
M
i ve
3 No Single 70K No
4 Yes Married 120K No
ct
5 No Divorced 95K Yes

edi
6
7
No
Yes
Married 60K
Divorced 220K
No
No P r
8 No Single 85K Yes
9 No Married 75K No
10 No Single 90K Yes
An
De oma
11 No Married 60K No

ation 12 Yes Divorced 220K No

tec ly
oci 13 No Single 85K Yes

tio
s
As
14 No Married 75K No

10
15 No Single 90K Yes
n

l es
Ru

Milk

Introduction to Machine Learning By Mr.

Gebreyes G.
Predictive Modeling: Classification
 Find a model for class attribute as a function of
the values of other attributes Model for predicting credit
worthiness

Class Employed
# years at
Level of Credit Yes
Tid Employed present No
Education Worthy
address
1 Yes Graduate 5 Yes
2 Yes High School 2 No No Education
3 No Undergrad 1 No
{ High school,
4 Yes High School 10 Yes Graduate
Undergrad }
… … … … …
10

Number of Number of
years years

> 3 yr < 3 yr > 7 yrs < 7 yrs

Yes No Yes No

Introduction to Machine Learning By Mr.

Gebreyes G.
Classification Example

cal cal tive # years at

ori ori ita Level of Credit
g g nt s Tid Employed
Education
present
Worthy
te te a as address
ca ca qu cl 1 Yes Undergrad 7 ?
# years at 2 No Graduate 3 ?
Level of Credit
Tid Employed present 3 Yes High School 2 ?
Education Worthy
address
1 Yes Graduate 5 Yes … … … … …
10

2 Yes High School 2 No

3 No Undergrad 1 No
4 Yes High School 10 Yes
… … … … …
10 Test
Set

Training
Learn
Set Classifier Model

Introduction to Machine Learning By Mr.

Gebreyes G.
Examples of Classification Task

 Classifying credit card transactions

as legitimate or fraudulent
 Classifying land covers (water bodies, urban areas,
forests, etc.) using satellite data
 Categorizing news stories as finance,
weather, entertainment, sports, etc
 Identifying intruders in the cyberspace
 Predicting tumor cells as benign or malignant
 Classifying secondary structures of protein
as alpha-helix, beta-sheet, or random coil
Introduction to Machine Learning By Mr.
Gebreyes G.
Classification: Application 1

 Fraud Detection
– Goal: Predict fraudulent cases in credit card transactions.
– Approach:
 Use credit card transactions and the information on its account-
holder as attributes.
– When does a customer buy, what does he buy, how often he
pays on time, etc
 Label past transactions as fraud or fair transactions. This forms
the class attribute.
 Learn a model for the class of the transactions.
 Use this model to detect fraud by observing credit card
transactions on an account.

Introduction to Machine Learning By Mr.

Gebreyes G.
Classification: Application 2

 Churn prediction for telephone customers

– Goal: To predict whether a customer is likely to be lost
to a competitor.
– Approach:
 Use detailed record of transactions with each of the past and
present customers, to find attributes.
– How often the customer calls, where he calls, what time-of-the
day he calls most, his financial status, marital status, etc.
 Label the customers as loyal or disloyal.
 Find a model for loyalty.

From [Berry & Linoff] Data Mining Techniques, 1997

Introduction to Machine Learning By Mr.
Gebreyes G.
Classification: Application 3
 Sky Survey Cataloging
– Goal: To predict class (star or galaxy) of sky objects, especially
visually faint ones, based on the telescopic survey images (from
Palomar Observatory).
– 3000 images with 23,040 x 23,040 pixels per image.
– Approach:
 Segment the image.

 Measure image attributes (features) - 40 of them per object.

 Model the class based on these features.

 Success Story: Could find 16 new high red-shift quasars,

some of the farthest objects that are difficult to find!
From [Fayyad, et.al.] Advances in Knowledge Discovery and Data Mining, 1996

Introduction to Machine Learning By Mr.

Gebreyes G.
Classifying Galaxies
Courtesy: http://aps.umn.edu

Early Class: Attributes:

• Stages of Formation • Image features,
• Characteristics of light
waves received, etc.
Intermediate

Late

Data Size:
• 72 million stars, 20 million galaxies
• Object Catalog: 9 GB
• Image Database: 150 GB

Introduction to Machine Learning By Mr.

Gebreyes G.
Regression

 Predict a value of a given continuous valued variable

based on the values of other variables, assuming a
linear or nonlinear model of dependency.
 Extensively studied in statistics, neural network
fields.
 Examples:
– Predicting sales amounts of new product based on
advetising expenditure.
– Predicting wind velocities as a function of
temperature, humidity, air pressure, etc.
– Time series prediction of stock market indices.
Introduction to Machine Learning By Mr.
Gebreyes G.
Clustering

 Finding groups of objects such that the objects in a group

will be similar (or related) to one another and different
from (or unrelated to) the objects in other groups

Inter-cluster
Intra-cluster distances are
distances are maximized
minimized

Introduction to Machine Learning By Mr.

Gebreyes G.
Applications of Cluster Analysis
 Understanding
– Custom profiling for targeted marketing
– Group related documents for browsing
– Group genes and proteins that have
similar functionality
– Group stocks with similar price
fluctuations
 Summarization
– Reduce the size of large data sets

Courtesy: Michael Eisen

Clusters for Raw SST and Raw NPP

Use of K-means to partition

Sea Surface Temperature
60

Land Cluster 2

30
(SST) and Net Primary
Production (NPP) into clusters
Land Cluster 1
latitude

0
that reflect the Northern and
Ice or No NPP

-30
Southern Hemispheres.
Sea Cluster 2

-60

Sea Cluster 1
Introduction to Machine Learning By Mr.
-90
-180 -150 -120 -90 -60 -30 0 30

longitude
60 90 120 150 180
Cluster Gebreyes G.
Clustering: Application 1

 Market Segmentation:
– Goal: subdivide a market into distinct subsets of customers
where any subset may conceivably be selected as a market
target to be reached with a distinct marketing mix.
– Approach:
 Collect different attributes of customers based on their
geographical and lifestyle related information.
 Find clusters of similar customers.

 Measure the clustering quality by observing buying

patterns of customers in same cluster vs. those from
different clusters.

Introduction to Machine Learning By Mr.

Gebreyes G.
Clustering: Application 2

 Document Clustering:
– Goal: To find groups of documents that are similar to each
other based on the important terms appearing in them.

– Approach: To identify frequently occurring terms in each

document. Form a similarity measure based on the
frequencies of different terms. Use it to cluster.

Enron email dataset

Introduction to Machine Learning By Mr.

Gebreyes G.
Association Rule Discovery: Definition

 Given a set of records each of which contain some

number of items from a given collection
– Produce dependency rules which will predict occurrence of
an item based on occurrences of other items.

TID Items
1 Bread, Coke, Milk
Rules
RulesDiscovered:
Discovered:
2 Beer, Bread {Milk}
{Milk}-->
-->{Coke}
{Coke}
3 Beer, Coke, Diaper, Milk {Diaper,
{Diaper,Milk}
Milk}-->
-->{Beer}
{Beer}
4 Beer, Bread, Diaper, Milk
5 Coke, Diaper, Milk

Introduction to Machine Learning By Mr.

Gebreyes G.
Association Analysis: Applications

 Market-basket analysis
– Rules are used for sales promotion, shelf management, and
inventory management

 Telecommunication alarm diagnosis

– Rules are used to find combination of alarms that occur
together frequently in the same time period

 Medical Informatics
– Rules are used to find combination of patient symptoms
and test results associated with certain diseases

Introduction to Machine Learning By Mr.

Gebreyes G.
Association Analysis: Applications

 An Example Subspace Differential Coexpression Pattern from

lung cancer dataset Three lung cancer datasets [Bhattacharjee et al
2001], [Stearman et al. 2005], [Su et al. 2007]

Enriched with the TNF/NFB signaling pathway

which is well-known to be related to lung cancer
P-value: 1.4*10-5 (6/10 overlap with the pathway)

[Fang et al PSB 2010]

Introduction to Machine Learning By Mr.
Gebreyes G.
Deviation/Anomaly/Change Detection
 Detect significant deviations from normal
behavior
 Applications:
– Credit Card Fraud Detection
– Network Intrusion
Detection
– Identify anomalous behavior from sensor
networks for monitoring and
surveillance.
– Detecting changes in the global forest
cover.

Introduction to Machine Learning By Mr.

Gebreyes G.
Motivating Challenges

 are some of the specific challenges that motivated the

development of data mining.
 Scalability: data sets with sizes of gigabytes,
terabytes, or even petabytes are becoming common.
 High Dimensionality: number features increasement
 Heterogeneous and Complex Data: need for
techniques that can handle heterogeneous attributes
 Data Ownership and Distribution: the data needed
for an analysis is not stored in one location or owned
by one organization.

Introduction to Machine Learning By Mr.

Gebreyes G.

What Affects Usage Satisfaction in Mobile Payments? Modelling User Generated Content To Develop The "Digital Service Usage Satisfaction Model "
No ratings yet
What Affects Usage Satisfaction in Mobile Payments? Modelling User Generated Content To Develop The "Digital Service Usage Satisfaction Model "
21 pages
Build Guide - Landy Mini
0% (1)
Build Guide - Landy Mini
16 pages
BA4027 Datamining For BI
100% (1)
BA4027 Datamining For BI
67 pages
BIS 541 Ch01 20-21 S
No ratings yet
BIS 541 Ch01 20-21 S
129 pages
The Arab Academy For Managerial, Banking and Financial Science Name: Course: Semester
No ratings yet
The Arab Academy For Managerial, Banking and Financial Science Name: Course: Semester
4 pages
Data Mining Overview
No ratings yet
Data Mining Overview
24 pages
FDS Notes
No ratings yet
FDS Notes
143 pages
Data Science: Chapter 1: Introduction To Big Data
100% (2)
Data Science: Chapter 1: Introduction To Big Data
77 pages
Unit- 1
No ratings yet
Unit- 1
28 pages
Big data notes
No ratings yet
Big data notes
4 pages
BDA.Unit-1
No ratings yet
BDA.Unit-1
40 pages
Unit3 - Machine Learning With Big Data
No ratings yet
Unit3 - Machine Learning With Big Data
74 pages
Computer Science 3rd Year Specilization
No ratings yet
Computer Science 3rd Year Specilization
9 pages
Big Data Report 1
No ratings yet
Big Data Report 1
17 pages
Big data analytics notes
No ratings yet
Big data analytics notes
33 pages
23 Vol 2 No 4
No ratings yet
23 Vol 2 No 4
5 pages
11.course Materials (Unit Wise
No ratings yet
11.course Materials (Unit Wise
138 pages
Overview of Big Data
No ratings yet
Overview of Big Data
4 pages
data mining introduction
No ratings yet
data mining introduction
52 pages
Cs3352 FDS Question Bank
No ratings yet
Cs3352 FDS Question Bank
145 pages
P.Prabu (31x61c) CCS334-BDA.Unit-1
No ratings yet
P.Prabu (31x61c) CCS334-BDA.Unit-1
32 pages
P.prabu (31x61c) CCS334 BDA - Unit 1
No ratings yet
P.prabu (31x61c) CCS334 BDA - Unit 1
31 pages
Business Analytics
100% (5)
Business Analytics
46 pages
V3N2 121 PDF
No ratings yet
V3N2 121 PDF
4 pages
Fods Notes
No ratings yet
Fods Notes
139 pages
Data Mining
No ratings yet
Data Mining
18 pages
FDS NOTES
No ratings yet
FDS NOTES
137 pages
Data Mining and Data Warehousing
No ratings yet
Data Mining and Data Warehousing
21 pages
DM - Unit4
No ratings yet
DM - Unit4
15 pages
Foundation of Data Science
100% (2)
Foundation of Data Science
143 pages
UNIT 1 Introduction of Data Mining
No ratings yet
UNIT 1 Introduction of Data Mining
40 pages
Chapter 1 Data Science Fundamentals
No ratings yet
Chapter 1 Data Science Fundamentals
34 pages
BCA Lecture I
No ratings yet
BCA Lecture I
20 pages
Module-1 DM
No ratings yet
Module-1 DM
15 pages
B SC (IT) VI-DSE3-M5
No ratings yet
B SC (IT) VI-DSE3-M5
13 pages
IME 672-Chapter 1 PDF
No ratings yet
IME 672-Chapter 1 PDF
41 pages
Dmbi Unit-3
No ratings yet
Dmbi Unit-3
21 pages
New Note
No ratings yet
New Note
23 pages
(IJCST-V5I3P23) :fatima, Dr. Jawed Ikbal Khan
No ratings yet
(IJCST-V5I3P23) :fatima, Dr. Jawed Ikbal Khan
3 pages
Fds Module 1
No ratings yet
Fds Module 1
65 pages
e4f1fb7f-a61e-4090-9018-344695f0d7d4 (2)
No ratings yet
e4f1fb7f-a61e-4090-9018-344695f0d7d4 (2)
30 pages
DM Chapter 1
No ratings yet
DM Chapter 1
37 pages
What Is Data
No ratings yet
What Is Data
20 pages
Data Mining Tutorial - Javatpoint
No ratings yet
Data Mining Tutorial - Javatpoint
12 pages
Bigdata Documentation
No ratings yet
Bigdata Documentation
20 pages
Big Data A Survey Dinesh
No ratings yet
Big Data A Survey Dinesh
9 pages
AD3491 UNIT 1 NOTES EduEngg
100% (1)
AD3491 UNIT 1 NOTES EduEngg
35 pages
A Survey of Machine Learning Algorithms For Big Data Analytics
No ratings yet
A Survey of Machine Learning Algorithms For Big Data Analytics
4 pages
ADET - Lesson 2
No ratings yet
ADET - Lesson 2
21 pages
Inroduction To Data Science
No ratings yet
Inroduction To Data Science
62 pages
Introduction To Data Science and Big Data
No ratings yet
Introduction To Data Science and Big Data
6 pages
Review of Recent Technologies in Big Data Analysis
No ratings yet
Review of Recent Technologies in Big Data Analysis
3 pages
All in one
No ratings yet
All in one
362 pages
Data Mining Unit 1(Msc Ds 3 Sem)
No ratings yet
Data Mining Unit 1(Msc Ds 3 Sem)
119 pages
DM Module1
No ratings yet
DM Module1
15 pages
Machine Learning and Big Data Investing
No ratings yet
Machine Learning and Big Data Investing
11 pages
Data Mning Tools and TechniquesAIMA
No ratings yet
Data Mning Tools and TechniquesAIMA
97 pages
MBA933 - Lectures 1-2
No ratings yet
MBA933 - Lectures 1-2
45 pages
Unit 1 and Unit 2 notes bda
No ratings yet
Unit 1 and Unit 2 notes bda
11 pages
Big Data Searching FIRST Review
No ratings yet
Big Data Searching FIRST Review
10 pages
Data Mining: Fundamentals and Applications
From Everand
Data Mining: Fundamentals and Applications
Fouad Sabry
No ratings yet
Mastering Data Mining Techniques
From Everand
Mastering Data Mining Techniques
Dhaanyalakshmi Ahuja
No ratings yet
OOSAD Chapter 5
No ratings yet
OOSAD Chapter 5
56 pages
ML CH Part 2
No ratings yet
ML CH Part 2
19 pages
Introduction
No ratings yet
Introduction
11 pages
22 Chale
No ratings yet
22 Chale
18 pages
Exit Exam For Accounting and Finance PDF
89% (9)
Exit Exam For Accounting and Finance PDF
11 pages
World's First New Built SANHA LPG FPSO Ready: Japan Ship Exporters' Association
No ratings yet
World's First New Built SANHA LPG FPSO Ready: Japan Ship Exporters' Association
6 pages
Ai in Cyber Security Seminar
No ratings yet
Ai in Cyber Security Seminar
14 pages
Robotstudio 220412 Final
No ratings yet
Robotstudio 220412 Final
41 pages
Parlantes para Interior de Techo 6W
No ratings yet
Parlantes para Interior de Techo 6W
4 pages
Lab No. 7: Lab Manual For Operating System
No ratings yet
Lab No. 7: Lab Manual For Operating System
6 pages
Honda Accord At3w1717om
No ratings yet
Honda Accord At3w1717om
585 pages
Limbach l550 Ef Datasheet en
No ratings yet
Limbach l550 Ef Datasheet en
2 pages
PP in Ii 001
No ratings yet
PP in Ii 001
15 pages
Unit 1
No ratings yet
Unit 1
19 pages
ADE Base Devices: Manufacturer Device Name Application or Mobile/tethered Device
No ratings yet
ADE Base Devices: Manufacturer Device Name Application or Mobile/tethered Device
6 pages
Universal - Asynchronous - Receiver-Transmitter - EN
No ratings yet
Universal - Asynchronous - Receiver-Transmitter - EN
12 pages
Computer E-book English RBE (2)
No ratings yet
Computer E-book English RBE (2)
69 pages
1.2 RAID - Redundant Array of Independent Disks
No ratings yet
1.2 RAID - Redundant Array of Independent Disks
22 pages
Fredeluces Et Al. 2023. Level of Acceptance Towards AI in MSU SHS
No ratings yet
Fredeluces Et Al. 2023. Level of Acceptance Towards AI in MSU SHS
124 pages
Goodyear Truck Tires China
No ratings yet
Goodyear Truck Tires China
3 pages
What Is A BMS or Building Management System
No ratings yet
What Is A BMS or Building Management System
27 pages
ESA_2023 (1)
No ratings yet
ESA_2023 (1)
5 pages
Iso - Iec Fdis 27001 - Redline Iso - Iec Fdis 27001
No ratings yet
Iso - Iec Fdis 27001 - Redline Iso - Iec Fdis 27001
2 pages
Openeuler 24.03 LTS Technical White Paper
No ratings yet
Openeuler 24.03 LTS Technical White Paper
54 pages
Site Planning and Design
100% (5)
Site Planning and Design
79 pages
Solar Power Satellites
93% (15)
Solar Power Satellites
31 pages
13-Transistor Transceiver For Digital Kit Assembly Manual: Draft
No ratings yet
13-Transistor Transceiver For Digital Kit Assembly Manual: Draft
28 pages
Counterfeit Product Detection System Using Graphical Qrcode in Blockchain
No ratings yet
Counterfeit Product Detection System Using Graphical Qrcode in Blockchain
8 pages
SIGTRAN Protocol Analysis and Simulation: 818 West Diamond Avenue - Third Floor, Gaithersburg, MD 20878
No ratings yet
SIGTRAN Protocol Analysis and Simulation: 818 West Diamond Avenue - Third Floor, Gaithersburg, MD 20878
45 pages
Marksheet PDF
No ratings yet
Marksheet PDF
1 page
Datasheet
No ratings yet
Datasheet
5 pages
2006 Annual Report
No ratings yet
2006 Annual Report
84 pages
Full Download (Ebook) Proceedings of the 2015 Federated Conference on Software Development and Object Technologies by Jan Janech, Jozef Kostolny, Tomasz Gratkowski (eds.) ISBN 9783319465340, 9783319465357, 3319465341, 331946535X PDF DOCX
100% (8)
Full Download (Ebook) Proceedings of the 2015 Federated Conference on Software Development and Object Technologies by Jan Janech, Jozef Kostolny, Tomasz Gratkowski (eds.) ISBN 9783319465340, 9783319465357, 3319465341, 331946535X PDF DOCX
55 pages

Chap1 - Introduction To Machine Learning

Uploaded by

Chap1 - Introduction To Machine Learning

Uploaded by

Machine Learning: Introduction

Introduction to Machine Learning

Introduction to Machine Learning By Mr.

Sensor Networks Computational Simula

Introduction to Machine Learning By Mr.

– High-throughput biological data

MLhelps scientists Gene Expression Data

– in automated analysis of massive datasets

Introduction to Machine Learning By Mr. Surface Temperature of Earth

Introduction to Machine Learning By Mr.

Reducing hunger and poverty by

 Process of automatically discovering useful information in

Introduction to Machine Learning By Mr.

Introduction to Machine Learning By Mr.

Not all information discovery tasks are considered to be

Introduction to Machine Learning By Mr.

 A key component of the emerging field of data science and data-driven

 process of collecting data and methodically refining it.

Introduction to Machine Learning By Mr.

Introduction to Machine Learning By Mr.

Introduction to Machine Learning By Mr.

Introduction to Machine Learning By Mr.

Two most common Approaches

1. Sample: identify variables or factors (both dependent and

1. Business understanding – What does the business need?

Introduction to Machine Learning By Mr.

Introduction to Machine Learning By Mr.

Introduction to Machine Learning By Mr.

ation 12 Yes Divorced 220K No

Introduction to Machine Learning By Mr.

> 3 yr < 3 yr > 7 yrs < 7 yrs

Introduction to Machine Learning By Mr.

cal cal tive # years at

2 Yes High School 2 No

Introduction to Machine Learning By Mr.

 Classifying credit card transactions

Introduction to Machine Learning By Mr.

 Churn prediction for telephone customers

From [Berry & Linoff] Data Mining Techniques, 1997

 Measure image attributes (features) - 40 of them per object.

 Model the class based on these features.

 Success Story: Could find 16 new high red-shift quasars,

Introduction to Machine Learning By Mr.

Early Class: Attributes:

Introduction to Machine Learning By Mr.

 Predict a value of a given continuous valued variable

 Finding groups of objects such that the objects in a group

Introduction to Machine Learning By Mr.

Courtesy: Michael Eisen

Clusters for Raw SST and Raw NPP

Use of K-means to partition

 Measure the clustering quality by observing buying

Introduction to Machine Learning By Mr.

– Approach: To identify frequently occurring terms in each

Enron email dataset

Introduction to Machine Learning By Mr.

 Given a set of records each of which contain some

Introduction to Machine Learning By Mr.

 Telecommunication alarm diagnosis

Introduction to Machine Learning By Mr.

 An Example Subspace Differential Coexpression Pattern from

Enriched with the TNF/NFB signaling pathway

[Fang et al PSB 2010]

Introduction to Machine Learning By Mr.

 are some of the specific challenges that motivated the

Introduction to Machine Learning By Mr.

You might also like