0% found this document useful (0 votes)

29 views52 pages

Supersived Machine Learning

The document outlines the schedule for meetings after the midterm exam for an Artificial Intelligence course. It includes the topics, teaching methods, and time allocation for each of the 9 meetings between weeks 9-16. The topics progress from classification and clustering to decision trees, neural networks, machine learning algorithms for classification like logistic regression, random forests, and naive bayes. It allocates a total of 18 hours of instruction time over the 8 weeks.

Uploaded by

farhan yutub

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views52 pages

Supersived Machine Learning

Uploaded by

farhan yutub

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 52

Sistem Cerdas (TIF 150702)

M. Angga Gumilang
Rencana Pertemuan Setalah UTS

Minggu Ke Materi Metode Waktu

9 Klasifikasi dan Clustering Praktisi Mengajar 4 Jam
10 Logika Fuzzy & Sistem Pakar Praktisi Mengajar 4 Jam
11 Decision Tree & Jaringan Syaraf Tiruan Praktisi Mengajar 4 Jam
12 Machine Learning untuk Klasifikasi Ceramah 1 Jam
13 Logistic Regression & Decision Tree Diskusi 1 Jam
14 Random Forest & Support Vector Machine (SVM) Diskusi 1 Jam
15 K Nearest Neighbour (KNN) dan Naïve Bayes Diskusi 1 Jam
16 Ujian Akhir Semester Soal Subjektif 2 Jam
Total Waktu Pembelajaran 18 Jam
Machine Learning
Dan Penerapannya untuk Klasifikasi
Outline

● Konsep Machine Learning

● Algoritma Machine Learning
● Contoh Studi Kasus Klasifikasi
● Logistic Regression & Decision Tree
● Random Forest & Support Vector Machine (SVM)
● K Nearest Neighbour (KNN) dan Naïve Bayes
Konsep Machine
Learning
Konsep Machine Learning
Machine Learning = Algorithm + Math (Statistics)
Mind Map Machine Learning
Machine Learning in Classification
How Machine Learning Works ?
Supervised vs
Unsupervised
Learning
● The easiest way to
distinguish a
supervised learning
and unsupervised
learning is to see
whether the data is
labelled or not.
Unsupervised Learning
Confuse Machine Learning ?
Reinforcement Learning
Algoritma
Machine Learning
Machine Learning
Machine Learning ?
1. Logistic
Regression
● Logistics regression uses
sigmoid function above to return
the probability of a label. It is
widely used when the
classification problem is binary
— true or false, win or lose,
positive or negative ...

● The sigmoid function generates

a probability output. By
comparing the probability with a
pre-defined threshold, the object
is assigned to a label
accordingly.
Logistic Regression Illustration
Logistic Regression Code Snippets

● Kode Program dan Penjelasan Lebih Detail :

https://towardsdatascience.com/tuning-the-hyperparameters-of-your-machine-learning-model-using-
gridsearchcv-7fc2bb76ff27

logistic regression common hyperparameters: penalty, max_iter, C, solver

2. Decision Tree
● Decision tree builds tree
branches in a hierarchy
approach and each
branch can be considered
as an if-else statement.
The branches develop by
partitioning the dataset
into subsets based on
most important features.
Final classification
happens at the leaves of
the decision tree.
Decision Tree Illustration
Decision Tree Code Snippets

● Penjelasan Lebih Lanjut : https://towardsdatascience.com/how-to-tune-a-decision-tree-

f03721801680

● decision tree common hyperparameters: criterion, max_depth, min_samples_split,

min_samples_leaf; max_features
3. Random Forest
● Random forest is a collection of
decision trees. It is a common
type of ensemble methods
which aggregate results from
multiple predictors. Random
forest additionally utilizes
bagging technique that allows
each tree trained on a random
sampling of original dataset and
takes the majority vote from
trees.
● Compared to decision tree, it
has better generalization but
less interpretable, because of
more layers added to the model.
Random Forest Illustration
Random Forest Code Snippets

● decision tree common hyperparameters: criterion, max_depth, min_samples_split,

min_samples_leaf; max_features
● https://towardsdatascience.com/how-to-tune-a-decision-tree-f03721801680
4. Support Vector
Machine (SVM)
● Support vector machine finds
the best way to classify the data
based on the position in relation
to a border between positive
class and negative class. This
border is known as the
hyperplane which maximize the
distance between data points
from different classes.
● Similar to decision tree and
random forest, support vector
machine can be used in both
classification and regression,
SVC (support vector classifier)
is for classification problem.
SVM Illustration
SVM Code Snippets

● support vector machine common hyperparameters: c, kernel, gamma

● https://www.vebuso.com/2020/03/svm-hyperparameter-tuning-using-gridsearchcv/
5. K-Nearest
Neighbour (KNN)
● You can think of k nearest
neighbour algorithm as
representing each data point in
a n dimensional space — which
is defined by n features. And it
calculates the distance between
one point to another, then
assign the label of unobserved
data based on the labels of
nearest observed data points.

● KNN can also be used for

building recommendation
system,
KNN Illustrations

KNN has three basic steps.

1. Calculate the distance.

2. Find the k nearest

neighbours.

3. Vote for classes

KNN Code Snippets

● KNN common hyperparameters: n_neighbors, weights, leaf_size, p

● More detailed : https://towardsdatascience.com/knn-visualization-in-just-
13-lines-of-code-32820d72c6b6
6. Naïve Bayes

● Naive Bayes is based on Bayes’

Theorem — an approach to
calculate conditional probability
based on prior knowledge, and
the naive assumption that each
feature is independent to each
other.
● The biggest advantage of Naive
Bayes is that, while most
machine learning algorithms rely
on large amount of training data,
it performs relatively well even
when the training data size is
small. Gaussian Naive Bayes is
a type of Naive Bayes classifier
that follows the normal
distribution.
Naïve Bayes illustration

https://ranasinghiitkgp.medium.com/mathematic-behind-naive-bayes-and-its-application-9ec8cc4f0a91
Naïve Bayes Code Snippets

● gaussian naive bayes common hyperparameters: priors, var_smoothing

● https://www.analyticsvidhya.com/blog/2021/01/gaussian-naive-bayes-with-hyperpameter-tuning/
Case Study
(an Example)
1. Loading Dataset and Data Overview

● I chose the popular dataset Heart Disease UCI on Kaggle for predicting the presence of heart disease
based on several health related factors.
● https://www.kaggle.com/ronitf/heart-disease-uci
1. Loading Dataset
and Data Overview
● Use df.info()to have a
summarized view of dataset,
including data type, missing
data and number of records.
2. Exploratory Data
Analysis (EDA)
● Histogram, grouped
bar chart and box
plot are suitable EDA
techniques for
classification
machine learning
algorithms.
● Univariate Analysis
Categorical Features vs. Target — Grouped Bar Chart
Numerical Features vs. Target — Box Plot
3. Split Dataset into Training and Testing Set
4. Machine Learning Model Pipeline
5. Model Evaluation

Below is an abstraction explanation of commonly used evaluation methods for

classification models — accuracy, ROC & AUC and confusion matrix.
Accuracy Results
Confusion Matrix
Accuracy and Confusion Matrix
Some useful References

● https://destingong.medium.com/list/practical-guides-to-machine-learning-
a877c2a39884
● https://www.kaggle.com/
● https://towardsdatascience.com/top-machine-learning-algorithms-for-
classification-2197870ff501
● https://repository.unimal.ac.id/6707/1/Machine%20Learning.pdf (Ebook)
● https://wiragotama.github.io/resources/ebook/intro-to-ml-secured.pdf
(Ebook)
● https://scikit-learn.org/stable/ (Sklearn Documentation)
Mini Project (Tugas Kelompok)
Instruksi Mini Project (Tugas Kelompok)

● Bagi jumlah anggota dalam satu golongan menjadi 6 (enam) Kelompok !

● Bagi Topik bahasan berikut di setiap kelompok
1. Logistic Regression
2. Decision Tree
3. Random Forest
4. Support Vector Machine (SVM)
5. K Nearest Neighbour (KNN)
6. Naïve Bayes
Instruksi Mini Project (Tugas Kelompok) -2

1. Carilah Sebuah Dataset di Kaggle / Laman web lain yang cocok untuk
dipecahkan dengan Topik yang dipilih
2. Buat Pemodelan Classification sesuai dengan topik yang dipilih (Bahasa
pemrograman dan IDE bebas, yang direkomendasikan : Python dan
SKLearn).
3. Tulis Kembali hasil pemecahan studi kasus, pemodelan, dan analisis ke
dalam laman website Medium / LinkedIn, kumpulkan assignment di
elearning.
Contoh Sistematika Penulisan di Medium

1. Introduction : kenapa mengambil studi kasus tersebut ?

2. Dataset Overview : Bagaimana Sample Dataset yang telah didapat ?
3. Explanatory Data Analysis (EDA)
4. Splitting Dataset for Modelling Classification
5. Machine Learning Implementation
6. Model Evaluation
7. Conclusion : apakah berhasil memecahkan masalah ?
8. Referensi : Tulis seluruh artikel, website, dataset, dan seluruh sumber yang
anda gunakan !

Machine Learning Business Report
75% (55)
Machine Learning Business Report
60 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
167 pages
UNIT II 2.1 ML Decision Tree Learning
No ratings yet
UNIT II 2.1 ML Decision Tree Learning
55 pages
Unit 3,4,5 ML (CS - AI)
No ratings yet
Unit 3,4,5 ML (CS - AI)
37 pages
6th - SEM Machine Learning Notes PDF
100% (1)
6th - SEM Machine Learning Notes PDF
36 pages
Chapter 2,3,4
No ratings yet
Chapter 2,3,4
8 pages
ML Ch-2 Supervised Learning
No ratings yet
ML Ch-2 Supervised Learning
23 pages
Machine Learning Classification Bootcamp Cheatsheet
No ratings yet
Machine Learning Classification Bootcamp Cheatsheet
7 pages
ML Unit-Ii Notes
No ratings yet
ML Unit-Ii Notes
17 pages
CZ4032 Data Analytics & Mining Notes
No ratings yet
CZ4032 Data Analytics & Mining Notes
16 pages
U02Lecture08 Statistical Machine Learning
No ratings yet
U02Lecture08 Statistical Machine Learning
41 pages
UCS551 Chapter 6 - Classification
No ratings yet
UCS551 Chapter 6 - Classification
20 pages
Supervised Learning
No ratings yet
Supervised Learning
30 pages
Algorithms For ML
No ratings yet
Algorithms For ML
3 pages
Decision Tree
No ratings yet
Decision Tree
16 pages
Interview AI Algo
No ratings yet
Interview AI Algo
3 pages
ML Classification Techniques
No ratings yet
ML Classification Techniques
6 pages
Machine Learning
100% (6)
Machine Learning
115 pages
Machine Learning - Iii
No ratings yet
Machine Learning - Iii
53 pages
MLP U2
No ratings yet
MLP U2
7 pages
Machine Learning Algorithms
No ratings yet
Machine Learning Algorithms
14 pages
DL
No ratings yet
DL
10 pages
Supervised Learning
No ratings yet
Supervised Learning
71 pages
11 W11NSE6220 - Fall 2023 - Zeng
No ratings yet
11 W11NSE6220 - Fall 2023 - Zeng
43 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
21 pages
Unit-5 MECH 3-2
No ratings yet
Unit-5 MECH 3-2
14 pages
Chatgpt Unit - 3
No ratings yet
Chatgpt Unit - 3
4 pages
ML Unit4
No ratings yet
ML Unit4
10 pages
Unit 3
No ratings yet
Unit 3
61 pages
Assignment 2
No ratings yet
Assignment 2
111 pages
Mod 7 Smote ML
No ratings yet
Mod 7 Smote ML
40 pages
Pa Unit-Iii
No ratings yet
Pa Unit-Iii
75 pages
Classification
No ratings yet
Classification
34 pages
Unit 4 Introduction To Algorithm
No ratings yet
Unit 4 Introduction To Algorithm
10 pages
Machine Learning Algorithms Laiki
No ratings yet
Machine Learning Algorithms Laiki
123 pages
Spam Not Spam
No ratings yet
Spam Not Spam
7 pages
Bike Buyer Prediction Using Classification Algorithm
No ratings yet
Bike Buyer Prediction Using Classification Algorithm
19 pages
Module 5
No ratings yet
Module 5
5 pages
Chapter Four
No ratings yet
Chapter Four
75 pages
Non Parametric Methods 8
No ratings yet
Non Parametric Methods 8
23 pages
Session 5
No ratings yet
Session 5
36 pages
Machine Learning Notes ?
No ratings yet
Machine Learning Notes ?
14 pages
Module Iii
No ratings yet
Module Iii
15 pages
Anuraag Rath MBA Dissertation
100% (1)
Anuraag Rath MBA Dissertation
74 pages
ML Unit 2
No ratings yet
ML Unit 2
37 pages
Unit 3 Ds
No ratings yet
Unit 3 Ds
10 pages
191IT7310Machine LearningQB
No ratings yet
191IT7310Machine LearningQB
27 pages
Machine Learning
No ratings yet
Machine Learning
32 pages
ML UNIT 2 Sir
No ratings yet
ML UNIT 2 Sir
46 pages
ML ModuleUntitled 2
No ratings yet
ML ModuleUntitled 2
8 pages
Supervised ML Algorithms
No ratings yet
Supervised ML Algorithms
9 pages
Chapter5 - Machine Learning
No ratings yet
Chapter5 - Machine Learning
37 pages
ML Assignment 2 PDF
No ratings yet
ML Assignment 2 PDF
9 pages
ML Models
No ratings yet
ML Models
21 pages
Module 1 & 2
No ratings yet
Module 1 & 2
21 pages
Accelerated Data Science Introduction To Machine Learning Algorithms
No ratings yet
Accelerated Data Science Introduction To Machine Learning Algorithms
37 pages
Interview Preparing - ML Draft
No ratings yet
Interview Preparing - ML Draft
12 pages
Classification
No ratings yet
Classification
4 pages
Report of Comparing 5 Classification Algorithms of Machine Learning PDF
No ratings yet
Report of Comparing 5 Classification Algorithms of Machine Learning PDF
4 pages
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
No ratings yet
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
12 pages
ML Notes
No ratings yet
ML Notes
10 pages
ML - Interview Prep
No ratings yet
ML - Interview Prep
9 pages
3.popular Machine Learning Algorithm
No ratings yet
3.popular Machine Learning Algorithm
11 pages
Scodeen Global Python DS - ML - Django Syllabus Version 13
No ratings yet
Scodeen Global Python DS - ML - Django Syllabus Version 13
22 pages
Human3 6m
No ratings yet
Human3 6m
37 pages
Assignment 0.2
No ratings yet
Assignment 0.2
8 pages
Vor-Tree: R-Trees With Voronoi Diagrams For Efficient Processing of Spatial Nearest Neighbor Queries
No ratings yet
Vor-Tree: R-Trees With Voronoi Diagrams For Efficient Processing of Spatial Nearest Neighbor Queries
12 pages
A Novel PMU Fog Based Early Anomaly Detection For An Efficient Wide Area PMU Network
No ratings yet
A Novel PMU Fog Based Early Anomaly Detection For An Efficient Wide Area PMU Network
10 pages
Machine Learning Algorithms For Breast Cancer Prediction
No ratings yet
Machine Learning Algorithms For Breast Cancer Prediction
8 pages
Does BERT Make Any Sense? Interpretable Word Sense Disambiguation With Contextualized Embeddings
No ratings yet
Does BERT Make Any Sense? Interpretable Word Sense Disambiguation With Contextualized Embeddings
10 pages
Credit Card Fraud Detection Using A Deep Learning Multistage Model
No ratings yet
Credit Card Fraud Detection Using A Deep Learning Multistage Model
26 pages
36-708 Statistical Machine Learning Homework #3 Solutions: DUE: March 29, 2019
No ratings yet
36-708 Statistical Machine Learning Homework #3 Solutions: DUE: March 29, 2019
22 pages
Voice Recognition System Using Machine L
No ratings yet
Voice Recognition System Using Machine L
7 pages
A Strategy For Automatically Extracting References From PDF Documents
No ratings yet
A Strategy For Automatically Extracting References From PDF Documents
6 pages
PDS Imp
No ratings yet
PDS Imp
43 pages
Fish V8N2P8 2015 - Ajocict 1
No ratings yet
Fish V8N2P8 2015 - Ajocict 1
8 pages
An Ensemble Method For Phishing Websites Detection Based On XGBoost
No ratings yet
An Ensemble Method For Phishing Websites Detection Based On XGBoost
6 pages
Intelligent Control of Robotic Arm Using Brain Computer Interface and Artificial Intelligence
No ratings yet
Intelligent Control of Robotic Arm Using Brain Computer Interface and Artificial Intelligence
14 pages
ML Project Report-1
No ratings yet
ML Project Report-1
34 pages
ML Unit3
No ratings yet
ML Unit3
21 pages
Final Unit 4
No ratings yet
Final Unit 4
107 pages
Fire Alarm System Through Smoke Detectio
No ratings yet
Fire Alarm System Through Smoke Detectio
4 pages
ACSML0502
No ratings yet
ACSML0502
4 pages
Uddin Et Al (2023)
No ratings yet
Uddin Et Al (2023)
21 pages
Paper 2
No ratings yet
Paper 2
19 pages
Diabetes Prediction Using Machine Learning
No ratings yet
Diabetes Prediction Using Machine Learning
12 pages
AI For Earthquake Prediction
No ratings yet
AI For Earthquake Prediction
14 pages
Arora 2019
No ratings yet
Arora 2019
29 pages
Sheet1 1
No ratings yet
Sheet1 1
2 pages
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet

Supersived Machine Learning

Uploaded by

Supersived Machine Learning

Uploaded by

Sistem Cerdas (TIF 150702)

Minggu Ke Materi Metode Waktu

● Konsep Machine Learning

● The sigmoid function generates

● Kode Program dan Penjelasan Lebih Detail :

logistic regression common hyperparameters: penalty, max_iter, C, solver

● Penjelasan Lebih Lanjut : https://towardsdatascience.com/how-to-tune-a-decision-tree-

● decision tree common hyperparameters: criterion, max_depth, min_samples_split,

● decision tree common hyperparameters: criterion, max_depth, min_samples_split,

● support vector machine common hyperparameters: c, kernel, gamma

● KNN can also be used for

KNN has three basic steps.

2. Find the k nearest

3. Vote for classes

● KNN common hyperparameters: n_neighbors, weights, leaf_size, p

● Naive Bayes is based on Bayes’

● gaussian naive bayes common hyperparameters: priors, var_smoothing

Below is an abstraction explanation of commonly used evaluation methods for

● Bagi jumlah anggota dalam satu golongan menjadi 6 (enam) Kelompok !

1. Introduction : kenapa mengambil studi kasus tersebut ?

You might also like