Assignment 2
1. The logistic function, also known as the sigmoid function, is used in logistic regression to compute probabilities. It maps any input value to a value between 0 and 1, which can be interpreted as the probability of the input belonging to a certain class.
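For illustration, a minimal NumPy sketch of the logistic function, with made-up inputs:

import numpy as np

def sigmoid(z):
    # Maps any real-valued input to the interval (0, 1).
    return 1.0 / (1.0 + np.exp(-z))

print(sigmoid(0.0))  # 0.5, the decision boundary in logistic regression
print(sigmoid(4.0))  # ~0.982, strongly positive inputs approach 1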
2. The criterion commonly used to split nodes in decision tree construction is information gain. It is calculated by subtracting the weighted average of the entropies of the child nodes from the entropy of the parent node (see the sketch after item 3).
3. Entropy is a measure of impurity in a set of examples. Information gain is the reduction in entropy achieved by partitioning the examples based on a certain attribute. In decision tree construction, the attribute with the highest information gain is chosen as the splitting criterion.
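To make items 2 and 3 concrete, a small NumPy sketch of entropy and information gain; the labels and the split below are made up for illustration:

import numpy as np

def entropy(labels):
    # Shannon entropy of a label array, in bits.
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

def information_gain(parent, children):
    # Parent entropy minus the size-weighted average of child entropies.
    n = len(parent)
    weighted = sum(len(c) / n * entropy(c) for c in children)
    return entropy(parent) - weighted

parent = np.array([0, 0, 0, 1, 1, 1])
children = [np.array([0, 0, 0]), np.array([1, 1, 1])]  # a perfect split
print(information_gain(parent, children))  # 1.0 bit: entropy drops to zero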
4. The random forest algorithm utilizes bagging and feature randomization to improve classification accuracy. Bagging involves training multiple decision trees on bootstrap samples of the training data, while feature randomization involves considering only a random subset of the features at each split.
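A minimal sketch using scikit-learn on synthetic data; the n_estimators and max_features values are illustrative choices, not prescribed ones:

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=200, n_features=10, random_state=0)

# Each tree is fit on a bootstrap sample of the data (bagging); max_features
# limits how many randomly chosen features are considered at each split.
clf = RandomForestClassifier(n_estimators=100, max_features="sqrt", random_state=0)
clf.fit(X, y)
print(clf.score(X, y))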
5. The distance metric typically used in k-nearest neighbors (KNN) classification is the Euclidean distance. It measures the straight-line distance between two points in a multidimensional space. The choice of distance metric can impact the algorithm's performance.
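A minimal NumPy sketch of the Euclidean distance, with made-up points; scikit-learn's KNeighborsClassifier uses this metric by default (Minkowski distance with p = 2):

import numpy as np

def euclidean(a, b):
    # Straight-line distance between two points in d-dimensional space.
    return np.sqrt(np.sum((a - b) ** 2))

print(euclidean(np.array([0.0, 0.0]), np.array([3.0, 4.0])))  # 5.0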
6. The Naïve Bayes assumption of feature independence states that the features used for classification are conditionally independent given the class label. This assumption simplifies the computation of the posterior probability of a class given a set of features.
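A small sketch using scikit-learn's GaussianNB, which models each feature independently given the class, exactly the Naïve Bayes assumption; the tiny dataset is made up for illustration:

import numpy as np
from sklearn.naive_bayes import GaussianNB

X = np.array([[1.0, 2.0], [1.2, 1.9], [3.0, 4.1], [3.2, 3.8]])
y = np.array([0, 0, 1, 1])

clf = GaussianNB().fit(X, y)
print(clf.predict_proba([[1.1, 2.0]]))  # posterior P(class | features)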
7. The kernel function in SVMs implicitly maps the input data into a higher-dimensional space where it is easier to separate the classes. Some commonly used kernel functions are the linear kernel, the polynomial kernel, and the radial basis function (RBF) kernel.
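A small scikit-learn sketch on synthetic concentric circles, a dataset that is not linearly separable in the input space:

from sklearn.datasets import make_circles
from sklearn.svm import SVC

X, y = make_circles(n_samples=200, factor=0.3, noise=0.05, random_state=0)

linear = SVC(kernel="linear").fit(X, y)
rbf = SVC(kernel="rbf").fit(X, y)
# The RBF kernel separates the circles; the linear kernel cannot.
print(linear.score(X, y), rbf.score(X, y))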
8. The bias-variance tradeoff refers to the tension between underfitting and overfitting as model complexity grows. A model with high bias (e.g., a linear model) may underfit the data, while a model with high variance (e.g., a very complex model) may overfit it. The goal is to find a model with an appropriate balance between bias and variance.
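One way to see the tradeoff is to vary model complexity and compare cross-validated scores; the synthetic data and the polynomial degrees below are illustrative assumptions:

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(30, 1))
y = np.sin(X).ravel() + rng.normal(scale=0.2, size=30)

for degree in (1, 4, 15):  # high bias, balanced, high variance
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    print(degree, cross_val_score(model, X, y, cv=5).mean())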
9. TensorFlow facilitates the creation and training of neural networks by providing a high-level API for building and training models. It also includes a variety of pre-built neural network layers and activation functions.
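A minimal sketch of the high-level Keras API in TensorFlow; the layer sizes and activations are arbitrary illustrative choices:

import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.Input(shape=(4,)),
    tf.keras.layers.Dense(32, activation="relu"),    # pre-built layer and activation
    tf.keras.layers.Dense(3, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.summary()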
10. Cross-validation is a technique used to evaluate model performance by repeatedly partitioning the data into training and validation sets: the model is trained on all but one subset and evaluated on the held-out subset, rotating until every subset has served as the validation set. Cross-validation is important for detecting overfitting and selecting the best model.
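A minimal scikit-learn sketch of 5-fold cross-validation on the Iris dataset:

from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)

# Train on four folds, validate on the held-out fold, and rotate
# so every example is used for validation exactly once.
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=5)
print(scores.mean(), scores.std())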
11. Techniques that can be employed to handle overfitting in machine learning models include regularization, early stopping, and dropout. Regularization involves adding a penalty term to the loss function to discourage overfitting, while early stopping involves halting the training process when the validation error stops improving. Dropout involves randomly dropping out some neurons during training to prevent over-reliance on certain features.
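A sketch combining all three techniques in Keras; the architecture, penalty strength, and synthetic data are illustrative assumptions:

import numpy as np
import tensorflow as tf

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 20)).astype("float32")
y = (X[:, 0] + X[:, 1] > 0).astype("float32")

model = tf.keras.Sequential([
    tf.keras.Input(shape=(20,)),
    tf.keras.layers.Dense(64, activation="relu",
                          kernel_regularizer=tf.keras.regularizers.l2(1e-3)),  # regularization
    tf.keras.layers.Dropout(0.5),  # dropout
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy")

# Early stopping halts training once the validation loss stops improving.
stop = tf.keras.callbacks.EarlyStopping(patience=5, restore_best_weights=True)
model.fit(X, y, validation_split=0.2, epochs=100, callbacks=[stop], verbose=0)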
12. The purpose of regularization in machine learning is to prevent overfitting by adding a penalty term to the loss function. The penalty term encourages the model to have smaller weights, which can help prevent over-reliance on certain features.
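A small scikit-learn sketch contrasting ordinary least squares with L2-regularized (ridge) regression on made-up data; the alpha value is an arbitrary illustrative choice:

import numpy as np
from sklearn.linear_model import LinearRegression, Ridge

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 10))
y = X[:, 0] + rng.normal(scale=0.5, size=50)

ols = LinearRegression().fit(X, y)
ridge = Ridge(alpha=10.0).fit(X, y)  # L2 penalty on the weights

# The penalized model's weights are pulled toward zero.
print(np.abs(ols.coef_).sum(), np.abs(ridge.coef_).sum())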
13. Hyper-parameters are parameters that are set before training and are not learned from the data. They include settings such as the learning rate, the regularization strength, and the number of hidden layers in a neural network. Hyper-parameters are tuned for optimal performance using techniques such as grid search or random search.
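A minimal grid-search sketch with scikit-learn; the parameter grid is an illustrative assumption:

from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

# C and gamma are hyper-parameters: fixed before training, not learned from the data.
grid = GridSearchCV(SVC(), {"C": [0.1, 1, 10], "gamma": [0.01, 0.1, 1]}, cv=5)
grid.fit(X, y)
print(grid.best_params_, grid.best_score_)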
14. Precision and recall are metrics used to evaluate the performance of a classification model. Precision measures the proportion of true positives among all positive predictions, while recall measures the proportion of true positives among all actual positives. Accuracy measures the proportion of correct predictions among all predictions.
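A small sketch using scikit-learn's metric functions, with made-up predictions:

from sklearn.metrics import accuracy_score, precision_score, recall_score

y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0]

print(precision_score(y_true, y_pred))  # TP / (TP + FP) = 2/3
print(recall_score(y_true, y_pred))     # TP / (TP + FN) = 2/3
print(accuracy_score(y_true, y_pred))   # 6 of 8 predictions correct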
15. The ROC curve is a graphical representation of the performance of a binary classifier. It plots the true positive rate (TPR) against the false positive rate (FPR) for different threshold values. The area under the ROC curve (AUC) is a commonly used metric for evaluating the performance of a binary classifier.
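A small scikit-learn sketch with made-up labels and scores:

from sklearn.metrics import roc_auc_score, roc_curve

y_true = [0, 0, 1, 1, 0, 1]
y_score = [0.1, 0.4, 0.35, 0.8, 0.2, 0.7]  # predicted probabilities

fpr, tpr, thresholds = roc_curve(y_true, y_score)
print(list(zip(fpr, tpr)))             # one (FPR, TPR) point per threshold
print(roc_auc_score(y_true, y_score))  # area under the ROC curve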