FAQ's - Supervised Learning

Uploaded by

shreyasgawade12

AIML Online

Frequently Asked Questions in Problem Statement

Course: Supervised Learning
PART - A [30 Marks]
* Direct or Self-explanatory questions are not covered in this FAQ.

1. Data Understanding:
1 C. Compare Column names of all the 3 DataFrames and clearly write observations. [1 Mark]
→ Compare the column names of all three DataFrames. Because the datasets will be merged by rows,
the column names, their order and their data types must match. Use a simple equality comparison to
check whether all 3 DataFrames have the same column names, then write your observations from the result.
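
A minimal sketch of this check, assuming the three DataFrames are named normal, type_h and type_s. The column names f1 and f2 and the values are made up for illustration; only ‘Class’ comes from the problem statement.

```python
import pandas as pd

# Toy stand-ins for the three DataFrames (hypothetical columns and values).
normal = pd.DataFrame({"f1": [38.5], "f2": [16.9], "Class": ["Normal"]})
type_h = pd.DataFrame({"f1": [63.0], "f2": [22.5], "Class": ["Type_H"]})
type_s = pd.DataFrame({"f1": [74.3], "f2": [32.0], "Class": ["Type_S"]})

# Index.equals compares both the names and their order in one call.
same_columns = (
    normal.columns.equals(type_h.columns)
    and normal.columns.equals(type_s.columns)
)
print("Same column names in the same order:", same_columns)

# An element-wise comparison shows which positions differ, if any.
# It only works when both frames have the same number of columns.
print(normal.columns == type_h.columns)
```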

1 D. Print DataTypes of all the 3 DataFrames. [1 Mark]

→ Print the datatypes of all the 3 dataframes and write your observations.

1 E. Observe and share variation in ‘Class’ feature of all the 3 DataFrames. [1 Mark]
→ Check the ‘Class’ variable’s distribution and categories.
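
One way to inspect the categories, sketched on a toy DataFrame named normal. The label variants are the ones used as an example later in this FAQ; the real data may differ.

```python
import pandas as pd

# Toy 'Class' column containing several spellings of the same label.
normal = pd.DataFrame({"Class": ["Normal", "normal", "Nrml", "Normal"]})

# value_counts shows every distinct label and how often it occurs.
class_counts = normal["Class"].value_counts()
print(class_counts)

# unique lists the distinct labels only.
print(normal["Class"].unique())
```

Repeating the same two calls on the other two DataFrames gives the variations to report.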

2. Data Preparation and Exploration:

2 A. Unify all the variations in ‘Class’ feature for all the 3 DataFrames. [1 Mark]
→ Unify the variations reported in the previous step 1.E.
Example - If the ‘Class’ variable of the ‘normal’ dataframe has ‘Normal’, ‘normal’ or ‘Nrml’, replace them
all with ‘normal’. Similarly, check and unify the ‘Class’ values for the type_s and type_h dataframes.
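
A short sketch of the replacement, using the example labels above on a toy DataFrame. The mapping has to be adapted to the variants actually found in step 1.E.

```python
import pandas as pd

normal = pd.DataFrame({"Class": ["Normal", "normal", "Nrml", "Normal"]})

# Map every observed variant to the single label that should remain.
normal["Class"] = normal["Class"].replace({"Normal": "normal", "Nrml": "normal"})

print(normal["Class"].unique())
```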

2 B. Combine all the 3 DataFrames to form a single DataFrame [1 Mark]

→ Combine the 3 datasets into 1. As a checkpoint, the final dataframe should have 310 rows
and 7 columns.
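
A minimal sketch of a row-wise merge followed by the shape check. The toy frames below are hypothetical and much smaller than the real data, so the shape printed here is not the 310 x 7 checkpoint.

```python
import pandas as pd

normal = pd.DataFrame({"f1": [1.0, 2.0], "Class": ["normal", "normal"]})
type_h = pd.DataFrame({"f1": [3.0], "Class": ["type_h"]})
type_s = pd.DataFrame({"f1": [4.0, 5.0, 6.0], "Class": ["type_s"] * 3})

# axis=0 stacks the rows; ignore_index=True rebuilds a clean 0..n-1 index.
combined = pd.concat([normal, type_h, type_s], axis=0, ignore_index=True)

# With the real data this should print (310, 7).
print(combined.shape)
```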

Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
3. Data Analysis:
3 C. Visualize a pairplot with 3 classes distinguished by colors and share insights. [2 Marks]
→ Create a pairplot of the given variables, with the data points colored by ‘Class’ category.

4. Model Building:
4 D. Print all the possible performance metrics for both train and test data. [2 Marks]
→ Print the performance metrics of the classification model, including accuracy, precision, recall,
F1 score, etc.
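
A minimal sketch of printing these metrics for both splits with scikit-learn. The synthetic data from make_classification and the KNN model are stand-ins for the assignment's data and model.

```python
from sklearn.datasets import make_classification
from sklearn.metrics import accuracy_score, classification_report
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Synthetic 3-class data standing in for the real dataset.
X, y = make_classification(
    n_samples=300, n_features=6, n_informative=4, n_classes=3, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

model = KNeighborsClassifier().fit(X_train, y_train)

# Report the same metrics on train and test to spot over- or underfitting.
for name, X_part, y_part in [("train", X_train, y_train), ("test", X_test, y_test)]:
    y_pred = model.predict(X_part)
    print(f"--- {name} ---")
    print("accuracy:", accuracy_score(y_part, y_pred))
    # classification_report prints precision, recall and F1 per class.
    print(classification_report(y_part, y_pred))
```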

5. Performance Improvement:
5 A. Experiment with various parameters to improve performance of the base model. [2 Marks]
→ So far you have run the model with its default settings. Now tune it by changing the
parameters of KNeighborsClassifier() or the SVM function. First, explore which parameters the
models expose and check how changing them affects performance. A little research is enough for
this step. (Detailed parameter tuning will be covered in the feature engineering course.)
Reference link for hyperparameter tuning for a KNN problem -
https://medium.datadriveninvestor.com/k-nearest-neighbors-in-python-hyperparameters-tuning-716734bc557f
You can explore and tune the hyperparameters for other models too. You can also learn about grid
search and random search with cross-validation and use them.
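
As an illustration, a grid search over a few KNN parameters might look like the sketch below. The parameter values are arbitrary starting points, not recommended settings, and the data is synthetic.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = make_classification(n_samples=300, n_features=6, n_informative=4, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# Candidate values to try (illustrative only).
param_grid = {
    "n_neighbors": [3, 5, 7, 9, 11],
    "weights": ["uniform", "distance"],
    "p": [1, 2],  # 1 = Manhattan distance, 2 = Euclidean distance
}

# 5-fold cross-validation on the training data for every combination.
search = GridSearchCV(KNeighborsClassifier(), param_grid, cv=5, scoring="accuracy")
search.fit(X_train, y_train)

print("best parameters:", search.best_params_)
print("best cross-validated accuracy:", search.best_score_)
print("test accuracy of the refitted best model:", search.score(X_test, y_test))
```

RandomizedSearchCV from the same module follows the same pattern but samples a fixed number of combinations instead of trying all of them.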

PART - B [30 Marks]

1. Data Understanding and Preparation:
1 D. Change Datatype of below features to ‘Object’ [1 Mark]
‘CreditCard’, ‘InternetBanking’, ‘FixedDepositAccount’, ‘Security’, ‘Level’, ‘HiddenScore’.
[Reason behind performing this operation: Values in these features are binary, i.e. 1/0, but the DataType is
‘int’/‘float’, which is not expected.]

→ These variables are categorical, with binary or multi-class values such as 0, 1 or 1, 2, 3. Hence,
convert them to ‘Object’ type.
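
A small sketch of the conversion on a toy DataFrame. ‘CreditCard’ and ‘Level’ come from the list above; the column some_numeric_feature is hypothetical and only shows that other columns are left unchanged.

```python
import pandas as pd

df = pd.DataFrame({
    "CreditCard": [0, 1, 0],
    "Level": [1, 2, 3],
    "some_numeric_feature": [1.6, 1.5, 2.7],  # hypothetical column
})

cat_cols = ["CreditCard", "Level"]

# astype with a dict converts only the listed columns.
df = df.astype({col: "object" for col in cat_cols})

print(df.dtypes)
```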

2. Data Exploration and Analysis:

2 A. Visualize distribution of Target variable ‘LoanOnCard’ and clearly share insights. [2 Marks]
→ Draw a suitable plot to display the distribution of the Target variable.
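
Before plotting, the distribution can be read as counts and shares. The sketch below uses made-up values for ‘LoanOnCard’.

```python
import pandas as pd

# Toy target column: 9 negatives and 1 positive (made-up values).
df = pd.DataFrame({"LoanOnCard": [0] * 9 + [1]})

counts = df["LoanOnCard"].value_counts()
shares = df["LoanOnCard"].value_counts(normalize=True)

print(pd.DataFrame({"count": counts, "share": shares}))
```

The same counts can be drawn as a bar chart, for example with counts.plot(kind="bar") or seaborn's countplot.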

2 C. Check for unexpected values in each categorical variable and impute them with the best suitable
value. [2 Marks]
→ Unexpected values are values that do not belong to a feature's valid categories. For example, if
all valid values in a feature are 0/1, then ‘?’, ‘a’ and 1.5 are unexpected values that need treatment.
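
A sketch of one possible treatment, assuming the valid categories are 0 and 1 and that the most frequent valid value (the mode) is an acceptable replacement. Whether the mode is the best suitable value depends on the feature.

```python
import pandas as pd

# Toy column with three unexpected entries: '?', 1.5 and 'a'.
df = pd.DataFrame({"CreditCard": [0, 1, "?", 0, 1.5, 0, "a"]})

valid_values = [0, 1]
unexpected_mask = ~df["CreditCard"].isin(valid_values)
print("unexpected values:", df.loc[unexpected_mask, "CreditCard"].tolist())

# Impute with the mode of the valid entries only.
mode_value = df.loc[~unexpected_mask, "CreditCard"].mode()[0]
df.loc[unexpected_mask, "CreditCard"] = mode_value

print(df["CreditCard"].value_counts())
```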

3. Data Preparation and model building:

3 D. Print evaluation metrics for the model and clearly share insights. [1 Mark]
→ Print the performance metrics of the classification model, including accuracy, precision, recall,
F1 score, etc.

3 E. Balance the data using the right balancing technique. [2 Marks]

→ Target balancing can be done by upsampling the minority class, downsampling the majority class,
or using SMOTE, depending on the target distribution. A little research is enough for this task.
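
A minimal sketch of the upsampling option using sklearn.utils.resample on made-up data. SMOTE is not shown here; it is provided by the separate imbalanced-learn package.

```python
import pandas as pd
from sklearn.utils import resample

# Toy data: 8 rows of the majority class and 2 of the minority class.
df = pd.DataFrame({"feature": range(10), "LoanOnCard": [0] * 8 + [1] * 2})

majority = df[df["LoanOnCard"] == 0]
minority = df[df["LoanOnCard"] == 1]

# Sample the minority class with replacement until it matches the majority size.
minority_up = resample(
    minority, replace=True, n_samples=len(majority), random_state=0
)

balanced = pd.concat([majority, minority_up], ignore_index=True)
print(balanced["LoanOnCard"].value_counts())
```

Balancing is usually applied to the training split only, so that the test split keeps the original class distribution.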

4. Performance Improvement:
4 A. Train a base model each for SVM, KNN. [4 Marks]
→ Build a base model for each algorithm on the balanced data, without tuning any parameters.
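
A sketch of two base models with default parameters, trained on synthetic data that stands in for the balanced dataset.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC

X, y = make_classification(n_samples=300, n_features=6, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# Default constructors: no parameters are tuned at this stage.
base_models = {"SVM": SVC(), "KNN": KNeighborsClassifier()}

base_scores = {}
for name, model in base_models.items():
    model.fit(X_train, y_train)
    base_scores[name] = model.score(X_test, y_test)  # accuracy on the test split
    print(name, "test accuracy:", base_scores[name])
```

Both SVM and KNN are sensitive to feature scale, so scaling the features first (for example with StandardScaler) is worth considering.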

4 B. Tune parameters for each of the models wherever required and finalize a model. [3 Marks]
(Optional: Experiment with various Hyperparameters - Research required)
→ Tune the parameters as performed in Part A, Question 5 A.

You can tune the model by changing the parameters of KNeighborsClassifier() or the SVM function.
First, explore which parameters the models expose and check how changing them affects
performance. A little research is enough for this step. (Detailed parameter tuning will be
covered in the feature engineering course.)
Reference link for hyperparameter tuning for a KNN problem -
https://medium.datadriveninvestor.com/k-nearest-neighbors-in-python-hyperparameters-tuning-716734bc557f
You can explore and tune the hyperparameters for other models too.

4 C. Print evaluation metrics for final model. [1 Mark]

→ Print the performance metrics of the final model, including accuracy, precision, recall, F1 score, etc.

4 D. Share improvement achieved from base model to final model. [2 Marks]

→ Show the performance improvement of the finalized model by comparing the performance reports of
its base and final versions.
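
One way to present the comparison is a small table with one row per model version and a row for the difference. In the sketch below, the data is synthetic and the "final" parameters are hypothetical, not tuned values.

```python
import pandas as pd
from sklearn.datasets import make_classification
from sklearn.metrics import accuracy_score, f1_score, precision_score, recall_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = make_classification(n_samples=300, n_features=6, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

models = {
    "base": KNeighborsClassifier(),
    # Hypothetical tuned settings, for illustration only.
    "final": KNeighborsClassifier(n_neighbors=9, weights="distance"),
}

rows = {}
for name, model in models.items():
    y_pred = model.fit(X_train, y_train).predict(X_test)
    rows[name] = {
        "accuracy": accuracy_score(y_test, y_pred),
        "precision": precision_score(y_test, y_pred),
        "recall": recall_score(y_test, y_pred),
        "f1": f1_score(y_test, y_pred),
    }

comparison = pd.DataFrame(rows).T
# Positive values mean the final model scored higher than the base model.
comparison.loc["improvement"] = comparison.loc["final"] - comparison.loc["base"]
print(comparison)
```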
