0% found this document useful (0 votes)

35 views9 pages

Data Science MCQs

The document contains a set of multiple-choice questions (MCQs) related to data science concepts, focusing on data preprocessing, feature scaling, handling missing values, and techniques for dealing with class imbalance. Each question presents four options, from which the correct answer must be selected. The total time for completing the MCQs is 30 minutes, and the total marks available are 30.

Uploaded by

ganesh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

35 views9 pages

Data Science MCQs

Uploaded by

ganesh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Data Science MCQs

Time: 30 Minutes Marks: 30

[email protected] Switch account Saving…

* Indicates required question

Email *

[email protected]

Which of the following is a method for dealing with high cardinality categorical *
variables?

A) One-hot encoding

B) Min-Max Scaling

C) Frequency encoding

D) Imputation

What is the purpose of standardization in data preprocessing? *

A) To scale data to a range of 0 to 1

B) To remove outliers from the dataset

C) To ensure the data has a mean of 0 and a standard deviation of 1

D) To handle missing values

How does feature scaling help in machine learning models like k-NN or SVM? *

A) It reduces the dataset size

B) It prevents overfitting

C) It ensures that features contribute equally to distance calculations

D) It improves model interpretability

Which of the following is NOT a common technique to handle class imbalance in *
a dataset?

A) Oversampling

B) Undersampling

C) One-hot encoding

D) Synthetic data generation (SMOTE)

Why is it important to shuffle data before splitting it into training and testing sets? *

A) To remove outliers

B) To avoid any bias due to the order of the data

C) To improve model accuracy

D) To reduce dimensionality

What type of scaling method would you use for features that follow a normal *
distribution?

A) Min-Max scaling

B) Z-score normalization

C) One-hot encoding

D) Log scaling

Which data preprocessing step is essential when dealing with categorical *

variables in a linear regression model?

A) One-hot encoding

B) Min-Max Scaling

C) Log transformation

D) Imputation
Which technique is commonly used to handle missing data? *

A) One-hot encoding

B) Imputation

C) Dimensionality reduction

D) PCA

Which of the following can cause a machine learning model to perform poorly? *

A) Feature scaling

B) Feature engineering

C) Irrelevant or redundant features

D) Data splitting

Which of the following is NOT a data preprocessing step? *

A) Data normalization

B) Data augmentation

C) Model evaluation

D) Missing value imputation

Which of the following methods can be used to detect outliers in a dataset? *

A) Min-Max Scaling

B) Z-Score Method

C) One-hot encoding

D) Imputation
One-hot encoding is typically applied to which type of data? *

A) Numerical data

B) Ordinal data

C) Categorical data

D) Continuous data

In which situation would you apply dimensionality reduction techniques like PCA? *

A) When the dataset contains missing values

B) When the dataset contains a large number of correlated features

C) When you want to remove outliers

D) When the dataset has no categorical variables

Which of the following is a method to reduce overfitting in decision trees? *

A) Feature scaling

B) Pruning

C) Z-Score normalization

D) One-hot encoding

Which of the following is used to deal with multicollinearity in regression *

problems?

A) Standardization

B) L2 Regularization

C) One-hot encoding

D) Min-Max scaling
What is the result of applying Principal Component Analysis (PCA) on a dataset? *

A) Reduced number of features while retaining as much variance as possible

B) Elimination of duplicate rows in the dataset

C) Removal of outliers

D) Increase in the number of features

Which of the following is an example of feature extraction? *

A) Scaling numeric features

B) Transforming categorical data into numerical format

C) Using PCA to reduce feature dimensionality

D) Filling missing values in the dataset

Which of the following can be used to fill missing numerical values? *

A) Mean, median, or mode

B) One-hot encoding

C) PCA

D) Z-Score

What does it mean if a dataset is said to be “highly imbalanced”? *

A) The dataset contains a large number of features

B) One or more classes occur much more frequently than others

C) The dataset has many missing values

D) The dataset contains outliers

Which technique reduces the dimensionality of a dataset by creating new *
features based on the old ones?

A) Feature scaling

B) Feature selection

C) Feature extraction

D) Data augmentation

What is the main reason for splitting a dataset into training and testing sets? *

A) To improve the performance of the model

B) To prevent overfitting

C) To assess the model’s generalization ability

D) To generate more data

Min-Max scaling transforms the data by bringing all values between: *

A) 0 and 10

B) -1 and 1

C) 0 and 1

D) -10 and 10

How does SMOTE (Synthetic Minority Over-sampling Technique) handle *

imbalanced datasets?

A) By undersampling the majority class

B) By oversampling the majority class

C) By generating synthetic examples for the minority class

D) By removing outliers in the dataset

What is the primary goal of feature selection? *

A) To remove noise from the data

B) To select features that have the most impact on the target variable

C) To standardize features

D) To impute missing data

Which of the following is NOT a common strategy for dealing with missing data? *

A) Deleting rows with missing values

B) Filling missing values with zeros

C) Filling missing values using a machine learning model

D) Ignoring the missing values during training

What is the main purpose of data preprocessing in machine learning? *

A) To create new features

B) To improve the quality of the data

C) To discard irrelevant data

D) To balance the dataset

What is the primary function of the Box-Cox transformation? *

A) To reduce the number of features

B) To normalize a distribution to make it more Gaussian

C) To handle missing values

D) To encode categorical variables

When would you apply log transformation to a feature in a dataset? *

A) When the feature contains negative values

B) When the feature has a normal distribution

C) When the feature has a skewed distribution

D) When the feature is categorical

Which of the following is NOT a characteristic of robust scaling? *

A) It uses the median for centering the data

B) It scales the data based on percentiles

C) It is highly sensitive to outliers

D) It handles data with outliers better than Min-Max scaling

What is the main purpose of data normalization? *

A) To reduce the number of features

B) To encode categorical variables

C) To scale numeric features to a common range

D) To remove missing values

Submit Page 1 of 1 Clear form

This form was created inside of Indian Institute of Information Technology, Nagpur. Report Abuse

Forms

Huawei Final Written Exam
50% (2)
Huawei Final Written Exam
18 pages
Applied Data Science Questions
No ratings yet
Applied Data Science Questions
15 pages
14S Operator Manual
100% (1)
14S Operator Manual
106 pages
Tenses - Ready Reckoner: Tense Affirmative/Negative/Question Use Signal Words
100% (2)
Tenses - Ready Reckoner: Tense Affirmative/Negative/Question Use Signal Words
7 pages
MCQ 3 Aiml
No ratings yet
MCQ 3 Aiml
2 pages
Quiz 2
No ratings yet
Quiz 2
8 pages
Software Dev
No ratings yet
Software Dev
3 pages
Prelims
No ratings yet
Prelims
3 pages
Data Science Quiz Questions
No ratings yet
Data Science Quiz Questions
7 pages
Set-B - CT2 - AnswerKey
No ratings yet
Set-B - CT2 - AnswerKey
10 pages
Data Analytics Mid
No ratings yet
Data Analytics Mid
15 pages
Exam 1
No ratings yet
Exam 1
3 pages
DS 1
No ratings yet
DS 1
20 pages
ML Self Unit 2
No ratings yet
ML Self Unit 2
20 pages
ML MCQ Unit 2
No ratings yet
ML MCQ Unit 2
8 pages
CSE1703 - Fundamental of Data Science
No ratings yet
CSE1703 - Fundamental of Data Science
6 pages
NASHEEEEYYYYYY
No ratings yet
NASHEEEEYYYYYY
30 pages
Ai ML Unit 1
No ratings yet
Ai ML Unit 1
15 pages
Set-C AnsKey CT2
No ratings yet
Set-C AnsKey CT2
10 pages
Set-D CT2 Answerkey
No ratings yet
Set-D CT2 Answerkey
11 pages
Dip Ii-Unit
No ratings yet
Dip Ii-Unit
7 pages
Data Science 100 MCQs
No ratings yet
Data Science 100 MCQs
16 pages
Assignment - Md. Arifur Rahman Akib-2121359642-Task-01
No ratings yet
Assignment - Md. Arifur Rahman Akib-2121359642-Task-01
2 pages
HCIA-AI V3.0 @xzZEROx
No ratings yet
HCIA-AI V3.0 @xzZEROx
3 pages
MLP1
No ratings yet
MLP1
20 pages
Data Mining
No ratings yet
Data Mining
33 pages
Quiz 4 - Data Preparation
100% (1)
Quiz 4 - Data Preparation
2 pages
Machine Learning Suggestion (2 Marks) MCQ
No ratings yet
Machine Learning Suggestion (2 Marks) MCQ
5 pages
ML Paper
No ratings yet
ML Paper
23 pages
Machine Learning With Big Data Final
No ratings yet
Machine Learning With Big Data Final
120 pages
MCQ Dlei
No ratings yet
MCQ Dlei
16 pages
Grade 12 Ai Ws Booklet - Unit 2 & 3
No ratings yet
Grade 12 Ai Ws Booklet - Unit 2 & 3
34 pages
This Sheet Is For 1 Mark Questions S.R No
No ratings yet
This Sheet Is For 1 Mark Questions S.R No
63 pages
Data Science Final Mock Test
No ratings yet
Data Science Final Mock Test
47 pages
Khoi KHDL - de On
No ratings yet
Khoi KHDL - de On
6 pages
Practice Paper 2
No ratings yet
Practice Paper 2
10 pages
Document
No ratings yet
Document
3 pages
Select The Correct Answer
No ratings yet
Select The Correct Answer
5 pages
Unit 4 Basics of Feature Engineering
100% (1)
Unit 4 Basics of Feature Engineering
33 pages
Experiment No. 5: Objective
No ratings yet
Experiment No. 5: Objective
5 pages
Ds
No ratings yet
Ds
22 pages
20 Questions On Feature Engineering and Eda
No ratings yet
20 Questions On Feature Engineering and Eda
9 pages
AWS Certified Machine Learning - Specialty - Sample Questions
No ratings yet
AWS Certified Machine Learning - Specialty - Sample Questions
5 pages
Test DS
No ratings yet
Test DS
7 pages
This Sheet Is For 1 Mark Questions S.R No
No ratings yet
This Sheet Is For 1 Mark Questions S.R No
56 pages
This Sheet Is For 1 Mark Questions S.R No
100% (1)
This Sheet Is For 1 Mark Questions S.R No
69 pages
Itae002 Test 2
No ratings yet
Itae002 Test 2
150 pages
UNIT 1 Practice Quiz - MCQs - ML
100% (1)
UNIT 1 Practice Quiz - MCQs - ML
10 pages
Foundation of Data Science Previous Year Question Paper
No ratings yet
Foundation of Data Science Previous Year Question Paper
40 pages
CAPSTONE
No ratings yet
CAPSTONE
16 pages
Mcq's (6 Topics)
No ratings yet
Mcq's (6 Topics)
42 pages
Hatdog 1.2
No ratings yet
Hatdog 1.2
18 pages
100 Days of Machine Learning
No ratings yet
100 Days of Machine Learning
14 pages
MLP Question Bank of AI and ML and NLP
No ratings yet
MLP Question Bank of AI and ML and NLP
7 pages
ML MCQs Set
No ratings yet
ML MCQs Set
18 pages
MLfinal 1
No ratings yet
MLfinal 1
7 pages
AIP-210 CertNexus Certified Artificial Intelligence Practitioner Practice Questions
No ratings yet
AIP-210 CertNexus Certified Artificial Intelligence Practitioner Practice Questions
8 pages
Data Mining Exam Questions
No ratings yet
Data Mining Exam Questions
25 pages
Ds 5
No ratings yet
Ds 5
9 pages
IGNOU MCA Data Science and Big Data Previous Years Unsolved Papers MCS 226
From Everand
IGNOU MCA Data Science and Big Data Previous Years Unsolved Papers MCS 226
Manish Soni
No ratings yet
100 Puzzles to Learn Data Warehousing
From Everand
100 Puzzles to Learn Data Warehousing
Cristian Scutaru
No ratings yet
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
From Everand
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
Sama Alshatali
No ratings yet
Som Mod4
No ratings yet
Som Mod4
14 pages
Water Air Soil Pollution
No ratings yet
Water Air Soil Pollution
17 pages
Abm 18CV643 Notes
No ratings yet
Abm 18CV643 Notes
68 pages
Shrinkage Limit Test Results
No ratings yet
Shrinkage Limit Test Results
18 pages
QS&CM - 17CV81 - Notes - Module 1
No ratings yet
QS&CM - 17CV81 - Notes - Module 1
80 pages
PD - 1
No ratings yet
PD - 1
4 pages
Q1 - Q5 Carry One Mark Each.: GA - General Aptitude
No ratings yet
Q1 - Q5 Carry One Mark Each.: GA - General Aptitude
19 pages
Hand Book
No ratings yet
Hand Book
32 pages
Q1 - Q5 Carry One Mark Each.: GA - General Aptitude
No ratings yet
Q1 - Q5 Carry One Mark Each.: GA - General Aptitude
20 pages
Design of Foundation Systems: Principles and Practices, Third Edition by Nainan P. Kurian
0% (1)
Design of Foundation Systems: Principles and Practices, Third Edition by Nainan P. Kurian
6 pages
Ground Improvement: Methods of Compaction
No ratings yet
Ground Improvement: Methods of Compaction
13 pages
Manual Phonic
0% (1)
Manual Phonic
46 pages
Urological Oncology: A Comparison Between Clinical and Pathologic Staging in Patients With Bladder Cancer
No ratings yet
Urological Oncology: A Comparison Between Clinical and Pathologic Staging in Patients With Bladder Cancer
5 pages
Service Manual, PM7100, English PT00112534 Rev A Release 8-2020
No ratings yet
Service Manual, PM7100, English PT00112534 Rev A Release 8-2020
64 pages
James Hou - Salesforce - Com Developer Resume
No ratings yet
James Hou - Salesforce - Com Developer Resume
3 pages
Sachin Pawar Resume
No ratings yet
Sachin Pawar Resume
6 pages
Taller de Circuitos
No ratings yet
Taller de Circuitos
9 pages
Occult Herbmaster - Theras
No ratings yet
Occult Herbmaster - Theras
1 page
Lesson 3 Classification of Drugs 9learners
No ratings yet
Lesson 3 Classification of Drugs 9learners
50 pages
Q3 Gender 2018 Sex Gender Nature Nurture
No ratings yet
Q3 Gender 2018 Sex Gender Nature Nurture
5 pages
RevRes PDF
No ratings yet
RevRes PDF
1,134 pages
Introduction To Modern Industrial Engineering
100% (2)
Introduction To Modern Industrial Engineering
221 pages
Module 5 Reflection 1
No ratings yet
Module 5 Reflection 1
7 pages
Instruction For AVIC F-Series In-Dash 2.008 Firmware Update
No ratings yet
Instruction For AVIC F-Series In-Dash 2.008 Firmware Update
4 pages
Type A Type B 72 78 78 76 73 81 69 74 75 82 74 75 69 75 Heaters? Find The Approximate P-Value For The Test and Interpret Its Value
No ratings yet
Type A Type B 72 78 78 76 73 81 69 74 75 82 74 75 69 75 Heaters? Find The Approximate P-Value For The Test and Interpret Its Value
9 pages
Automobile Road Test
No ratings yet
Automobile Road Test
2 pages
Irr 7920
No ratings yet
Irr 7920
15 pages
Century Iib: Autopilot Flight System
No ratings yet
Century Iib: Autopilot Flight System
24 pages
UNIT U03 02 Grammar Summary
No ratings yet
UNIT U03 02 Grammar Summary
5 pages
Mabini Colleges, Inc.: College of Nursing and Midwifery
No ratings yet
Mabini Colleges, Inc.: College of Nursing and Midwifery
2 pages
Matthew Cabral
No ratings yet
Matthew Cabral
1 page
ENGLISH-8-Quarter 2-Week 5
100% (1)
ENGLISH-8-Quarter 2-Week 5
6 pages
Material Test Report: Cse. Chiang Sung Enterprise Co., LTD
No ratings yet
Material Test Report: Cse. Chiang Sung Enterprise Co., LTD
3 pages
Harshit Ipr PPT Mba Sec B First Sem
No ratings yet
Harshit Ipr PPT Mba Sec B First Sem
12 pages
Daniel Science
No ratings yet
Daniel Science
10 pages
Anthropology 14th Edition Carol R Ember HQ File Fast Access
No ratings yet
Anthropology 14th Edition Carol R Ember HQ File Fast Access
312 pages
SCBA Pre-Use Inspection
No ratings yet
SCBA Pre-Use Inspection
2 pages
Syllabus 2021 Foundation Engineering
No ratings yet
Syllabus 2021 Foundation Engineering
4 pages
Camry - EF932 - Instructions - For - Use - Manual 21
No ratings yet
Camry - EF932 - Instructions - For - Use - Manual 21
8 pages