Test DS

Uploaded by

pablo.villegas.mills

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views7 pages

Test DS

Uploaded by

pablo.villegas.mills

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

1. What is the purpose of a scatter plot in data visualization?

a. Displaying the distribution of categorical variables

b. b) Showing the relationship between two numerical variables
c. c) Highlighting the correlation between features
d. d) Visualizing time series data
2. In machine learning, what is the main goal of dimensionality reduction?
a. a) Increasing the number of features
b. b) Improving model complexity
c. c) Reducing the size of the dataset
d. d) Capturing relevant information while reducing noise
3. Which technique is commonly used to address the issue of overfitting in machine
learning?
a. a) Regularization
b. b) Data augmentation
c. c) Feature engineering
d. d) Ensemble methods
4. What does the AUC-ROC curve measure in binary classification?
a. a) Model accuracy
b. b) Precision
c. c) Recall
d. d) True positive rate vs. false positive rate
5. Which algorithm is suitable for clustering data when the number of clusters is
known in advance?
a. a) K-means
b. b) Hierarchical clustering
c. c) DBSCAN
d. d) Random Forest
6. Which statistical test is used to determine if there is a significant difference between
the means of two groups?
a. a) ANOVA
b. b) Chi-squared test
c. c) T-test
d. d) Pearson correlation
7. What is the purpose of the Levenshtein distance metric in natural language
processing?
a. a) Measuring document similarity
b. b) Evaluating sentiment analysis
c. c) Calculating word embeddings
d. d) Quantifying the difference between two strings
8. Which machine learning technique can handle both classification and regression
tasks?
a. a) Linear regression
b. b) Decision trees
c. c) Naive Bayes
d. d) Support Vector Machines
9. What is the "bias-variance trade-off" in machine learning?
a. a) Balancing the trade-off between bias and fairness in models
b. b) Balancing the trade-off between underfitting and overfitting
c. c) Balancing the trade-off between feature selection and feature extraction
d. d) Balancing the trade-off between model accuracy and interpretability
10. Which method is used for handling class imbalance in a binary classification
problem?
a. a) Data augmentation
b. b) Feature scaling
c. c) Regularization
d. d) Principal Component Analysis (PCA)
11. Which technique is used to assess the significance of variables in a linear regression
model?
a. a) p-value
b. b) R-squared
c. c) F-statistic
d. d) Mean squared error
12. What does the term "bagging" refer to in ensemble learning?
a. a) Training multiple models sequentially
b. b) Training multiple models in parallel and averaging their predictions
c. c) Reducing the number of features in a dataset
d. d) Combining models using a weighted average
13. What is the purpose of the sigmoid activation function in a neural network?
a. a) Introducing non-linearity
b. b) Regularizing model parameters
c. c) Calculating the mean squared error
d. d) Scaling input features
14. In a decision tree, what is the "Gini impurity" used for?
a. a) Measuring the variance of the target variable
b. b) Calculating the entropy of the target variable
c. c) Quantifying the purity of a node's class distribution
d. d) Assessing the correlation between features
15. Which algorithm is used for optimizing hyperparameters in machine learning
models?
a. a) Gradient descent
b. b) K-means
c. c) Grid search
d. d) Hierarchical clustering
16. What is the purpose of the L1 regularization term in linear regression?
a. a) Reducing bias in the model
b. b) Penalizing large coefficients
c. c) Increasing model complexity
d. d) Improving convergence of optimization algorithms
17. Which technique is used to prevent the "curse of dimensionality" in machine
learning?
a. a) Regularization
b. b) Feature scaling
c. c) Dimensionality reduction
d. d) Ensemble learning
18. What is the Kullback-Leibler (KL) divergence used for in probability theory?
a. a) Measuring the similarity between two probability distributions
b. b) Calculating the variance of a dataset
c. c) Evaluating the goodness of fit of a model
d. d) Assessing the linearity of a regression model
19. Which method is commonly used for imputing missing values in a dataset?
a. a) Removing rows with missing values
b. b) Filling missing values with the mean of the feature
c. c) Ignoring missing values during analysis
d. d) Replacing missing values with the mode of the feature
20. What is the purpose of the Viterbi algorithm in Hidden Markov Models (HMM)?
a. a) Calculating the likelihood of an observation sequence
b. b) Estimating the parameters of the model
c. c) Decoding the most likely sequence of hidden states
d. d) Smoothing noisy observations
21. Which technique is used to prevent overfitting in decision trees?
a. a) Pruning
b. b) Bagging
c. c) Boosting
d. d) Feature scaling
22. What is the goal of natural language processing (NLP)?
a. a) Simulating human intelligence
b. b) Generating random text
c. c) Reducing the dimensionality of text data
d. d) Extracting and understanding information from text
23. Which evaluation metric is appropriate for imbalanced multi-class classification
problems?
a. a) Accuracy
b. b) Precision-recall curve
c. c) F1-score
d. d) Mean squared error
24. What does the term "one-hot encoding" refer to in data preprocessing?
a. a) Converting categorical variables into numerical values
b. b) Combining multiple features into a single feature
c. c) Reducing the dimensionality of data
d. d) Transforming continuous variables into binary vectors
25. Which algorithm is used for reducing the dimensionality of high-dimensional data?
a. a) Naive Bayes
b. b) K-means clustering
c. c) Principal Component Analysis (PCA)
d. d) Random Forest
26. What is the purpose of the Jensen-Shannon divergence in probability theory?
a. a) Measuring the similarity between two probability distributions
b. b) Calculating the mean of a dataset
c. c) Estimating the variance of a distribution
d. d) Assessing the linearity of a regression model
27. Which method is used for text data preprocessing to remove unnecessary words and
reduce dimensionality?
a. a) One-hot encoding
b. b) Word embedding
c. c) Stopword removal
d. d) Lemmatization
28. What is the primary purpose of cross-validation in machine learning?
a. a) Training a model on all available data
b. b) Evaluating a model's performance on a separate dataset
c. c) Dividing data into training and testing sets
d. d) Visualizing the distribution of data
29. Which technique is used for reducing variance and improving the generalization of
an ensemble model?
a. a) Bagging
b. b) Boosting
c. c) Pruning
d. d) Regularization
30. In a support vector machine (SVM), what is the "kernel trick" used for?
a. a) Reducing model complexity
b. b) Adding new features to the dataset
c. c) Transforming data into a higher-dimensional space
d. d) Improving convergence of the optimization algorithm
31. What does the term "precision" refer to in binary classification?
a. a) The ratio of true positives to true negatives
b. b) The ratio of true positives to the sum of true positives and false positives
c. c) The ratio of true positives to the sum of true positives and false negatives
d. d) The ratio of true negatives to the sum of true negatives and false negatives
32. Which technique is used for generating new data samples using a trained model?
a. a) Clustering
b. b) Dimensionality reduction
c. c) Data augmentation
d. d) Regularization
33. What is the goal of feature scaling in machine learning?
a. a) Converting categorical features into numerical values
b. b) Balancing class distribution
c. c) Scaling numerical features to a similar range
d. d) Increasing the complexity of the model
34. What is the primary purpose of a confusion matrix in binary classification?
a. a) Evaluating the model's performance
b. b) Calculating the mean squared error
c. c) Identifying the number of features
d. d) Visualizing the data distribution
35. Which algorithm is used for extracting important features from text data?
a. a) Principal Component Analysis (PCA)
b. b) Linear Discriminant Analysis (LDA)
c. c) K-means clustering
d. d) Gradient Boosting
36. What does the term "bag of words" represent in natural language processing?
a. a) A technique for analyzing sentence structure
b. b) A method for encoding categorical variables
c. c) A model for sequence generation
d. d) A representation of text as a collection of word occurrences
37. Which method is used to mitigate the issue of multicollinearity in linear regression?
a. a) Feature scaling
b. b) L1 regularization
c. c) L2 regularization
d. d) Removing one of the correlated features
38. What is the primary purpose of a learning rate in gradient descent optimization?
a. a) Balancing the trade-off between bias and variance
b. b) Adjusting the number of iterations in training
c. c) Controlling the step size during parameter updates
d. d) Calculating the regularization term
39. Which technique is used for evaluating the importance of features in a random
forest model?
a. a) Gini impurity
b. b) Area Under the Curve (AUC)
c. c) Recursive Feature Elimination (RFE)
d. d) Mean squared error
40. What is the purpose of the log loss (binary cross-entropy) loss function in
classification?
a. a) Calculating the mean squared error
b. b) Minimizing the difference between predicted and actual values
c. c) Penalizing large model coefficients
d. d) Encouraging confident predictions and penalizing uncertainty
41. In time series forecasting, what is the role of the "lag" parameter?
a. a) Balancing class distribution
b. b) Specifying the number of clusters
c. c) Defining the number of previous time steps to consider
d. d) Determining the learning rate
42. Which technique is used for representing text data in a continuous vector space?
a. a) One-hot encoding
b. b) Word embedding
c. c) TF-IDF
d. d) Bag of words
43. What is the purpose of the Hessian matrix in optimization algorithms?
a. a) Calculating the gradient of the loss function
b. b) Regularizing model parameters
c. c) Determining the step size during optimization
d. d) Improving the convergence of gradient descent
44. Which algorithm is commonly used for sentiment analysis in text data?
a. a) Linear regression
b. b) Support Vector Machines (SVM)
c. c) Naive Bayes
d. d) Decision trees
45. What is the goal of the Expectation-Maximization (EM) algorithm?
a. a) Calculating the mean squared error
b. b) Training deep neural networks
c. c) Clustering data into groups
d. d) Optimizing hyperparameters
46. Which method is used for reducing variance in a model by averaging multiple
instances of it?
a. a) Regularization
b. b) Ensemble learning
c. c) Feature scaling
d. d) Dimensionality reduction
47. What is the purpose of the inverted dropout technique in neural networks?
a. a) Preventing overfitting by dropping out neurons during training
b. b) Scaling the input features to a similar range
c. c) Increasing model complexity by adding more layers
d. d) Introducing non-linearity
48. Which technique is used for finding the optimal number of clusters in K-means
clustering?
a. a) The Elbow method
b. b) Principal Component Analysis (PCA)
c. c) The Silhouette score
d. d) Regularization
49. What is the goal of gradient boosting in ensemble learning?
a. a) Increasing the variance of individual models
b. b) Training multiple models in parallel
c. c) Combining weak learners to create a strong model
d. d) Reducing the bias of the model
50. Which method is used for reducing the dimensionality of high-dimensional data
while preserving its variance?
a. a) Principal Component Analysis (PCA)
b. b) K-means clustering
c. c) Support Vector Machines (SVM)
d. d) Bagging

Answers:

1. b) Showing the relationship between two numerical variables

2. d) Capturing relevant information while reducing noise
3. a) Regularization
4. d) True positive rate vs. false positive rate
5. a) K-means
6. c) T-test
7. a) Measuring document similarity
8. d) Support Vector Machines
9. b) Data augmentation
10. c) F1-score
11. a) p-value
12. b) Training multiple models in parallel and averaging their predictions
13. a) Introducing non-linearity
14. c) Quantifying the purity of a node's class distribution
15. c) Grid search
16. b) Penalizing large coefficients
17. c) Dimensionality reduction
18. a) Measuring the similarity between two probability distributions
19. b) Filling missing values with the mean of the feature
20. c) Decoding the most likely sequence of hidden states
21. a) Pruning
22. d) Extracting and understanding information from text
23. c) F1-score
24. a) Converting categorical variables into numerical values
25. c) Principal Component Analysis (PCA)
26. a) Measuring the similarity between two probability distributions
27. c) Stopword removal
28. b) Evaluating a model's performance on a separate dataset
29. a) Bagging
30. c) Transforming data into a higher-dimensional space
31. b) The ratio of true positives to the sum of true positives and false positives
32. c) Data augmentation
33. c) Scaling numerical features to a similar range
34. a) Evaluating the model's performance
35. b) Linear Discriminant Analysis (LDA)
36. d) A representation of text as a collection of word occurrences
37. b) L1 regularization
38. c) Controlling the step size during parameter updates
39. a) Gini impurity
40. d) Encouraging confident predictions and penalizing uncertainty
41. c) Defining the number of previous time steps to consider
42. b) Word embedding
43. c) Determining the step size during optimization
44. c) Naive Bayes
45. c) Clustering data into groups
46. b) Ensemble learning
47. a) Preventing overfitting by dropping out neurons during training
48. a) The Elbow method
49. c) Combining weak learners to create a strong model
50. a) Principal Component Analysis (PCA)

SEC III Artificial Intelligence Question Bank
No ratings yet
SEC III Artificial Intelligence Question Bank
86 pages
Data Science 100 MCQs
No ratings yet
Data Science 100 MCQs
16 pages
Data Science Quiz Questions
No ratings yet
Data Science Quiz Questions
7 pages
Interview Prep Data Science, Machine Learning, Deep Learning MCQs
No ratings yet
Interview Prep Data Science, Machine Learning, Deep Learning MCQs
31 pages
Practice Paper 2
No ratings yet
Practice Paper 2
10 pages
Ai ML Unit 1
No ratings yet
Ai ML Unit 1
15 pages
ML MCQ QB
No ratings yet
ML MCQ QB
5 pages
MCQS ML
No ratings yet
MCQS ML
27 pages
Ai ML Unit 3
No ratings yet
Ai ML Unit 3
15 pages
Practice Paper 3
No ratings yet
Practice Paper 3
9 pages
ML 1-100
No ratings yet
ML 1-100
21 pages
Practice Paper 4
No ratings yet
Practice Paper 4
9 pages
ML Objective
No ratings yet
ML Objective
5 pages
MLfinal 1
No ratings yet
MLfinal 1
7 pages
Unit 1 - Capstone Project-Answer Key
No ratings yet
Unit 1 - Capstone Project-Answer Key
21 pages
Khoi KHDL - de On
No ratings yet
Khoi KHDL - de On
6 pages
CHP 1,2
No ratings yet
CHP 1,2
18 pages
CAPSTONE
No ratings yet
CAPSTONE
16 pages
MLT QN Bank Merged
No ratings yet
MLT QN Bank Merged
26 pages
Set 3
No ratings yet
Set 3
6 pages
Huawei Final Written Exam
50% (2)
Huawei Final Written Exam
18 pages
Capstone Project
No ratings yet
Capstone Project
17 pages
Machine Learning MCQ
No ratings yet
Machine Learning MCQ
4 pages
Machine Learning Question Bank
No ratings yet
Machine Learning Question Bank
7 pages
MCQ of Machine Learning
100% (2)
MCQ of Machine Learning
151 pages
Lect 7 Q
No ratings yet
Lect 7 Q
4 pages
Final Quiz Statistical Modeling ML Ai
No ratings yet
Final Quiz Statistical Modeling ML Ai
15 pages
Sem3 Asmt Answers
No ratings yet
Sem3 Asmt Answers
20 pages
Da CH2 Slqa
No ratings yet
Da CH2 Slqa
9 pages
Machine Learning Vapnik-Chervonenkis (VC) Dimension
No ratings yet
Machine Learning Vapnik-Chervonenkis (VC) Dimension
4 pages
ML Objectives Mid 1
No ratings yet
ML Objectives Mid 1
5 pages
Data Science Final Mock Test
No ratings yet
Data Science Final Mock Test
47 pages
MLP Question Bank of AI and ML and NLP
No ratings yet
MLP Question Bank of AI and ML and NLP
7 pages
Exam Preparation - Machine Learning Applications
No ratings yet
Exam Preparation - Machine Learning Applications
4 pages
ML MCQ
No ratings yet
ML MCQ
7 pages
Made Easy
No ratings yet
Made Easy
11 pages
Mcqs 1
No ratings yet
Mcqs 1
34 pages
Questions For ML - Built A Thon
No ratings yet
Questions For ML - Built A Thon
7 pages
Semester Suggestion Solution
No ratings yet
Semester Suggestion Solution
26 pages
Assignment 11 Day 19 (Macchine Learning Assignment) Sandip Kendre
No ratings yet
Assignment 11 Day 19 (Macchine Learning Assignment) Sandip Kendre
4 pages
The Foos Full
No ratings yet
The Foos Full
147 pages
ML Suggestion 2
No ratings yet
ML Suggestion 2
11 pages
Machine Learning Imp Questions
100% (2)
Machine Learning Imp Questions
95 pages
Data Science
No ratings yet
Data Science
35 pages
ML BIT Ans
No ratings yet
ML BIT Ans
5 pages
Set 2
No ratings yet
Set 2
6 pages
Machine Learning
No ratings yet
Machine Learning
2 pages
Practice MCQ AI
No ratings yet
Practice MCQ AI
4 pages
Questions and Answers
No ratings yet
Questions and Answers
7 pages
Pa - Imp Qus
No ratings yet
Pa - Imp Qus
4 pages
ML QB Ans
No ratings yet
ML QB Ans
48 pages
Machine Learning
No ratings yet
Machine Learning
5 pages
d3 PDF
No ratings yet
d3 PDF
7 pages
10 Plug N Play Email Templates
100% (3)
10 Plug N Play Email Templates
13 pages
Lecture 3 Mcqs
No ratings yet
Lecture 3 Mcqs
7 pages
Machine Learning
No ratings yet
Machine Learning
9 pages
How To Use Matlab and Simulink With Arduino
No ratings yet
How To Use Matlab and Simulink With Arduino
16 pages
Applied Data Science Questions
No ratings yet
Applied Data Science Questions
15 pages
Shivaji University, Kolhapur
No ratings yet
Shivaji University, Kolhapur
12 pages
Mcqs Bank Unit 1: A) The Autonomous Acquisition of Knowledge Through The Use of Computer Programs
100% (1)
Mcqs Bank Unit 1: A) The Autonomous Acquisition of Knowledge Through The Use of Computer Programs
8 pages
Conected Car India Market Study
No ratings yet
Conected Car India Market Study
8 pages
Data Mining Weka Classic
No ratings yet
Data Mining Weka Classic
36 pages
Dynamically in XML Publisher
No ratings yet
Dynamically in XML Publisher
2 pages
en
No ratings yet
en
4 pages
WCCustomizers Guide
No ratings yet
WCCustomizers Guide
571 pages
FE Lab 1
No ratings yet
FE Lab 1
23 pages
Pricelist
No ratings yet
Pricelist
6 pages
Self Test
No ratings yet
Self Test
99 pages
As 1199.1-2003 Sampling Procedures For Inspection by Attributes Sampling Schemes Indexed by Acceptance Qualit
No ratings yet
As 1199.1-2003 Sampling Procedures For Inspection by Attributes Sampling Schemes Indexed by Acceptance Qualit
10 pages
LoadTracer - A Load Testing Tool
No ratings yet
LoadTracer - A Load Testing Tool
9 pages
Bluetooth Setup Quick Reference Guide (QR-Bluetooth Rev 2)
No ratings yet
Bluetooth Setup Quick Reference Guide (QR-Bluetooth Rev 2)
7 pages
Dect PDF
No ratings yet
Dect PDF
5 pages
AWS Bill
No ratings yet
AWS Bill
2 pages
FM Read - Text
No ratings yet
FM Read - Text
3 pages
Database
No ratings yet
Database
53 pages
P 2M: Generating Deployable Models From Natural Language Instructions
No ratings yet
P 2M: Generating Deployable Models From Natural Language Instructions
10 pages
Comp611-Turbo C (Chap 2)
No ratings yet
Comp611-Turbo C (Chap 2)
12 pages
Herat University Library Management System English User Manual
No ratings yet
Herat University Library Management System English User Manual
25 pages
Department of Mechanical Engineering Mentofmechanicalengineering Ofmechanicalengineering
No ratings yet
Department of Mechanical Engineering Mentofmechanicalengineering Ofmechanicalengineering
2 pages
Coding Guidelines IOS Swift
No ratings yet
Coding Guidelines IOS Swift
15 pages
Zigbee Motion Detector Zmove: Revision: 4.0 Document: Um - Zmove - 20090731 - 001 - 04 - 00
No ratings yet
Zigbee Motion Detector Zmove: Revision: 4.0 Document: Um - Zmove - 20090731 - 001 - 04 - 00
18 pages
Implementing A Stack On A Xilinx Spartan 3e CW558 - Nov 2013
No ratings yet
Implementing A Stack On A Xilinx Spartan 3e CW558 - Nov 2013
10 pages
Answer The Following Questions.: Print Post Test
No ratings yet
Answer The Following Questions.: Print Post Test
5 pages
Pakistan Map With Eastings Northings Without Google Base Map
No ratings yet
Pakistan Map With Eastings Northings Without Google Base Map
1 page
Dilip Sir
No ratings yet
Dilip Sir
1 page
Mr. Ashok Ramchandra Patel: Professional Objective
No ratings yet
Mr. Ashok Ramchandra Patel: Professional Objective
3 pages
IGNOU MCA Digital Image Processing and Computer Vision Unsolved Paper Book MCS 230
From Everand
IGNOU MCA Digital Image Processing and Computer Vision Unsolved Paper Book MCS 230
Manish Soni
No ratings yet
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
From Everand
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
Sama Alshatali
No ratings yet
IGNOU MCA Data Science and Big Data Previous Years Unsolved Papers MCS 226
From Everand
IGNOU MCA Data Science and Big Data Previous Years Unsolved Papers MCS 226
Manish Soni
No ratings yet

Test DS

Uploaded by

Test DS

Uploaded by

1. What is the purpose of a scatter plot in data visualization?

a. Displaying the distribution of categorical variables

1. b) Showing the relationship between two numerical variables

You might also like