MCQS ML
b) A method for computers to learn from data without being explicitly programmed
Answer: b) A method for computers to learn from data without being explicitly programmed
a) Supervised learning
b) Unsupervised learning
c) Reinforcement learning
d) Deterministic learning
b) Image classification
c) Customer segmentation
d) Handwriting recognition
d) Mechanical engineering
a) When a model performs well on training data but poorly on unseen data
Answer: a) When a model performs well on training data but poorly on unseen data
8. Which of the following evaluation metrics is commonly used for classification problems?
a) Mean squared error (MSE)
b) Accuracy
d) R-squared
Answer: b) Accuracy
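As a rough illustration of the metric named in the answer above, accuracy is the fraction of predictions that match the true labels. The sketch below is a minimal, assumed implementation; the function name and the sample labels are made up for illustration and are not part of the quiz.

```python
def accuracy(y_true, y_pred):
    """Return the fraction of predictions that equal the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("cannot compute accuracy on empty input")
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


# Hypothetical labels for a binary classifier (illustrative only).
y_true = [1, 0, 1, 1, 0, 1]
y_pred = [1, 0, 0, 1, 0, 1]
acc = accuracy(y_true, y_pred)  # 5 of the 6 predictions match
print(f"accuracy = {acc:.3f}")
```

Accuracy can be misleading when one class dominates the data, which is why precision, recall, and the F1 score appear as alternatives elsewhere in these questions.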
c) To extract useful information from raw data and create informative features
Answer: c) To extract useful information from raw data and create informative features
10. Which of the following is a challenge in deploying machine learning models in real-world applications?
a) Lack of data
b) Overfitting
c) Interpretability
d) High accuracy
Answer: c) Interpretability
d) Personalized medicine
13. Which industry often uses machine learning for demand forecasting and inventory management?
a) Retail
b) Education
c) Entertainment
d) Agriculture
Answer: a) Retail
14. In which field is machine learning used to improve customer service and user experience?
a) Aerospace
b) Hospitality
c) Construction
d) Mining
Answer: b) Hospitality
a) Weather prediction
b) Cybersecurity
c) Targeted advertising
d) Climate modeling
16. Which of the following is an example of a machine learning application in the transportation industry?
a) Predicting earthquakes
d) Drug discovery
d) DNA sequencing
18. In which industry is machine learning applied to optimize energy consumption and reduce costs?
a) Fashion
b) Real estate
c) Energy management
d) Publishing
a) Healthcare
b) Telecommunications
c) Manufacturing
d) Government
Answer: c) Manufacturing
a) Weather forecasting
b) Language translation
c) Intrusion detection
d) Agricultural monitoring
a) Data preprocessing
b) Model training
c) Feature selection
d) Problem formulation
22. What does the term "feature engineering" refer to in the machine learning process?
23. Which of the following techniques can be used for handling missing data in a dataset?
b) Imputation
24. What is the purpose of splitting a dataset into training and testing sets?
a) To reduce overfitting
25. Which of the following is NOT a common machine learning algorithm evaluation metric?
a) Accuracy
c) Precision
d) Data preprocessing
26. What does the term "hyperparameter tuning" refer to in machine learning?
27. Which of the following techniques is used to prevent overfitting in machine learning?
a) Cross-validation
b) Feature engineering
Answer: a) Cross-validation
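Cross-validation guards against overfitting by scoring the model only on data it was not trained on. The sketch below shows one assumed way to build the k-fold index splits in plain Python; the helper name k_fold_indices is hypothetical, and in practice a library routine would normally be used instead.

```python
def k_fold_indices(n_samples, k):
    """Split indices 0..n_samples-1 into k (train, test) pairs.

    Every index appears in exactly one test fold.
    """
    if k < 2 or k > n_samples:
        raise ValueError("k must be between 2 and n_samples")
    indices = list(range(n_samples))
    # Spread the remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    folds = []
    start = 0
    for size in fold_sizes:
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        folds.append((train_idx, test_idx))
        start += size
    return folds


folds = k_fold_indices(10, 3)
for train_idx, test_idx in folds:
    print("train:", train_idx, "test:", test_idx)
```

A model would be trained on each train split and scored on the matching test split; averaging the k scores gives an estimate of performance on unseen data.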
28. What is the purpose of model evaluation in the machine learning process?
b) To optimize hyperparameters
29. In which step of the machine learning process does the actual learning take place?
a) Data preprocessing
b) Feature engineering
c) Model training
d) Model evaluation
a) Model training
b) Model evaluation
c) Model deployment
d) Data preprocessing
b) The process of selecting the most relevant features for model training
Answer: a) The process of cleaning raw data to remove noise and inconsistencies
32. Which of the following techniques is used for handling missing data?
b) The process of identifying and handling data points that deviate significantly from the rest of the dataset
c) The process of scaling numerical features to a similar range
Answer: b) The process of identifying and handling data points that deviate significantly from the rest of the dataset
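One common way to identify such points is the interquartile range (IQR) rule, which flags values far outside the middle half of the data. The sketch below is a minimal example under that assumption; the 1.5 multiplier is the conventional default, and the function name and sample readings are illustrative only.

```python
import statistics


def find_outliers(values, factor=1.5):
    """Return the values lying more than factor * IQR outside the quartiles."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    lower = q1 - factor * iqr
    upper = q3 + factor * iqr
    return [v for v in values if v < lower or v > upper]


# Hypothetical sensor readings with one obviously extreme value.
readings = [10, 12, 11, 13, 12, 11, 95]
outliers = find_outliers(readings)
print("outliers:", outliers)
```

Whether a flagged point is removed, capped, or kept depends on the problem; an extreme value can be a measurement error or a genuine rare event.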
a) Min-Max scaling
b) Standardization
c) Normalization
b) Feature selection
c) Polynomial regression
c) To extract useful information from raw data and create informative features
Answer: c) To extract useful information from raw data and create informative features
38. Which of the following is a common technique for handling imbalanced datasets?
a) Feature scaling
b) Model training
42. Which of the following techniques is commonly used for splitting a dataset into training and testing sets?
c) Cross-validation
d) Gradient descent
Answer: c) Cross-validation
43. What does the loss function measure during model training?
44. In gradient descent optimization, what does the learning rate control?
a) The number of iterations
46. Which of the following is a commonly used regularization technique in linear regression?
a) L1 regularization (Lasso)
b) K-means clustering
c) Decision trees
47. What does early stopping refer to in the context of model training?
a) Stopping the training process when the loss function reaches a minimum
48. Which of the following is NOT a common method for improving model performance during training?
a) Feature scaling
b) Data augmentation
c) Dropout regularization
d) Batch normalization
50. Which of the following evaluation metrics is commonly used for regression problems?
a) Accuracy
b) F1-score
d) Precision
52. Which type of machine learning algorithm is commonly used for prediction tasks?
a) Supervised learning
b) Unsupervised learning
c) Reinforcement learning
d) Semi-supervised learning
53. In supervised learning for prediction, what are the input variables typically referred to as?
a) Features
b) Labels
c) Predictions
d) Observations
Answer: a) Features
54. Which of the following algorithms is often used for time-series prediction?
a) Decision trees
b) K-nearest neighbors
d) K-means clustering
56. Which evaluation metric is commonly used for regression tasks in prediction?
a) Accuracy
b) Precision
d) Recall
58. Which of the following techniques can help prevent overfitting in predictive modeling?
c) Regularization
59. What is the term used to describe the difference between predicted values and actual values in prediction tasks?
a) Residuals
b) Errors
c) Deviations
d) Losses
Answer: a) Residuals
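To make the term concrete, a residual is the actual value minus the predicted value for one observation. The sketch below computes residuals and, from them, the mean squared error; the function names and the numbers are assumptions chosen for illustration.

```python
def residuals(actual, predicted):
    """Return actual minus predicted for each observation."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    return [a - p for a, p in zip(actual, predicted)]


def mean_squared_error(actual, predicted):
    """Average of the squared residuals."""
    res = residuals(actual, predicted)
    return sum(r * r for r in res) / len(res)


# Hypothetical regression outputs (illustrative only).
actual = [3.0, 5.0, 7.5]
predicted = [2.5, 5.0, 8.0]
res = residuals(actual, predicted)
mse = mean_squared_error(actual, predicted)
print("residuals:", res)
print("MSE:", mse)
```

A positive residual means the model predicted too low, and a negative one means it predicted too high.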
60. Which approach is commonly used to handle missing data in prediction tasks?
b) Imputation
Answer: b) Imputation
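Imputation replaces missing entries with an estimate computed from the observed ones. The sketch below shows mean imputation, one of the simplest variants, and assumes missing values are represented as None; the function name and data are hypothetical.

```python
def impute_mean(values):
    """Replace None entries with the mean of the observed entries."""
    observed = [v for v in values if v is not None]
    if not observed:
        raise ValueError("no observed values to compute a mean from")
    mean = sum(observed) / len(observed)
    return [mean if v is None else v for v in values]


# Hypothetical column with two missing entries.
column = [4.0, None, 6.0, None, 8.0]
filled = impute_mean(column)
print(filled)
```

Median or mode imputation follows the same pattern and is often preferred when the column contains outliers or categorical values.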
c) The process of preprocessing the dataset before feeding it into the model
a) Learning rate
b) Number of iterations
c) Number of features
d) Regularization parameter
64. Which method is typically used to evaluate the performance of different hyperparameter configurations during grid search?
b) Accuracy
c) Cross-validation
d) F1 score
Answer: c) Cross-validation
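The combination named in the answer above can be sketched as follows: each candidate hyperparameter value is scored by its average validation error across folds, and the value with the lowest error is kept. This is a toy example under stated assumptions: a one-feature ridge regression with no intercept, a hand-picked grid, and made-up data; all function names are hypothetical.

```python
def fit_ridge_1d(xs, ys, lam):
    """Closed-form slope for y ~ w * x with an L2 penalty of strength lam."""
    return sum(x * y for x, y in zip(xs, ys)) / (sum(x * x for x in xs) + lam)


def cv_mse(xs, ys, lam, k):
    """Average validation MSE over k folds (fold = index modulo k)."""
    n = len(xs)
    total = 0.0
    for fold in range(k):
        test_idx = [i for i in range(n) if i % k == fold]
        train_idx = [i for i in range(n) if i % k != fold]
        w = fit_ridge_1d([xs[i] for i in train_idx],
                         [ys[i] for i in train_idx], lam)
        total += sum((ys[i] - w * xs[i]) ** 2 for i in test_idx) / len(test_idx)
    return total / k


# Hypothetical data lying roughly on the line y = 2x.
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1, 12.2]

grid = [0.0, 0.1, 1.0, 10.0]  # candidate regularization strengths
scores = {lam: cv_mse(xs, ys, lam, k=3) for lam in grid}
best_lam = min(scores, key=scores.get)
print("CV error per candidate:", scores)
print("selected lambda:", best_lam)
```

Scoring on held-out folds rather than on the training data keeps the search from simply rewarding the configuration that memorizes the training set best.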
a) Grid search
b) Random search
c) Bayesian optimization
d) Genetic algorithms
67. What is the main drawback of grid search compared to random search in parameter tuning?
b) Grid search may miss optimal hyperparameters located between grid points
Answer: b) Grid search may miss optimal hyperparameters located between grid points
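The drawback in the answer above can be seen on a toy objective whose best value lies between two grid points. In the sketch below, the objective function, its optimum at 0.27, the grid, and the search range are all assumptions made for illustration; random search is not guaranteed to do better on any single run.

```python
import random


def objective(learning_rate):
    """Toy validation score that peaks at learning_rate = 0.27."""
    return -(learning_rate - 0.27) ** 2


# Grid search can only ever return one of its predefined points.
grid = [0.1, 0.2, 0.3, 0.4]
best_grid = max(grid, key=objective)

# Random search samples anywhere in the range, so it can land between points.
rng = random.Random(0)
candidates = [rng.uniform(0.0, 0.5) for _ in range(20)]
best_random = max(candidates, key=objective)

print("grid search picked:", best_grid, "score:", objective(best_grid))
print("random search picked:", best_random, "score:", objective(best_random))
```

Here the grid can get no closer than 0.3 however long it runs, while the random samples are free to fall nearer the true optimum.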
68. In Bayesian optimization, what information is used to guide the search for optimal hyperparameters?
a) Randomness
b) Gradient descent
70. Which hyperparameter is commonly tuned for regularization purposes in machine learning models?
a) Learning rate
c) Dropout rate
d) Batch size
72. Which of the following is NOT a commonly used data collection method in machine learning?
A) Web scraping
B) Random guessing
74. Which of the following is a disadvantage of using web scraping for data collection?
75. What is a key consideration when collecting data through surveys and questionnaires for machine learning?
76. Which data collection method is commonly used for gathering information from IoT devices?
A) Random sampling
B) Crowdsourcing
C) Sensor data collection
78. Which of the following is NOT a consideration when selecting data collection methods?
A) Cost-effectiveness
B) Ethical implications
C) Removing outliers
81. In natural language processing (NLP), what data augmentation technique is used to generate additional training examples?
83. What is the purpose of applying data augmentation techniques such as rotation and scaling to images?
Answer: B) To introduce variability and make the model robust to different viewpoints
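A minimal sketch of this idea, assuming an image is stored as a nested list of pixel values: each transform produces a new view of the same picture, and the label stays unchanged. The helper names are hypothetical, and real pipelines usually rely on an image library and also support arbitrary angles and scaling.

```python
def rotate_90_clockwise(image):
    """Rotate a 2D list of pixels by 90 degrees clockwise."""
    return [list(row) for row in zip(*image[::-1])]


def flip_horizontal(image):
    """Mirror a 2D list of pixels left to right."""
    return [row[::-1] for row in image]


# Hypothetical 2x2 grayscale image.
image = [[1, 2],
         [3, 4]]

augmented = [image, rotate_90_clockwise(image), flip_horizontal(image)]
for variant in augmented:
    print(variant)
```

Training on all three variants exposes the model to the same content from different viewpoints.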
85. What is an important consideration when applying data augmentation to imbalanced datasets?
86. How does data augmentation contribute to reducing overfitting in machine learning models?
88. Which of the following techniques is commonly used for synthetic data generation?
B) Data augmentation
C) Feature engineering
90. Which of the following is an example of a technique used for generating synthetic tabular data?
A) Image rotation
D) Random cropping
Answer: C) SMOTE (Synthetic Minority Over-sampling Technique)
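The core idea of SMOTE is to create a new minority-class row by interpolating between an existing minority row and one of its nearest minority neighbors. The sketch below is a simplified, assumed version of that idea in plain Python, not a reference implementation; the function name, the default k, and the sample rows are illustrative.

```python
import random


def smote_like(minority, n_new, k=2, seed=0):
    """Generate n_new synthetic rows by interpolating between minority rows.

    Simplified sketch: assumes numeric features and at least two rows.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority rows")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority rows by squared Euclidean distance to base.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(p, base)))
        neighbor = rng.choice(others[:k])
        gap = rng.random()  # position along the segment from base to neighbor
        synthetic.append(tuple(a + gap * (b - a) for a, b in zip(base, neighbor)))
    return synthetic


# Hypothetical minority-class rows with two numeric features.
minority = [(1.0, 2.0), (1.5, 2.5), (2.0, 2.0), (1.2, 1.8)]
new_rows = smote_like(minority, n_new=4)
for row in new_rows:
    print(row)
```

Because each synthetic row lies on a line segment between two real minority rows, the new points stay inside the region the minority class already occupies.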
91. What is the main advantage of using synthetic data in machine learning?
92. Which of the following is NOT a common approach to data generation in deep learning?
B) Autoencoders
C) Decision Trees
93. What challenge is often associated with using generated data in machine learning models?
94. How does data generation contribute to addressing class imbalance in classification tasks?