0% found this document useful (0 votes)
5 views2 pages

Data Science MCQ

The document contains multiple-choice questions (MCQs) related to data science concepts, including the storage of hashtags in social media posts, the purpose of the random_state parameter in classifiers, how SMOTE addresses class imbalance, interpretation of the Silhouette Score, and the output of the predict() method in regression tasks. Each question is followed by the correct answer. The content is focused on key data science principles and practices.

Uploaded by

Raja Meenakshi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views2 pages

Data Science MCQ

The document contains multiple-choice questions (MCQs) related to data science concepts, including the storage of hashtags in social media posts, the purpose of the random_state parameter in classifiers, how SMOTE addresses class imbalance, interpretation of the Silhouette Score, and the output of the predict() method in regression tasks. Each question is followed by the correct answer. The content is focused on key data science principles and practices.

Uploaded by

Raja Meenakshi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Data Science MCQ

1. How are hashtags and mentions stored in the generated social media posts?

a) As separate columns in the DataFrame

b) As dictionaries

c) As comma-separated strings

d) As lists

Answer: c) As comma-separated strings

2. What is the purpose of the random_state parameter set to 42 in each classifier?

a) It ensures reproducibility of the results

b) It adds randomness to the model

c) It optimizes hyperparameters automatically

d) It prevents overfitting of the model

Answer: a) It ensures reproducibility of the results

3. How does SMOTE address class imbalance?

a) By duplicating instances of the majority class

b) By removing instances of the minority class

c) By generating synthetic samples for the minority class

d) By introducing noise into the dataset

Answer: c) By generating synthetic samples for the minority class


4. How is the Silhouette Score interpreted?

a) Higher scores indicate better cluster quality, with a maximum value of 1.

b) Lower scores indicate better cluster quality, with a minimum value of -1.

c) Scores close to 0 indicate better cluster quality.

d) Scores are not interpretable for clustering evaluation.

Answer: a) Higher scores indicate better cluster quality, with a maximum value of 1.

5. What is the output of the predict() method in the regression task with new data as user
input?

a) A single predicted engagement rate

b) A probability distribution of engagement rates

c) A list of engagement rates for each user input

d) A binary classification of engagement levels

Answer: a) A single predicted engagement rate

You might also like