Ds Quiz

Data science quiz

Uploaded by

priyajenat

### Set 1: Benefits and Uses of Data Science

1. **What is a primary benefit of data science in business?**

A) Increased data storage

**B) Improved decision-making based on data insights**

C) More complex algorithms

2. **Which of the following is an application of data science in healthcare?**

**A) Predicting patient readmission rates**

B) Managing hospital inventory

C) Scheduling staff shifts

3. **What does 'structured data' refer to?**

**A) Data that is organized in a fixed format**

B) Data that is unorganized and free-form

C) Data that can only be read by humans

4. **Which type of data is characterized by its ability to change over time?**

A) Static data

**B) Dynamic data**

C) Historical data

5. **Which step comes first in the data science process?**

A) Data preparation

**B) Defining research goals**

C) Presenting findings

6. **What is the purpose of exploratory data analysis (EDA)?**

A) To clean data

**B) To visualize data and uncover patterns**

C) To build predictive models

7. **Why is it important to define research goals in data science?**

A) To collect as much data as possible

**B) To guide the data collection and analysis process**

C) To increase computational power

8. **Which of the following is a common method for retrieving data?**

**A) Data scraping**

B) Data cleaning

C) Data visualization

9. **What is data normalization?**

A) Removing duplicate records

**B) Adjusting values to a common scale**

C) Changing data types
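As a rough illustration of "adjusting values to a common scale", the sketch below applies min-max scaling, which is only one of several normalization methods. The function name `min_max_normalize` and the income figures are hypothetical and chosen purely for the example.

```python
def min_max_normalize(values):
    """Rescale values linearly so the smallest maps to 0.0 and the largest to 1.0."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical; return zeros to avoid dividing by zero.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical incomes, used only for illustration.
incomes = [20_000, 35_000, 50_000, 80_000]
scaled = min_max_normalize(incomes)
print(scaled)  # [0.0, 0.25, 0.5, 1.0]
```

After scaling, features measured in very different units can be compared on the same 0 to 1 range.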

10. **Which visualization is commonly used in EDA to display the distribution of a dataset?**

**A) Box plot**

B) Bar chart

C) Pie chart

11. **What is the primary goal of building a predictive model?**

A) To understand past data

**B) To predict future outcomes based on historical data**

C) To clean the data

12. **What is an effective way to present data findings to stakeholders?**

A) Using technical jargon

**B) Creating clear visualizations and summaries**

C) Providing raw data without context

13. **Which of the following is a key technique used in data mining?**

**A) Clustering**

B) Data entry

C) Data formatting

14. **What is the primary purpose of a data warehouse?**

A) To store operational data for daily transactions

**B) To consolidate and analyze large amounts of historical data**

C) To perform real-time data processing

15. **What does the term 'mean' refer to in statistics?**

A) The most frequently occurring value

**B) The average of a dataset**

C) The middle value in a dataset

16. **Which of the following is an example of unstructured data?**

A) Customer names

**B) Social media posts**

C) Product prices

17. **What is a common challenge in data science?**

A) Excessive data cleaning

B) Too few algorithms

**C) Data privacy concerns**

18. **What role does data visualization play in data science?**

A) Only for aesthetic purposes

**B) To communicate insights effectively**

C) To complicate data analysis

19. **Which of the following techniques is often used for predictive modeling?**

A) Data entry

**B) Regression analysis**

C) Data storage
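To make the idea of regression-based prediction concrete, here is a minimal sketch of simple linear regression fitted by ordinary least squares. The helper `fit_line` and the advertising and sales numbers are assumptions invented for this example, not part of the quiz.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Covariance-like and variance-like sums around the means.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


# Hypothetical historical data: the points lie exactly on y = 2x + 1.
ad_spend = [1, 2, 3, 4, 5]
sales = [3, 5, 7, 9, 11]

slope, intercept = fit_line(ad_spend, sales)
predicted = slope * 6 + intercept  # prediction for an unseen input
print(slope, intercept, predicted)
```

The fitted line summarizes the historical data, and evaluating it at a new input gives the predicted outcome.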

20. **What is the significance of data quality in data science?**

A) It does not matter if data is large

B) It only affects visualizations

**C) It directly impacts analysis results and decisions**

---

### Set 2: Types of Data and Descriptive Statistics

1. **Which of the following is an example of qualitative data?**

A) Height of individuals

**B) Colors of cars**

C) Temperature readings

2. **What type of data is represented by numerical values?**

A) Categorical data

B) Ordinal data

**C) Quantitative data**

3. **What is a continuous variable?**

**A) A variable that can take on any value within a range**

B) A variable that has a fixed number of categories

C) A variable that can only take on whole numbers

4. **Which type of variable represents categories with a meaningful order?**

A) Nominal variable

**B) Ordinal variable**

C) Discrete variable

5. **Which of the following is a common graphical representation of categorical data?**

A) Histogram

**B) Bar chart**

C) Line graph

6. **What is a frequency table used for?**

**A) To summarize data values and their counts**

B) To display data trends over time

C) To show the relationship between two variables
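A frequency table can be sketched in a few lines with `collections.Counter` from the Python standard library. The list of car colors is hypothetical sample data.

```python
from collections import Counter

# Hypothetical categorical observations.
colors = ["red", "blue", "red", "green", "blue", "red"]

# Counter maps each distinct value to the number of times it occurs.
freq = Counter(colors)

# Print the table from most to least frequent.
for value, count in freq.most_common():
    print(f"{value:<6} {count}")
```

Each row pairs a distinct value with its count, which is the kind of summary the question describes.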

7. **What is the median?**

A) The sum of all values divided by the number of values

**B) The middle value when data is ordered**

C) The most frequently occurring value

8. **Which measure of central tendency is most affected by outliers?**

**A) Mean**

B) Median

C) Mode
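The sensitivity of the mean to outliers can be seen with a small experiment using the standard `statistics` module. The salary figures are made up for the example.

```python
from statistics import mean, median, mode

# Hypothetical salaries (in thousands), with and without one extreme value.
salaries = [30, 32, 32, 35, 36]
with_outlier = salaries + [500]

print(mean(salaries), median(salaries), mode(salaries))
print(mean(with_outlier), median(with_outlier), mode(with_outlier))
```

Adding the single value 500 moves the mean from 33 to about 110.8, while the median only shifts from 32 to 33.5 and the mode stays at 32.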

9. **What does standard deviation measure?**

A) The average of a dataset

**B) The spread of data points around the mean**

C) The maximum value in a dataset

10. **Which of the following indicates less variability in a dataset?**

A) A high standard deviation

**B) A low standard deviation**

C) A high range
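As a quick check of how standard deviation reflects spread, the sketch below compares two hypothetical datasets that share the same mean of 50. It uses `statistics.pstdev`, the population standard deviation; `statistics.stdev` would give the sample version.

```python
from statistics import mean, pstdev

# Two made-up datasets with the same mean but different spread.
tight = [48, 49, 50, 51, 52]
spread = [10, 30, 50, 70, 90]

print(mean(tight), pstdev(tight))    # values cluster near the mean
print(mean(spread), pstdev(spread))  # values lie far from the mean
```

The first dataset has a standard deviation of about 1.41 and the second about 28.28, so the lower figure corresponds to less variability.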

11. **What characterizes a normal distribution?**

A) Data is skewed to the left

**B) Data is symmetrically distributed around the mean**

C) Data has multiple peaks

12. **What does a z-score represent?**

A) The percentage of data below a certain value

**B) The number of standard deviations a data point is from the mean**

C) The average of a dataset

13. **In a standard normal distribution, what is the mean and standard deviation?**

**A) Mean = 0, Standard Deviation = 1**

B) Mean = 1, Standard Deviation = 0

C) Mean = 0, Standard Deviation = 0

14. **If a z-score is positive, what does that indicate?**

A) The value is below the mean

B) The value is equal to the mean

**C) The value is above the mean**
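The z-score questions above can be tied together with a short sketch that computes z = (x - mean) / standard deviation. The helper `z_scores` and the exam scores are hypothetical, and the code assumes the values are not all identical, since the standard deviation would then be zero.

```python
from statistics import mean, pstdev


def z_scores(values):
    """Return how many population standard deviations each value is from the mean."""
    mu = mean(values)
    sigma = pstdev(values)  # assumed non-zero for this sketch
    return [(v - mu) / sigma for v in values]


# Hypothetical exam scores with a mean of 80.
scores = [60, 70, 80, 90, 100]
z = z_scores(scores)
print(z)
```

Scores below 80 get negative z-scores, the score of 80 gets exactly 0, and scores above 80 get positive z-scores. The standardized values themselves have mean 0 and standard deviation 1.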

15. **What type of graph is typically used to display the frequency distribution of a continuous variable?**

**A) Histogram**

B) Bar chart

C) Pie chart

16. **Which of the following is an example of a discrete variable?**

A) Height of a person

**B) Number of students in a class**

C) Temperature

17. **What is the range of a dataset?**

A) The sum of all values

**B) The difference between the highest and lowest values**

C) The average of the values
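For reference, the range is the largest value minus the smallest. A minimal sketch, with a hypothetical helper `value_range` and made-up temperature readings:

```python
def value_range(values):
    """Return the difference between the highest and lowest values."""
    return max(values) - min(values)


# Hypothetical temperature readings.
temperatures = [12, 18, 25, 9, 30]
print(value_range(temperatures))  # 30 - 9 = 21
```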

18. **Which of the following best describes a bimodal distribution?**

A) One peak

B) No peaks

**C) Two distinct peaks**

19. **What does it mean if data is positively skewed?**

A) Most values are on the right side of the distribution

**B) Most values are on the left side of the distribution**

C) The mean is less than the median
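Positive (right) skew can be illustrated with a small made-up dataset in which most values are low and a few are very high. The waiting times are hypothetical.

```python
from statistics import mean, median

# Hypothetical waiting times in minutes: mostly short, with a long right tail.
wait_times = [1, 2, 2, 3, 3, 3, 4, 5, 20, 40]

print(mean(wait_times))    # pulled upward by the two large values
print(median(wait_times))  # stays with the bulk of the data
```

Most values sit at the low end, and the long right tail pulls the mean (8.3) above the median (3), which is the typical pattern in positively skewed data.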

20. **What is the mode of a dataset?**

A) The middle value when ordered

**B) The most frequently occurring value**

C) The average of the dataset

Feel free to use or modify these questions as needed!
