DS - Question Paper
SECTION – A
I. Answer ALL the Questions. 10 X 1 = 10
1. What is the primary purpose of Data Science?
a) Store data in databases b) Visualize all datasets
c) Extract meaningful insights from data d) Write algorithms for devices
2. Which of the following is an unstructured data source?
a) JSON files b) Excel spreadsheets
c) Text documents d) Relational databases
3. What is the purpose of data cleansing in the Data Science process?
a) To ensure data quality and accuracy
b) To visualize patterns in the data
c) To deploy models in production
d) To collect more data
4. What step follows exploratory data analysis (EDA) in the Data Science process?
a) Building Models b) Data Cleaning
c) Setting the Research Goal d) Retrieving Data
5. Which step in the Data Science process involves combining data from multiple sources?
a) Data Retrieval b) Data Integration
c) Data Analysis d) Data Visualization
6. What does a frequency distribution show?
a) A summary of how often each value occurs in a dataset
b) The proportion of data points below a specific value
c) The mean and median of a dataset
d) A representation of outliers in data
7. What is the mode in a dataset?
a) The most frequently occurring value b) The average of all values
c) The middle value in an ordered dataset d) The range of data points
Downloaded by sindu B
8. Which of the following measures is best for detecting outliers?
a) Mean b) Median
c) Interquartile Range (IQR) d) Frequency
9. A cumulative frequency distribution shows:
a) Total frequencies in each interval
b) The running total of frequencies
c) A graph of mean and mode
d) The proportion of each category
10. What is the primary purpose of a histogram?
a) To compare two datasets
b) To visualize the frequency distribution of continuous data
c) To calculate averages
d) To display relationships between variables
SECTION – B
SECTION – C
Answer ALL Questions. 3 X 10 = 30
14. Explain the role of exploratory data analysis (EDA) in model building.
15. Elaborate on data retrieval, cleansing, and transformation techniques in Data Science.
16. Describe the process of calculating quartiles and interquartile range in a dataset.
17. i) Describe outliers and their types, with examples.
ii) Explain outlier detection and its types, with examples.