DS Question Bank Unit-2 Part-1
DS Question Bank Unit-2 Part-1
QUESTION BANK
UNIT-2
PART-1
1. Discuss the problems and its solutions for handling large volume of Data.
3. How the data in pandas are combined together? Discuss all the ways.
[Hint: Combining and Merging Datasets Merging on Index Concatenate Combining
with overlap]
5. Discuss the challenges encountered when merging and combining datasets, such as
handling missing values, duplicate entries, and performance issues. How can these
challenges be mitigated?
6. Load a dataset and identify duplicate records.
a) Remove them while keeping the first occurrence.
b) Replace missing values in a dataset using:
a. Mean for numerical columns
b. Mode for categorical columns
c) Group data by multiple columns and calculate the mean for each group.
d) Compute the percentage contribution of each category in a column.
e) Count unique values in each column of a dataset.