Question Bank (1&2)
Question Bank (1&2)
2MARKS
1. 1.What does EDA stand for in data science?
2. Can you recall two primary goals of Exploratory Data Analysis (EDA)?
3. Name two software tools commonly used for EDA.
4. Define data transformation in the context of EDA.
5. Mention one key benefit of merging databases in EDA.
6. What is the primary purpose of reshaping and pivoting data during EDA?
7. Give an example of a visual aid commonly used in EDA.
8. Explain the significance of EDA in the data science process.
9. Compare and contrast EDA with classical statistical analysis. How are they different?
10. How does Bayesian analysis differ from EDA in terms of data exploration?
11. Describe how data transformation techniques can improve the quality of EDA.
12. Why is it important to compare and contrast data during EDA?
13. How can visual aids, such as histograms, help in understanding data distribution?
14. Explain the concept of merging databases and its role in EDA.
15. What is the purpose of data reshaping and pivoting, and when is it typically
performed during EDA?
16. Describe one common data transformation technique used in EDA.
16 MARKS
***************************
1. What is the primary data structure used in Pandas for handling tabular data?
2. Name two common Pandas data structures for one-dimensional data.
3. How do you access the first five rows of a DataFrame using Pandas?
4. In Pandas, what method is used to check the shape (number of rows and columns) of a
DataFrame?
5. What function is used to load a CSV file into a Pandas DataFrame?
6. Explain the purpose of the head() method in Pandas.
7. What is the default index for a newly created DataFrame in Pandas?
8. How do you drop a column from a DataFrame in Pandas?
9. Describe the difference between a Series and a DataFrame in Pandas.
10. Explain the concept of hierarchical indexing in Pandas. Provide an example.
11. How can you handle missing data in a Pandas DataFrame?
12. What is the difference between the concat() and merge() functions in Pandas for
combining DataFrames?
13. Describe the process of grouping data in Pandas and mention a function used for
aggregation within groups.
14. What is the purpose of a pivot table in Pandas, and how is it created?
15. How do vectorized string operations differ from regular string operations in Pandas?
16. Explain the difference between the append() and join() methods when combining
DataFrames in Pandas.
16 MARKS