Question Bank (1&2)

Uploaded by

kokila sadeesh1986

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views4 pages

Question Bank (1&2)

Uploaded by

kokila sadeesh1986

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

QUESTIONS BANK FOR EDA UNIT 1

2MARKS
1. 1.What does EDA stand for in data science?
2. Can you recall two primary goals of Exploratory Data Analysis (EDA)?
3. Name two software tools commonly used for EDA.
4. Define data transformation in the context of EDA.
5. Mention one key benefit of merging databases in EDA.
6. What is the primary purpose of reshaping and pivoting data during EDA?
7. Give an example of a visual aid commonly used in EDA.
8. Explain the significance of EDA in the data science process.
9. Compare and contrast EDA with classical statistical analysis. How are they different?
10. How does Bayesian analysis differ from EDA in terms of data exploration?
11. Describe how data transformation techniques can improve the quality of EDA.
12. Why is it important to compare and contrast data during EDA?
13. How can visual aids, such as histograms, help in understanding data distribution?
14. Explain the concept of merging databases and its role in EDA.
15. What is the purpose of data reshaping and pivoting, and when is it typically
performed during EDA?
16. Describe one common data transformation technique used in EDA.

16 MARKS

1. What does EDA stand for in data science?

2. Name two fundamental components of EDA.
3. List two software tools commonly used for EDA.
4. Define data transformation in the context of EDA
5. Explain the significance of EDA in the data science process.
6. Compare and contrast EDA with classical statistical analysis.
7. How does Bayesian analysis differ from EDA in terms of approach and goals?
8. Can you provide an example of a situation where EDA is more appropriate than
classical analysis?
9. Given a dataset, describe a specific EDA technique you would use to explore its
distribution.
10. Imagine you have two datasets you need to merge for analysis. What EDA
considerations should you take into account before merging them?
11. Provide an example of reshaping and pivoting data in EDA. How can this help in data
exploration?
12. What are the potential challenges or limitations of using EDA in real-world data
analysis projects?
13. Compare the advantages and disadvantages of visual aids, such as histograms and
scatter plots, in EDA.
14. How can EDA uncover hidden patterns or anomalies in data that might not be evident
through classical analysis?
15. Assess the impact of data quality on the effectiveness of EDA. How can poor data
quality affect the results of EDA?
16. In a given business scenario, explain why a data scientist might choose EDA over
Bayesian analysis when trying to gain insights from a large dataset.

***************************

QUESTION BANK FOR UNIT 2

2MARKS

1. What is the primary data structure used in Pandas for handling tabular data?
2. Name two common Pandas data structures for one-dimensional data.
3. How do you access the first five rows of a DataFrame using Pandas?
4. In Pandas, what method is used to check the shape (number of rows and columns) of a
DataFrame?
5. What function is used to load a CSV file into a Pandas DataFrame?
6. Explain the purpose of the head() method in Pandas.
7. What is the default index for a newly created DataFrame in Pandas?
8. How do you drop a column from a DataFrame in Pandas?
9. Describe the difference between a Series and a DataFrame in Pandas.
10. Explain the concept of hierarchical indexing in Pandas. Provide an example.
11. How can you handle missing data in a Pandas DataFrame?
12. What is the difference between the concat() and merge() functions in Pandas for
combining DataFrames?
13. Describe the process of grouping data in Pandas and mention a function used for
aggregation within groups.
14. What is the purpose of a pivot table in Pandas, and how is it created?
15. How do vectorized string operations differ from regular string operations in Pandas?
16. Explain the difference between the append() and join() methods when combining
DataFrames in Pandas.

16 MARKS

1. Define what Pandas is and explain its significance in data manipulation.

2. List and briefly describe the primary Pandas objects used for data manipulation.
3. Recall the purpose of data indexing in Pandas and how it facilitates data selection.
4. Name two common methods for handling missing data in Pandas.
5. Explain the concept of hierarchical indexing in Pandas. Provide an example to
illustrate its use.
6. Differentiate between concatenation and merging of datasets in Pandas. When would
you use each?
7. Describe how vectorized string operations work in Pandas. Give an example of a
practical use case.
8. Why is it important to use the appropriate aggregation functions when grouping data
in Pandas? Provide an example.
9. Given a dataset, write code in Pandas to perform a left join between two DataFrames
using a common key.
10. Create a Pandas DataFrame with hierarchical indexing and demonstrate how to select
data from specific levels.
11. Given a dataset with missing values, apply suitable Pandas methods to fill in missing
data based on a chosen strategy.
12. Using Pandas, create a pivot table from a dataset, and explain the steps involved.
13. Given a real-world dataset, describe a scenario where hierarchical indexing would be
particularly useful for data analysis.
14. Analyze a dataset using Pandas to find the mean, median, and standard deviation of a
specific numeric column. Interpret the results.
15. Compare and contrast the benefits and drawbacks of using the "concat" and "merge"
methods in Pandas for combining datasets.
16. Given a dataset containing text data, perform text preprocessing and analysis using
Pandas' vectorized string operations to extract meaningful insights.
17. Evaluate the impact of missing data on the results of a statistical analysis. Discuss
strategies to handle missing data effectively using Pandas.
18. Compare and contrast the "append" and "concat" methods in Pandas for combining
DataFrames. When would you choose one method over the other?
19. Critically analyze a case study where hierarchical indexing in Pandas was employed
to solve a complex data analysis problem. What were the key benefits of using
hierarchical indexing in this context?
20. Design a Pandas workflow to merge and clean two separate datasets with different
structures and create a single cohesive DataFrame for further analysis.
21. Create a step-by-step guide on how to perform a pivot table operation in Pandas,
including data preparation, indexing, aggregation, and visualization of results.
22. Develop a custom function in Pandas that automates the process of handling missing
data based on user-defined criteria. Provide an example of its usage.
23. Propose a data analysis project that leverages Pandas' capabilities for string
operations. Explain the problem statement and the expected outcomes.

American Graffiti
No ratings yet
American Graffiti
194 pages
Sac QB 2023-2024
No ratings yet
Sac QB 2023-2024
2 pages
Unit 2 PART B-F
No ratings yet
Unit 2 PART B-F
2 pages
Unit Ii 2M
No ratings yet
Unit Ii 2M
8 pages
DVW 203105491 - 6697 - Question - Paper
No ratings yet
DVW 203105491 - 6697 - Question - Paper
2 pages
Set-D CT2 Answerkey
No ratings yet
Set-D CT2 Answerkey
11 pages
Revision Questions
No ratings yet
Revision Questions
19 pages
Unit 1 Eda Qa (2marks)
No ratings yet
Unit 1 Eda Qa (2marks)
4 pages
VIP Question Bank For DPV For Theory Exam
No ratings yet
VIP Question Bank For DPV For Theory Exam
6 pages
De&v Two Marks Questions With Answers
No ratings yet
De&v Two Marks Questions With Answers
19 pages
Python CAT Papers
No ratings yet
Python CAT Papers
6 pages
Question Bank CIA 2
No ratings yet
Question Bank CIA 2
3 pages
Journal
No ratings yet
Journal
48 pages
Python Interview Questions For Data Analytics
No ratings yet
Python Interview Questions For Data Analytics
2 pages
Set-C AnsKey CT2
No ratings yet
Set-C AnsKey CT2
10 pages
DS Question Bank Unit-2 Part-1
No ratings yet
DS Question Bank Unit-2 Part-1
1 page
Python Interview Questions
No ratings yet
Python Interview Questions
8 pages
Set-B - CT2 - AnswerKey
No ratings yet
Set-B - CT2 - AnswerKey
10 pages
DATASCIENCE (Unit-1) Question Bank
No ratings yet
DATASCIENCE (Unit-1) Question Bank
6 pages
Python Unit 2 Question Bank
No ratings yet
Python Unit 2 Question Bank
5 pages
Python Interview Questions by Skill Arbitrage
No ratings yet
Python Interview Questions by Skill Arbitrage
3 pages
DVW 203105491 - 5926 - Question - Paper
No ratings yet
DVW 203105491 - 5926 - Question - Paper
2 pages
Assignment 1
No ratings yet
Assignment 1
2 pages
Ocs353 DCF
No ratings yet
Ocs353 DCF
4 pages
Data Science Mid-II Question Bank
No ratings yet
Data Science Mid-II Question Bank
1 page
DAL Oral QB
No ratings yet
DAL Oral QB
2 pages
Data Science Exam Solution
No ratings yet
Data Science Exam Solution
12 pages
Unit 4 Fod
100% (1)
Unit 4 Fod
21 pages
IA1
No ratings yet
IA1
3 pages
Question Bank DEV
No ratings yet
Question Bank DEV
16 pages
Xii - Cs Chapter - 11 To 16
No ratings yet
Xii - Cs Chapter - 11 To 16
2 pages
MY Question Bank
No ratings yet
MY Question Bank
3 pages
Ad3301 - Dev - 5 Units Question Bank
No ratings yet
Ad3301 - Dev - 5 Units Question Bank
16 pages
Question Bank
No ratings yet
Question Bank
18 pages
DS MCQ Semester Suggesstion
No ratings yet
DS MCQ Semester Suggesstion
26 pages
MCQ FDS
No ratings yet
MCQ FDS
5 pages
DS Final
No ratings yet
DS Final
46 pages
Python Pandas
No ratings yet
Python Pandas
15 pages
Dav End Sem
No ratings yet
Dav End Sem
2 pages
Ip QP 1
No ratings yet
Ip QP 1
11 pages
Q.1 Explain Process of Working With Data From Files in Data Science
No ratings yet
Q.1 Explain Process of Working With Data From Files in Data Science
20 pages
14 - QP Ip-01-1
No ratings yet
14 - QP Ip-01-1
8 pages
Class 12 KVS Material 2024-25 Part - II
No ratings yet
Class 12 KVS Material 2024-25 Part - II
61 pages
PCED Aufgaben en
No ratings yet
PCED Aufgaben en
40 pages
IP Pyq's
No ratings yet
IP Pyq's
109 pages
Where
No ratings yet
Where
22 pages
All in One Xii Ip QP Ms 2024
No ratings yet
All in One Xii Ip QP Ms 2024
172 pages
Computational Thinking Theory Answers
No ratings yet
Computational Thinking Theory Answers
2 pages
12pb24ip01 QP
No ratings yet
12pb24ip01 QP
12 pages
Data Science in Society Cat
No ratings yet
Data Science in Society Cat
5 pages
Question Bank - Python Cia 3
No ratings yet
Question Bank - Python Cia 3
3 pages
Common Python Data Science Interview Questions1
No ratings yet
Common Python Data Science Interview Questions1
5 pages
QP-1PB-IP-2024 Set 1
No ratings yet
QP-1PB-IP-2024 Set 1
9 pages
FDS Model
No ratings yet
FDS Model
4 pages
DBDM, FDS, Ds Model QP
No ratings yet
DBDM, FDS, Ds Model QP
5 pages
Asm 11152
No ratings yet
Asm 11152
70 pages
6205solved Ip CL Xii 2020
No ratings yet
6205solved Ip CL Xii 2020
11 pages
Unit 2 - Visualizing Using Matplotlib
No ratings yet
Unit 2 - Visualizing Using Matplotlib
32 pages
Unit 1 - Exploratory Data Analysis
No ratings yet
Unit 1 - Exploratory Data Analysis
60 pages
Eda Unit 5 Notes
No ratings yet
Eda Unit 5 Notes
13 pages
ccs346 Eda
No ratings yet
ccs346 Eda
2 pages
Database Management System CS3492 - REGULATION 2021 Downloaded From Stucor App
No ratings yet
Database Management System CS3492 - REGULATION 2021 Downloaded From Stucor App
13 pages
Dbms Unit 3 Notes
No ratings yet
Dbms Unit 3 Notes
29 pages
Dbms Unit V Notes
No ratings yet
Dbms Unit V Notes
27 pages
Productblad Loctite 5922 229862 NL
No ratings yet
Productblad Loctite 5922 229862 NL
1 page
Unit-3-Waves-Definitions and Formula Sheet
No ratings yet
Unit-3-Waves-Definitions and Formula Sheet
3 pages
Influence of Apparatus Geometry and Deposition Conditions On The Structure and Topography of Thick Sputtered Coatings
No ratings yet
Influence of Apparatus Geometry and Deposition Conditions On The Structure and Topography of Thick Sputtered Coatings
6 pages
Affidavit To Designate Guardian
No ratings yet
Affidavit To Designate Guardian
4 pages
Untitled
No ratings yet
Untitled
48 pages
Apollo Vs MRF: An Analysis of The Indian Tyre Industry
No ratings yet
Apollo Vs MRF: An Analysis of The Indian Tyre Industry
17 pages
Advance Python Programming
0% (1)
Advance Python Programming
184 pages
Seedfolks Reflective Journal Entry
No ratings yet
Seedfolks Reflective Journal Entry
1 page
0 07 A 0114 NOVA Transmitter Insert 22072013-BM-web
No ratings yet
0 07 A 0114 NOVA Transmitter Insert 22072013-BM-web
2 pages
Literary Criticism (LITT 501) October 13, 2018 Deautomatizing Perception
No ratings yet
Literary Criticism (LITT 501) October 13, 2018 Deautomatizing Perception
4 pages
Chap 17 Reading Worksheet
No ratings yet
Chap 17 Reading Worksheet
5 pages
History of Japanese Culture - Wiki - Wiki
No ratings yet
History of Japanese Culture - Wiki - Wiki
1 page
Folklore - An Encyclopedia of Beliefs, Customs, Tales, Music and Art (Gnv64)
100% (1)
Folklore - An Encyclopedia of Beliefs, Customs, Tales, Music and Art (Gnv64)
930 pages
Epson WF C5790 Product Brochure
No ratings yet
Epson WF C5790 Product Brochure
2 pages
Mini Series
100% (1)
Mini Series
66 pages
BIM Modeler Designer Portfolio 1744863806
No ratings yet
BIM Modeler Designer Portfolio 1744863806
42 pages
Present Simple
No ratings yet
Present Simple
4 pages
Unit 4 Grammar Summary
No ratings yet
Unit 4 Grammar Summary
14 pages
Arsha Adkar Business Worksheet
No ratings yet
Arsha Adkar Business Worksheet
4 pages
Documentation Report - Ammungan Festival 2019
No ratings yet
Documentation Report - Ammungan Festival 2019
12 pages
Saiva Siddhanta Church Act, No 22 of 1988
No ratings yet
Saiva Siddhanta Church Act, No 22 of 1988
2 pages
Partying in Prague
No ratings yet
Partying in Prague
4 pages
Unit - II Architectural Framework For IoT Systems
No ratings yet
Unit - II Architectural Framework For IoT Systems
13 pages
Current Electricity f1
No ratings yet
Current Electricity f1
4 pages
MCIRMARCH0B
No ratings yet
MCIRMARCH0B
4 pages
Comics and Novelization A Literary History of Bandes Dessines Benot Glaude PDF Download
No ratings yet
Comics and Novelization A Literary History of Bandes Dessines Benot Glaude PDF Download
76 pages
Micro Optical Tech Letters - 2007 - Luo - Multilayer Frequency Selective Surface With Grating Lobe Suppression
No ratings yet
Micro Optical Tech Letters - 2007 - Luo - Multilayer Frequency Selective Surface With Grating Lobe Suppression
3 pages
The Weakest Link, But Not Goodbye: India and Southeast Asia: A Plus' Up in Relations
No ratings yet
The Weakest Link, But Not Goodbye: India and Southeast Asia: A Plus' Up in Relations
14 pages
9685 2018 2019 AGU Int Students Req
No ratings yet
9685 2018 2019 AGU Int Students Req
22 pages

Question Bank (1&2)

Uploaded by

Question Bank (1&2)

Uploaded by

QUESTIONS BANK FOR EDA UNIT 1

1. What does EDA stand for in data science?

QUESTION BANK FOR UNIT 2

1. Define what Pandas is and explain its significance in data manipulation.

You might also like