0% found this document useful (0 votes)
777 views8 pages

Data Exploration and Visualization - AD3301 - Important Questions With Answer - Unit 1 - Exploratory Data Analysis

Uploaded by

jeyastephen07
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
777 views8 pages

Data Exploration and Visualization - AD3301 - Important Questions With Answer - Unit 1 - Exploratory Data Analysis

Uploaded by

jeyastephen07
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

Click on Subject/Paper under Semester to enter.

Professional English Discrete Mathematics Environmental Sciences


Professional English - - II - HS3252 - MA3354 and Sustainability -
I - HS3152 GE3451
Digital Principles and
Statistics and Probability and
Computer Organization
Matrices and Calculus Numerical Methods - Statistics - MA3391
- CS3351
- MA3151 MA3251
3rd Semester
1st Semester

4th Semester
2nd Semester

Database Design and Operating Systems -


Engineering Physics - Engineering Graphics
Management - AD3391 AL3452
PH3151 - GE3251

Physics for Design and Analysis of Machine Learning -


Engineering Chemistry Information Science Algorithms - AD3351 AL3451
- CY3151 - PH3256
Data Exploration and Fundamentals of Data
Basic Electrical and
Visualization - AD3301 Science and Analytics
Problem Solving and Electronics Engineering -
BE3251 - AD3491
Python Programming -
GE3151 Artificial Intelligence
Data Structures Computer Networks
- AL3391
Design - AD3251 - CS3591

Deep Learning -
AD3501

Embedded Systems
Data and Information Human Values and
and IoT - CS3691
5th Semester

Security - CW3551 Ethics - GE3791


6th Semester

7th Semester

8th Semester

Open Elective-1
Distributed Computing Open Elective 2
- CS3551 Project Work /
Elective-3
Open Elective 3 Intership
Big Data Analytics - Elective-4
CCS334 Open Elective 4
Elective-5
Elective 1 Management Elective
Elective-6
Elective 2
All Computer Engg Subjects - [ B.E., M.E., ] (Click on Subjects to enter)
Programming in C Computer Networks Operating Systems
Programming and Data Programming and Data Problem Solving and Python
Structures I Structure II Programming
Database Management Systems Computer Architecture Analog and Digital
Communication
Design and Analysis of Microprocessors and Object Oriented Analysis
Algorithms Microcontrollers and Design
Software Engineering Discrete Mathematics Internet Programming
Theory of Computation Computer Graphics Distributed Systems
Mobile Computing Compiler Design Digital Signal Processing
Artificial Intelligence Software Testing Grid and Cloud Computing
Data Ware Housing and Data Cryptography and Resource Management
Mining Network Security Techniques
Service Oriented Architecture Embedded and Real Time Multi - Core Architectures
Systems and Programming
Probability and Queueing Theory Physics for Information Transforms and Partial
Science Differential Equations
Technical English Engineering Physics Engineering Chemistry
Engineering Graphics Total Quality Professional Ethics in
Management Engineering
Basic Electrical and Electronics Problem Solving and Environmental Science and
and Measurement Engineering Python Programming Engineering
www.BrainKart.com
4931_Grace College of Engineering, Thoothukudi

DEPARTMENT OF ARTIFICIAL INTELLIGENCE AND DATA SCIENCE

B.Tech. – Artificial Intelligence and Data Science

Anna University Regulation: 2021

AD3301 – Data Exploration and Visualization

II Year / III Semester

QUESTION BANK

Unit I - Exploratory Data Analysis

Prepared By,

Dr. I. Felcia Jerlin, ASP/CSE

AD3301_DEV

https://play.google.com/store/apps/details?id=info.therithal.brainkart.annauniversitynotes&hl=en_IN
www.BrainKart.com
4931_Grace College of Engineering, Thoothukudi

QUESTION BANK
AD3301 – DATA EXPLORATION AND VISUALIZATION

UNIT I EXPLORATORY DATA ANALYSIS


EDA fundamentals – Understanding data science – Significance of EDA – Making sense of
data – Comparing EDA with classical and Bayesian analysis – Software tools for EDA -
Visual Aids for EDA- Data transformation techniques-merging database, reshaping and
pivoting, Transformation techniques - Grouping Datasets - data aggregation – Pivot tables
and cross-tabulations.

PART – A

1. Define Exploratory Data Analysis (EDA)?

EDA is the process of examining and visualizing data to uncover patterns, trends, and
insights before more advanced analyses.
2. What is the significance of EDA in data science?
EDA is crucial in data science as it helps identify patterns, outliers, and data quality issues,
providing a foundation for further analysis.
3. Differentiate EDA from classical statistical analysis?
EDA focuses on visual exploration, while classical statistical analysis involves hypothesis
testing and parameter estimation.
4. Why is making sense of data important in EDA?
Making sense of data involves extracting meaningful information, enabling informed
decisions and insights.
5. Compare EDA with Bayesian analysis?
EDA is non-parametric and exploratory, while Bayesian analysis incorporates prior
knowledge and updates probabilities based on new data.
6. Name two software tools commonly used for EDA?
Pandas and Matplotlib are commonly used tools for EDA in Python.
7. Define data transformation techniques in EDA?
Data transformation techniques include normalization, scaling, and handling missing values
to prepare data for analysis.

AD3301_DEV

https://play.google.com/store/apps/details?id=info.therithal.brainkart.annauniversitynotes&hl=en_IN
www.BrainKart.com
4931_Grace College of Engineering, Thoothukudi

8. What is the purpose of merging databases in EDA?


Merging databases combines datasets based on common identifiers to create a unified dataset
for analysis.
9. Differentiate between reshaping and pivoting in EDA?
Reshaping transforms data between wide and long formats, while pivoting reorganizes data to
create a new structure.
10. Define data aggregation in EDA?
Data aggregation involves summarizing grouped data using functions like sum, mean, or
count.
11. How do pivot tables aid in EDA?
Pivot tables facilitate multidimensional analysis and summarization of data in a tabular
format.
12. What visual aids are commonly used in EDA?
Histograms, box plots, scatter plots, and heatmaps are common visual aids in EDA for
understanding data distributions and relationships.
13. Define the concept of grouping datasets in EDA?
Grouping datasets involves creating subsets based on certain criteria, enabling focused
analysis on specific segments.
14. Why is cross-tabulation useful in EDA?
Cross-tabulation is useful in EDA for displaying the frequency distribution of variables in a
contingency table.
15. Name a transformation technique in EDA for handling outliers?
Winsorizing is a transformation technique that involves replacing extreme values with less
extreme values to handle outliers.
16. Define the term "data normalization" in EDA?
Data normalization in EDA is the process of rescaling variables to a standard range, typically
between 0 and 1.
17. What is the role of visual aids like violin plots in EDA?
Violin plots display the distribution of data, providing insights into both central tendency and
spread.
18. Define the concept of data scaling in EDA?
Data scaling in EDA involves transforming variables to have a similar scale, preventing
dominance by certain features.

AD3301_DEV

https://play.google.com/store/apps/details?id=info.therithal.brainkart.annauniversitynotes&hl=en_IN
www.BrainKart.com
4931_Grace College of Engineering, Thoothukudi

19. How does EDA contribute to data science projects?


EDA contributes by providing an initial understanding of data, guiding subsequent modeling
and analysis decisions.
20. Why are pivot tables and cross-tabulations useful in summarizing data?
Pivot tables and cross-tabulations provide a concise summary of data, making it easier to
identify patterns and trends across different dimensions.

PART – B

1. Explain the Purpose of EDA


2. Differentiate EDA from Classical Analysis
3. Illustrate Visual Aids in EDA
4. Describe Data Transformation in EDA
5. Explore the Significance of Grouping Datasets and how it aids in focused analysis.
6. Explain the Role of Data Aggregation
7. Illustrate the Application of Pivot Tables
8. Compare EDA with Bayesian Analysis:

AD3301_DEV

https://play.google.com/store/apps/details?id=info.therithal.brainkart.annauniversitynotes&hl=en_IN
Click on Subject/Paper under Semester to enter.
Professional English Discrete Mathematics Environmental Sciences
Professional English - - II - HS3252 - MA3354 and Sustainability -
I - HS3152 GE3451
Digital Principles and
Statistics and Probability and
Computer Organization
Matrices and Calculus Numerical Methods - Statistics - MA3391
- CS3351
- MA3151 MA3251
3rd Semester
1st Semester

4th Semester
2nd Semester

Database Design and Operating Systems -


Engineering Physics - Engineering Graphics
Management - AD3391 AL3452
PH3151 - GE3251

Physics for Design and Analysis of Machine Learning -


Engineering Chemistry Information Science Algorithms - AD3351 AL3451
- CY3151 - PH3256
Data Exploration and Fundamentals of Data
Basic Electrical and
Visualization - AD3301 Science and Analytics
Problem Solving and Electronics Engineering -
BE3251 - AD3491
Python Programming -
GE3151 Artificial Intelligence
Data Structures Computer Networks
- AL3391
Design - AD3251 - CS3591

Deep Learning -
AD3501

Embedded Systems
Data and Information Human Values and
and IoT - CS3691
5th Semester

Security - CW3551 Ethics - GE3791


6th Semester

7th Semester

8th Semester

Open Elective-1
Distributed Computing Open Elective 2
- CS3551 Project Work /
Elective-3
Open Elective 3 Intership
Big Data Analytics - Elective-4
CCS334 Open Elective 4
Elective-5
Elective 1 Management Elective
Elective-6
Elective 2
All Computer Engg Subjects - [ B.E., M.E., ] (Click on Subjects to enter)
Programming in C Computer Networks Operating Systems
Programming and Data Programming and Data Problem Solving and Python
Structures I Structure II Programming
Database Management Systems Computer Architecture Analog and Digital
Communication
Design and Analysis of Microprocessors and Object Oriented Analysis
Algorithms Microcontrollers and Design
Software Engineering Discrete Mathematics Internet Programming
Theory of Computation Computer Graphics Distributed Systems
Mobile Computing Compiler Design Digital Signal Processing
Artificial Intelligence Software Testing Grid and Cloud Computing
Data Ware Housing and Data Cryptography and Resource Management
Mining Network Security Techniques
Service Oriented Architecture Embedded and Real Time Multi - Core Architectures
Systems and Programming
Probability and Queueing Theory Physics for Information Transforms and Partial
Science Differential Equations
Technical English Engineering Physics Engineering Chemistry
Engineering Graphics Total Quality Professional Ethics in
Management Engineering
Basic Electrical and Electronics Problem Solving and Environmental Science and
and Measurement Engineering Python Programming Engineering

You might also like