0% found this document useful (0 votes)
177 views8 pages

Data Exploration and Visualization - AD3301 - Important Questions With Answer - Unit 3 - Univariate Analysis

DE&V QB

Uploaded by

nathanarokia9
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
177 views8 pages

Data Exploration and Visualization - AD3301 - Important Questions With Answer - Unit 3 - Univariate Analysis

DE&V QB

Uploaded by

nathanarokia9
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

Click on Subject/Paper under Semester to enter.

Professional English Discrete Mathematics Environmental Sciences


Professional English - - II - HS3252 - MA3354 and Sustainability -
I - HS3152 GE3451
Digital Principles and
Statistics and Probability and
Computer Organization
Matrices and Calculus Numerical Methods - Statistics - MA3391
- CS3351
- MA3151 MA3251
3rd Semester
1st Semester

4th Semester
2nd Semester

Database Design and Operating Systems -


Engineering Physics - Engineering Graphics
Management - AD3391 AL3452
PH3151 - GE3251

Physics for Design and Analysis of Machine Learning -


Engineering Chemistry Information Science Algorithms - AD3351 AL3451
- CY3151 - PH3256
Data Exploration and Fundamentals of Data
Basic Electrical and
Visualization - AD3301 Science and Analytics
Problem Solving and Electronics Engineering -
BE3251 - AD3491
Python Programming -
GE3151 Artificial Intelligence
Data Structures Computer Networks
- AL3391
Design - AD3251 - CS3591

Deep Learning -
AD3501

Embedded Systems
Data and Information Human Values and
and IoT - CS3691
5th Semester

Security - CW3551 Ethics - GE3791


6th Semester

7th Semester

8th Semester

Open Elective-1
Distributed Computing Open Elective 2
- CS3551 Project Work /
Elective-3
Open Elective 3 Intership
Big Data Analytics - Elective-4
CCS334 Open Elective 4
Elective-5
Elective 1 Management Elective
Elective-6
Elective 2
All Computer Engg Subjects - [ B.E., M.E., ] (Click on Subjects to enter)
Programming in C Computer Networks Operating Systems
Programming and Data Programming and Data Problem Solving and Python
Structures I Structure II Programming
Database Management Systems Computer Architecture Analog and Digital
Communication
Design and Analysis of Microprocessors and Object Oriented Analysis
Algorithms Microcontrollers and Design
Software Engineering Discrete Mathematics Internet Programming
Theory of Computation Computer Graphics Distributed Systems
Mobile Computing Compiler Design Digital Signal Processing
Artificial Intelligence Software Testing Grid and Cloud Computing
Data Ware Housing and Data Cryptography and Resource Management
Mining Network Security Techniques
Service Oriented Architecture Embedded and Real Time Multi - Core Architectures
Systems and Programming
Probability and Queueing Theory Physics for Information Transforms and Partial
Science Differential Equations
Technical English Engineering Physics Engineering Chemistry
Engineering Graphics Total Quality Professional Ethics in
Management Engineering
Basic Electrical and Electronics Problem Solving and Environmental Science and
and Measurement Engineering Python Programming Engineering
www.BrainKart.com Page 1 of 4

4931_Grace College of Engineering, Thoothukudi

DEPARTMENT OF ARTIFICIAL INTELLIGENCE AND DATA SCIENCE

B.Tech. – Artificial Intelligence and Data Science

Anna University Regulation: 2021

AD3301 – Data Exploration and Visualization

II Year / III Semester

QUESTION BANK

Unit III - Univariate Analysis

Prepared By,

Dr. I. Felcia Jerlin, ASP/CSE

AD3301_DEV

https://play.google.com/store/apps/details?id=info.therithal.brainkart.annauniversitynotes&hl=en_IN
www.BrainKart.com Page 2 of 4

4931_Grace College of Engineering, Thoothukudi

QUESTION BANK
AD3301 – DATA EXPLORATION AND VISUALIZATION

UNIT III UNIVARIATE ANALYSIS


Introduction to Single variable: Distributions and Variables - Numerical Summaries of Level
and Spread - Scaling and Standardizing – Inequality - Smoothing Time Series.

PART – A
1. Define Single Variable?
A single variable represents an individual characteristic, attribute, or measurement that can
take different values in a dataset.
2. Differentiate between Level and Spread in Data?
Level refers to the central tendency or average of a dataset, while spread represents the
variability or dispersion of the data.
3. What are Numerical Summaries of Level and Spread?
Numerical summaries of level include measures like mean and median, while those of spread
involve measures like range, variance, and standard deviation.
4. Define Scaling and Standardizing?
Scaling adjusts the numerical range of a variable, while standardizing transforms it to have a
mean of 0 and a standard deviation of 1.
5. Differentiate between Scaling and Standardizing?
Scaling changes the range of values, while standardizing rescales to have a mean of 0 and a
standard deviation of 1.
6. Explain the Concept of Inequality in Data?
Inequality in data refers to the uneven distribution of values, where some values occur more
frequently than others.
7. Define Time Series Smoothing?
Time series smoothing involves removing noise or short-term fluctuations from a dataset to
highlight underlying trends.
8. Differentiate between Mean and Median?
The mean is the average value, while the median is the middle value in a sorted dataset.
9. What is the Purpose of Scaling Variables?
Scaling variables is done to bring them to a comparable numerical range, avoiding dominance
by variables with larger magnitudes.

AD3301_DEV

https://play.google.com/store/apps/details?id=info.therithal.brainkart.annauniversitynotes&hl=en_IN
www.BrainKart.com Page 3 of 4

4931_Grace College of Engineering, Thoothukudi

10. Define Interquartile Range (IQR)?


The Interquartile Range (IQR) is the range between the first and third quartiles, representing
the central 50% of the data.
11. Differentiate between Range and Standard Deviation?
Range is the difference between the maximum and minimum values, while standard deviation
measures the average deviation of values from the mean.
12. Explain the Role of Boxplots in Data Visualization?
Boxplots visually represent the distribution of data, showing median, quartiles, and potential
outliers.
13. Define Skewness in a Distribution?
Skewness measures the asymmetry of a distribution; positive skewness indicates a tail to the
right, and negative skewness indicates a tail to the left.
14. What is Z-Score in Standardization?
Z-Score represents the number of standard deviations a data point is from the mean in a
standardized distribution.
15. Define the Coefficient of Variation (CV)?
The Coefficient of Variation (CV) is the ratio of the standard deviation to the mean,
providing a relative measure of variability.
16. Differentiate between Time Series and Cross-Sectional Data?
Time series data is collected over time, while cross-sectional data is collected at a single point
in time.
17. Explain the Concept of Kernel Density Estimation?
Kernel Density Estimation is a non-parametric method to estimate the probability density
function of a random variable, providing a smooth curve.
18. Define Outliers in a Dataset?
Outliers are data points that deviate significantly from the rest of the dataset and may distort
statistical analyses.
19. Explain the Importance of Scaling in Machine Learning?
Scaling is important in machine learning to ensure that all features contribute equally to
model training, preventing bias towards variables with larger scales.
20. What is the Purpose of Time Series Smoothing Techniques?
Time series smoothing techniques aim to reveal underlying trends by reducing noise and
short-term fluctuations in data.

AD3301_DEV

https://play.google.com/store/apps/details?id=info.therithal.brainkart.annauniversitynotes&hl=en_IN
www.BrainKart.com Page 4 of 4

4931_Grace College of Engineering, Thoothukudi

PART – B
1. Explain the Significance of Numerical Summaries in Single Variable Analysis?
2. Illustrate the Process of Scaling and Standardizing Variables?
3. Describe the Role of Interquartile Range (IQR) in Data Analysis?
4. Explain the Concept of Skewness and Its Impact on Distributions?
5. Illustrate the Application of Kernel Density Estimation in Data Visualization?
6. Describe the Importance of Outlier Detection in Data Analysis?
7. Explain the Purpose of Time Series Smoothing Techniques?
8. Provide a step-by-step illustration of calculating the Coefficient of Variation (CV)?
9. Explain how it offers a relative measure of variability and its interpretation in different
scenarios?

AD3301_DEV

https://play.google.com/store/apps/details?id=info.therithal.brainkart.annauniversitynotes&hl=en_IN
Click on Subject/Paper under Semester to enter.
Professional English Discrete Mathematics Environmental Sciences
Professional English - - II - HS3252 - MA3354 and Sustainability -
I - HS3152 GE3451
Digital Principles and
Statistics and Probability and
Computer Organization
Matrices and Calculus Numerical Methods - Statistics - MA3391
- CS3351
- MA3151 MA3251
3rd Semester
1st Semester

4th Semester
2nd Semester

Database Design and Operating Systems -


Engineering Physics - Engineering Graphics
Management - AD3391 AL3452
PH3151 - GE3251

Physics for Design and Analysis of Machine Learning -


Engineering Chemistry Information Science Algorithms - AD3351 AL3451
- CY3151 - PH3256
Data Exploration and Fundamentals of Data
Basic Electrical and
Visualization - AD3301 Science and Analytics
Problem Solving and Electronics Engineering -
BE3251 - AD3491
Python Programming -
GE3151 Artificial Intelligence
Data Structures Computer Networks
- AL3391
Design - AD3251 - CS3591

Deep Learning -
AD3501

Embedded Systems
Data and Information Human Values and
and IoT - CS3691
5th Semester

Security - CW3551 Ethics - GE3791


6th Semester

7th Semester

8th Semester

Open Elective-1
Distributed Computing Open Elective 2
- CS3551 Project Work /
Elective-3
Open Elective 3 Intership
Big Data Analytics - Elective-4
CCS334 Open Elective 4
Elective-5
Elective 1 Management Elective
Elective-6
Elective 2
All Computer Engg Subjects - [ B.E., M.E., ] (Click on Subjects to enter)
Programming in C Computer Networks Operating Systems
Programming and Data Programming and Data Problem Solving and Python
Structures I Structure II Programming
Database Management Systems Computer Architecture Analog and Digital
Communication
Design and Analysis of Microprocessors and Object Oriented Analysis
Algorithms Microcontrollers and Design
Software Engineering Discrete Mathematics Internet Programming
Theory of Computation Computer Graphics Distributed Systems
Mobile Computing Compiler Design Digital Signal Processing
Artificial Intelligence Software Testing Grid and Cloud Computing
Data Ware Housing and Data Cryptography and Resource Management
Mining Network Security Techniques
Service Oriented Architecture Embedded and Real Time Multi - Core Architectures
Systems and Programming
Probability and Queueing Theory Physics for Information Transforms and Partial
Science Differential Equations
Technical English Engineering Physics Engineering Chemistry
Engineering Graphics Total Quality Professional Ethics in
Management Engineering
Basic Electrical and Electronics Problem Solving and Environmental Science and
and Measurement Engineering Python Programming Engineering

You might also like