0% found this document useful (0 votes)
16 views5 pages

Unit-1&2 Viva Ques

The document discusses various data visualization techniques including histograms, pie charts, bar charts and scatter plots. It also covers identifying outliers with box plots and topics like data handling, transformation and sanity checks. Furthermore, it discusses the Python library Seaborn for statistical data visualization and creating visualizations like regplots, pair plots and heatmaps. Additionally, it mentions another library called Plotly for interactive online graphs.

Uploaded by

snippet one
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
16 views5 pages

Unit-1&2 Viva Ques

The document discusses various data visualization techniques including histograms, pie charts, bar charts and scatter plots. It also covers identifying outliers with box plots and topics like data handling, transformation and sanity checks. Furthermore, it discusses the Python library Seaborn for statistical data visualization and creating visualizations like regplots, pair plots and heatmaps. Additionally, it mentions another library called Plotly for interactive online graphs.

Uploaded by

snippet one
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

UNIT-1

Necessity of Data Visualization:


Q: Why is data visualization important in data analysis?
A: Data visualization helps in understanding complex data by
presenting it in a visual format, making it easier to identify
patterns, trends, and relationships.

Data Handling and Cleaning – I:


Q: How do you handle missing values in a dataset?
A: Missing values can be handled by either removing the rows
or columns containing missing values, or by imputing them
using techniques like mean, median, or mode imputation.

Data Handling and Cleaning – II:


Q: What is the purpose of data transformation?
A: Data transformation is used to convert data from one form
to another, such as converting categorical data to numerical
data or normalizing data to a common scale.

Sanity Checks:
Q: What are sanity checks in data analysis?
A: Sanity checks are used to verify the integrity and accuracy
of data by comparing it against known values or expectations.
Outliers Analysis with Box Plots:
Q: How do you identify outliers using box plots?
A: Outliers can be identified using box plots by looking for
data points that fall outside the whiskers of the box plot,
which are typically defined as 1.5 times the interquartile
range.

Histograms:
Q: What is a histogram?
A: A histogram is a graphical representation of the distribution
of numerical data, where the data is divided into bins and the
frequency of each bin is plotted.

Pie Charts:
Q: When is it appropriate to use a pie chart?
A: Pie charts are appropriate for showing the composition of a
whole, such as the distribution of different categories within a
dataset.

Bar Charts:
Q: How do you interpret a bar chart?
A: A bar chart is used to compare the values of different
categories, where the height of each bar represents the value
of the category.
Scatter Plots:
Q: What is a scatter plot?
A: A scatter plot is a graphical representation of the
relationship between two numerical variables, where each
data point is represented by a dot on the plot.

Data Visualization Tools:


Q: What are some popular data visualization tools?
A: Some popular data visualization tools include matplotlib,
seaborn, ggplot2, and Tableau.

UNIT-2
Introduction to Seaborn:
Q: What is Seaborn?
A: Seaborn is a Python data visualization library based on
matplotlib that provides a high-level interface for drawing
attractive and informative statistical graphics.

Styling Options:
Q: How can you change the style of plots in Seaborn?
A: You can change the style of plots in Seaborn using the
sns.set_style() function, which allows you to choose from
different predefined styles such as "darkgrid", "whitegrid",
"dark", "white", and "ticks".
Regplots:
Q: What is a regplot in Seaborn?
A: A regplot is a scatter plot with a linear regression line fit to
the data, which allows you to visualize the relationship
between two numerical variables and assess the strength of
the relationship.

Pair Plots:
Q: What is a pair plot in Seaborn?
A: A pair plot is a grid of scatter plots that shows the
relationship between pairs of variables in a dataset, making it
easy to identify patterns and correlations.

Heatmaps:
Q: What is a heatmap in Seaborn?
A: A heatmap is a graphical representation of data where the
values of a matrix are represented as colors, making it easy to
visualize patterns and relationships in the data.

Line Charts:
Q: How can you create a line chart in Seaborn?
A: You can create a line chart in Seaborn using the
sns.lineplot() function, which allows you to plot the
relationship between two numerical variables over time or
another continuous variable.
Stacked Bar Charts:
Q: How can you create a stacked bar chart in Seaborn?
A: You can create a stacked bar chart in Seaborn using the
sns.barplot() function with the hue parameter set to a
categorical variable, which will stack the bars for each
category within the variable.

Case Study Summary:


Q: Can you provide a summary of a case study where Seaborn
was used for data visualization?
A: A case study summary could include details about the
dataset used, the types of plots created, the insights gained
from the visualizations, and any challenges encountered
during the analysis.

Plotly:
Q: What is Plotly?
A: Plotly is a Python graphing library that makes interactive,
publication-quality graphs online. It provides a wide range of
chart types, including line charts, scatter plots, bar charts, and
heatmaps, and allows for customization and interactivity.

You might also like