0% found this document useful (0 votes)
3 views3 pages

Datavisualization Interview

The document provides an overview of data visualization techniques using Seaborn and Matplotlib, highlighting various types of plots such as scatterplots, line plots, bar plots, and boxplots. It also discusses the importance of standard error, standard deviation, hypothesis testing, and the distinction between business analytics and business intelligence. Additionally, it covers concepts like probability density functions and the role of data warehouses in data processing and visualization.

Uploaded by

naincy435asati
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views3 pages

Datavisualization Interview

The document provides an overview of data visualization techniques using Seaborn and Matplotlib, highlighting various types of plots such as scatterplots, line plots, bar plots, and boxplots. It also discusses the importance of standard error, standard deviation, hypothesis testing, and the distinction between business analytics and business intelligence. Additionally, it covers concepts like probability density functions and the role of data warehouses in data processing and visualization.

Uploaded by

naincy435asati
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
You are on page 1/ 3

data visualization is graphical representation of data using visual element like

chart,graph and plot.

if you're new to making plots or need simple, good-looking ones quickly, Seaborn is
a great option.
But if you need more control over your plots or want to create more complicated
ones,
Matplotlib is better for that.

Relational Plots in Seaborn

Scatterplot-are used to findout relationship between two quantitive variables.

A line plot is a relational data visualization showing how one continuous variable
changes when another does. It's one of the most common graphs widely used
in finance, sales, marketing, healthcare, natural sciences, and more.

A relplot is a function in Seaborn that makes it easier to create scatter plots and
line plots.
It can also automatically organize your data into multiple smaller plots (called
"facets") to compare different parts of your data.
This tool is helpful because it saves you time by managing both the type of plot
and
the layout of the plots without needing to write a lot of extra code.

Categorical Plots in Seaborn

A bar plot is useful for comparing categories. It displays the relationship between
a categorical variable and a continuous variable.

Count plots are similar to bar plots but they display the number of occurrences of
each category in a dataset, making them useful for categorical data distribution
analysis.

A boxplot, also known as a box plot, box plots, or box-and-whisker plot, is a


standardized way of displaying the distribution of a data set based on its five-
number summary of data points: the “minimum,”
first quartile [Q1], median, third quartile [Q3] and “maximum.”

A violin plot (violinplot) combines a box plot with a kernel density plot, showing
the distribution, probability density, and central tendencies of the data.
It provides a detailed view of the data's distribution

strip plot

It's like a scatter plot, but for categories instead of continuous data. The
points are usually spread out along a line to show how they are distributed. Often,
it is used together with other graphs like box plots or violin plots to give a
clearer picture of the data.
It helps to see where most of the data points are grouped and also helps spot any
unusual data points (outliers).

A swarm plot displays individual data points for one or more categorical variables,
similar to a strip plot, but adjusts points to avoid overlap.
It provides a clear view of the distribution and density of the data.

Distribution Plots in Seaborn

A histogram (histplot) displays the distribution of a continuous variable by


dividing data into bins and plotting the frequency of data points in each bin. It
provides insights into the data's central tendency, dispersion, and shape.

A kernel density estimate (KDE) plot is a way to show how the data is spread out.

A distribution plot (displot) is a versatile seaborn function that allows for


visualization of univariate distributions. It can combine histograms, kernel
density estimates, and rug plots to provide insights into the distribution of a
single variable.

Heatmaps visualize data in a matrix format, where values are represented by color.
This is ideal for displaying correlations or other tabular data.

A pairs plot allows us to see both distribution of single variables and


relationships between two variables.

The standard error of the mean, or simply standard error,


indicates how different the population mean is likely to be from a sample mean.
The standard error is a common measure of sampling error—the difference between
a population parameter and a sample statistic.
By calculating standard error, you can estimate how representative your sample is
of your population and make valid conclusions.

The standard deviation describes variability within a single sample.


The standard error estimates the variability across multiple samples of a
population.
The standard deviation is a descriptive statistic that can be calculated from
sample data.
In contrast, the standard error is an inferential statistic that can only be
estimated (unless the real population parameter is known).

Hypothesis testing is used to verify whether the results of an experiment are valid
or not by using the null and alternate hypotheses.

probiltity density function is use to calculate probility of specific outcome of


random varible.
probility dencity is a relation ship between outcome of random variable and its
probilties.

F-test
Business analytics refers to the statistical methods and computing technologies for
processing,
mining and visualizing data to uncover patterns, relationships and insights that
enable better business decision-making.

business intelligence is primarily concerned with building reports and dashboards


to tell a story or describe the current or past state of the business.

datawhare house is central

You might also like