Datavisualization Interview
Datavisualization Interview
if you're new to making plots or need simple, good-looking ones quickly, Seaborn is
a great option.
But if you need more control over your plots or want to create more complicated
ones,
Matplotlib is better for that.
A line plot is a relational data visualization showing how one continuous variable
changes when another does. It's one of the most common graphs widely used
in finance, sales, marketing, healthcare, natural sciences, and more.
A relplot is a function in Seaborn that makes it easier to create scatter plots and
line plots.
It can also automatically organize your data into multiple smaller plots (called
"facets") to compare different parts of your data.
This tool is helpful because it saves you time by managing both the type of plot
and
the layout of the plots without needing to write a lot of extra code.
A bar plot is useful for comparing categories. It displays the relationship between
a categorical variable and a continuous variable.
Count plots are similar to bar plots but they display the number of occurrences of
each category in a dataset, making them useful for categorical data distribution
analysis.
A violin plot (violinplot) combines a box plot with a kernel density plot, showing
the distribution, probability density, and central tendencies of the data.
It provides a detailed view of the data's distribution
strip plot
It's like a scatter plot, but for categories instead of continuous data. The
points are usually spread out along a line to show how they are distributed. Often,
it is used together with other graphs like box plots or violin plots to give a
clearer picture of the data.
It helps to see where most of the data points are grouped and also helps spot any
unusual data points (outliers).
A swarm plot displays individual data points for one or more categorical variables,
similar to a strip plot, but adjusts points to avoid overlap.
It provides a clear view of the distribution and density of the data.
A kernel density estimate (KDE) plot is a way to show how the data is spread out.
Heatmaps visualize data in a matrix format, where values are represented by color.
This is ideal for displaying correlations or other tabular data.
Hypothesis testing is used to verify whether the results of an experiment are valid
or not by using the null and alternate hypotheses.
F-test
Business analytics refers to the statistical methods and computing technologies for
processing,
mining and visualizing data to uncover patterns, relationships and insights that
enable better business decision-making.