EDS Unit 5 ?
EDS Unit 5 ?
Data Visualization
Data Visualization is a technique used to understand and analyze the data
using graphical representations. It provides qualitative overview of large
datasets.
The color of the pixel reflects the value of the data point. The smaller the
value, the lighter the pixel’s color.
Example
Stick Figures
It maps data into 5 piece stick figure. A stick figure consists of 4 limbs and
one body
3. Tree Map: It is used to show hierarchical data with nested rectangles that
vary in size and color.
It is divided into slices, which represent the quantity of the category and the
data in it
Here the length of the bar corresponds to the measure of the data
Categories are placed on the vertical axis, and data points on the
horizontal axis.
Here each bar is subdivided into sub bars, stacked end to end
It shows the minimum, first quartile (Q1), median, third quartile (Q3), and
maximum of the data.
Histogram
It is a graphical representation of distribution of quantitative data
Unlike bar graphs, histogram doesn’t have fixed width, and interval spacing
Line Graphs
It is a graphical representation of two or more variables in the form of lines
or curves
Single line represents the relationship between two variables over time
There is equal space between each pair of There is no space between the
consecutive bars. consecutive bars.
The height of the bars shows the The area of rectangular bars shows the
frequency, and the width of the bars are frequency of the data and the width of the
same. bars need not to be same.
Data can be arranged in any order. Data is arranged in the order of range.
Equation:
Y = β0 + β1 X + ϵ
Where:
Equation:
Y = β0 + β1 X1 + β2 X2 + ⋯ + βn Xn + ϵ
Useful when there are multiple factors that contribute to the predicted
value.
Example: Predicting a person’s weight (Y) based on height (X1), age (X2), and
gender (X3).
Key Differences:
Linear Regression: Involves one independent variable (X) and one
dependent variable (Y).