5th Unit Fds
5th Unit Fds
Advantages:
o High density representation: Effective for large datasets
where fine detail is important.
o Efficient spatial representation: Excellent for geographic
data, heatmaps, or dense arrays of data.
3. Geometric Projection Visualization Techniques
Definition: These techniques project high-dimensional data (with many variables) onto
lower-dimensional spaces (usually 2D or 3D) to make it easier to interpret, while trying
to preserve the relationships within the data.
Common Techniques:
o Principal Component Analysis (PCA):
Definition: PCA is a technique used to reduce the
dimensionality of data by transforming it into a new set of
variables, the principal components, which explain the
most variance in the data.
Use Case: Often used for data reduction in machine
learning, pattern recognition, and large datasets.
Process: Identifies and ranks the axes (principal
components) that capture the most variation in the
dataset.
o t-SNE (t-Distributed Stochastic Neighbor Embedding):
Definition: t-SNE is a non-linear dimensionality reduction
technique that is used for the visualization of high-
dimensional data by mapping it into 2D or 3D.
Use Case: Often used for clustering and finding patterns in
complex data (e.g., neural network output).
Feature: t-SNE focuses on preserving local data structures
(similarity between points) in the lower-dimensional map.
o Multi-Dimensional Scaling (MDS):
Definition: A technique used to visualize the level of
similarity of individual cases of a dataset.
Use Case: For visualizing the distances or dissimilarities
between data points.
Example: Visualizing customer preferences where each
point represents a customer, and the distance between
points indicates similarity.
o 3D Scatter Plots:
Definition: A 3D scatter plot is a data visualization method
that represents data points in three dimensions, which can
reveal patterns, correlations, and clusters in multi-variable
data.
Use Case: Useful when examining the relationships
between three continuous variables (e.g., height, weight,
age).
Example: A scatter plot where each point represents an
individual's data on three different attributes (e.g., income,
age, and education).
Advantages:
o Data Reduction: PCA and t-SNE reduce the complexity of high-
dimensional data.
o Pattern Discovery: These methods help identify patterns,
clusters, and outliers in complex datasets.
Advantages:
o Intuitive: Icons make data more accessible and easier to
understand for a general audience.
o Engaging: Visual representation of data through icons makes
the data more engaging and appealing.
Definition: Interactive data visualization allows users to interact with the data by
zooming, filtering, or drilling down into specific data points. It often involves dynamic
dashboards and real-time data visualization.
Examples:
o Interactive Dashboards: Dashboards that allow users to filter,
sort, and visualize data dynamically.
o Data Exploration Tools: Tools like Tableau and Power BI that
allow users to explore data interactively.
Advantages:
o User Engagement: Enhances the experience by allowing users
to explore the data on their own.
o Flexibility: Provides the ability to dive deeper into specific areas
of interest.