ADS Exp3
ADS Exp3
Aim: To explore data visualization techniques using the Iris and Titanic datasets. This
includes identifying feature types, creating histograms and boxplots, comparing distributions,
and identifying outliers.
Theory:
Data visualization is a crucial step in data analysis as it helps in understanding patterns, trends, and
distributions. Some common types of visualizations include:
• Univariate Visualization: Examines one variable at a time (e.g., histograms, quartile
distributions).
• Multivariate Visualization: Displays relationships between multiple variables (e.g., scatter
plots, density charts).
• High-Dimensional Data Visualization: Projects multiple variables onto a two-dimensional
space using techniques like parallel coordinates.
Using visualization, we can:
1. Understand data distribution.
2. Identify outliers and anomalies.
3. Detect patterns and relationships between variables.
Step-wise Implementation:
CODE :
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
Conclusion:
Data visualization plays a crucial role in understanding datasets. By analyzing the Iris and Titanic
datasets, we explored different visualization techniques such as histograms, boxplots, and outlier
detection methods. This experiment demonstrates how visual representations help in identifying
patterns and making informed decisions in data analysis.