SMDS Unit 1

Uploaded by

charancharan73202

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views36 pages

SMDS Unit 1

Uploaded by

charancharan73202

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 36

UNIT-1

Data Visualization and Distributions

Syllabus:
Data Visualization Techniques:
Introduction to Statistical Method's
Exploratory Data Analysis:
Charts:( Line, Pie, Bar)
Plots:( Bubble ,Scatter)
Maps:
Heat
Dot Distribution
Diagrams:
Trees
Matrices
principal Components Analysis
Intro to Data distributions
Probability Distributions
Discrete:
[Binomial, poisson]
Continuous:
[Normal, exponential]
Introduction Statistical Methods:
Statistical methods are set of techniques used to Collect,
analyze, interpret & present numerical data.
They help us draw meaningful Conclusion's from data,
identify pattern’s & trends.
Make informed decisions in various fields like:
 Business
 Science
 social research.

Key Concepts:
1.Data:
The raw information Collected fool analysis.
This can be anything from numerical measurements to
Categorical observations.
2.variables:
The characteristics being measured (or) Studied. Ex: age,
income, gender (or) test scores.
3.Descriptive statistics:
summarize & describe the main features of a data set.
This includes measures like:
* Mean :The average of a dataset
* Median: The middle value when data is arranged in order.
* Mode: The most frequent value
* Standard deviation: A Measure of how spread out the data
(4) Inferential statistics:
Make predictions (or) draw Conclusions
about a larger population based on sample.
This involves techniques like:
 Hypothesis Testing: Formulating & testing hypotheses
about a population parameter.
 Confidence intervals: Estimating a range of values that
likely
 Contains the true population parameter.
 Regression Analysis: Examining the relationship b/w
2(or)more variables.
Data:
"Data" is a Collection of facts such as numbers, words,
measurements& observations.
DATA

Qualitative Quantitative
Quantitative
(Categorical Data) Quantitative
(Numerical Data)
Ex :gender, Color

Discrete Continuous
(Counted) (Measured)
Data visualization techniques:
 Data visualization is the graphical representation of
 information & data.
 By using visual elements like charts, graphs & maps.
 Data visualization tools provide an accessible way to See
& understand trends, outliers & patterns in data.
 Some of the most common & effective Data visualization
techniques:

1.Bar chart:
use: Comparing Categories (or) groups.
Ex: sales figures for different products.

Bar chart
2.Line chart:
use: showing trends over time.
Ex: stock prices over a year.

Line chart
3.Pie chart:
Use: Showing the proportion of parts to a whole
EX: Market share of different companies.

Pie chart
4.Scatter plots:
Use: Showing the relationship b/w 2 variables.
EX: Correlation b/w Height & Weight.

Scatter plot
5.Bubble plots:
A bubble plot is a type of charts that visually represents
data with 3 variables.

Bubble plot
Ex: Imagine a bubble plot analyzing the performance of
different car models
Key components of Bubble plot:
1.X-Axis: Represents one variable.
2.Y-Axis: Represents another variable.
3.Bubble size: 3rd variable.

Heat Maps:
Use: Visualization data across 2 dimensions using color.
EX: Showing busy times in a restaurant.
Dot Distribution:
Definition:
A dot distribution is a type of thematic map that uses dots
to represent the presence, quantity,(or) value of a
phenomenon in a specific area.
Each dot represents a specific number of occurrences or
instances of the mapped phenomenon.
Types of Dot Distribution:
1.One-to-One Dot Map: Each dot represents one instance
(e.g., one person, one tree).
2.One-to-Many (Representative Dot Map): Each dot
represents a specific number (e.g., 1 dot =
100 people).
3.Random Dot Map: Dots are placed randomly within an
area to represent the quantity.
4.Uniform Dot Map: Dots are placed evenly spaced for a
more systematic look.
Advantages:
Visual Clarity: Provides an immediate visual impression of
distribution patterns.
Easy Interpretation: Simple to understand, especially for non-
technical audiences.
Effective Comparison: Useful for comparing density and
distribution between regions.
Scalable: Can represent small or large quantities effectively.
Disadvantages
Overlapping Dots: In high-density areas, dots may overlap,
making it hard to interpret.
Misleading Placement: Dots might be placed randomly,
leading to false impressions of exact
locations.
Scale Sensitivity: Choosing the right scale (value per dot) is
critical; wrong choice can distort
interpretation.
Data Generalization: Often uses aggregate data, which might
hide local variations.
Applications:
Population Studies: Mapping human population density and
distribution.
Epidemiology: Tracking the spread of diseases (e.g., COVID-
19 cases).
Agriculture: Showing distribution of crops or livestock.
Urban Planning: Visualizing distribution of facilities like
schools or hospitals.
Environmental Science: Mapping occurrences of natural
features like forests or water bodies.
Historical Studies: Representing historical events like battles
or migrations.
Tree diagrams:
Tree diagrams are a valuable tool in various statistical
Methods for data science.
It is used for representing the structure of a given website.
Matrices diagram:
Matrices diagram is powerful visualization
techniques that help you to understand and analyze the
relationships between different sets of data.
Principal components Analysis:
Introduction to data distributions:
Data distributions are a foundational concept in
statistics and data analysis.
They describe how data values are spread or
distributed across a range.
Understanding distributions helps us analyze data
patterns, make predictions, and draw meaningful
conclusions.
Types of Data Distributions:
1. Discrete
2. Continuous
1.Discrete:
1. Binomial Distribution:
o Used to describe the probability of success or
failure in experiments with two possible
outcomes.
o Example: Flipping a coin (heads or tails).
2. Poisson Distribution:
o Describes the probability of a given number
of events occurring within a fixed interval.
Example: Number of customer arrivals per minute.
2.Continuous:
1.Normal Distribution:
o Also called a "bell curve," this is one of the
most common data distributions.
o The data is symmetrically distributed around
the mean.
o Examples: Heights, weights, test scores often
follow a normal distribution.

2.Exponential Distribution:
o Describes the time between events in a
Poisson process (events occurring at a
constant rate independently).
o Example: Time between arrivals at a bus
stop.
Key Concepts in Data Distributions:
 Central Tendency: Measures like mean, median,
and mode indicate where the data is centered.
 Variability: Includes range, variance, and standard
deviation, showing the spread of data.
 Shape: Describes whether the data is
symmetrical, skewed, or has specific patterns.
\

A320 6E 1100
0% (1)
A320 6E 1100
1 page
Chapter 4.data Management Lesson 1 2
100% (1)
Chapter 4.data Management Lesson 1 2
86 pages
Selected Candidates List - Rinex Technologies - KA - 2025 Batch
No ratings yet
Selected Candidates List - Rinex Technologies - KA - 2025 Batch
3 pages
Fluke 718 300g Process Calibrator Manual
No ratings yet
Fluke 718 300g Process Calibrator Manual
36 pages
Parts Manual SK750 - SK755 (053-2566)
No ratings yet
Parts Manual SK750 - SK755 (053-2566)
207 pages
Graphical Presentation
No ratings yet
Graphical Presentation
6 pages
An Introduction To Submarine Cables
100% (1)
An Introduction To Submarine Cables
7 pages
Hackers Toeic
No ratings yet
Hackers Toeic
21 pages
Dna Mica Desist em As
No ratings yet
Dna Mica Desist em As
535 pages
SLIDES Statistics-Chapter 2
No ratings yet
SLIDES Statistics-Chapter 2
31 pages
Picturing Distributions With Graphs
No ratings yet
Picturing Distributions With Graphs
21 pages
Data Analytics
No ratings yet
Data Analytics
110 pages
Dissertation Topics Logistics Supply Chain
100% (1)
Dissertation Topics Logistics Supply Chain
7 pages
Smuat Guide
No ratings yet
Smuat Guide
53 pages
STA 111 Note
No ratings yet
STA 111 Note
12 pages
Brackets Lesson For Coding and Programming by Slidesgo
No ratings yet
Brackets Lesson For Coding and Programming by Slidesgo
57 pages
CODE UNNATI Marathon by BHIBHUSHITAM
No ratings yet
CODE UNNATI Marathon by BHIBHUSHITAM
91 pages
Share MBBS-LECTURE 3 (1) - 1
No ratings yet
Share MBBS-LECTURE 3 (1) - 1
68 pages
Charts
No ratings yet
Charts
11 pages
7700e SPM
No ratings yet
7700e SPM
2 pages
Lecture#06 - EDA2 - Graphical Data Analysis
No ratings yet
Lecture#06 - EDA2 - Graphical Data Analysis
55 pages
Week 3 Language Translators
No ratings yet
Week 3 Language Translators
6 pages
Chapter 2 Methods of Data Collection and Presentation
No ratings yet
Chapter 2 Methods of Data Collection and Presentation
35 pages
DVP 3
No ratings yet
DVP 3
97 pages
VIPDMTheory Chapter 2
No ratings yet
VIPDMTheory Chapter 2
56 pages
Pragya Sachdeva Resume
No ratings yet
Pragya Sachdeva Resume
1 page
RVO-STATISTICS - Statistics - Introduction To Statistics IBBI
No ratings yet
RVO-STATISTICS - Statistics - Introduction To Statistics IBBI
93 pages
Chapter 3 Non Spatial Data Visualization
No ratings yet
Chapter 3 Non Spatial Data Visualization
45 pages
DS - Unit 3
No ratings yet
DS - Unit 3
37 pages
Unit 2 - Merged
No ratings yet
Unit 2 - Merged
17 pages
Unit 4
No ratings yet
Unit 4
41 pages
RDA Imp
No ratings yet
RDA Imp
26 pages
RM Module 3
No ratings yet
RM Module 3
34 pages
Data Basics For ML
No ratings yet
Data Basics For ML
23 pages
DA Unit 4
No ratings yet
DA Unit 4
30 pages
KT-60 Introduction V1.0 20241114
No ratings yet
KT-60 Introduction V1.0 20241114
24 pages
Unit 4 - Data Visualization
No ratings yet
Unit 4 - Data Visualization
32 pages
Data Managementmmw
No ratings yet
Data Managementmmw
26 pages
Lecture - 7 - MSC
No ratings yet
Lecture - 7 - MSC
13 pages
Collection N PRSNTN
No ratings yet
Collection N PRSNTN
27 pages
Unit .......
No ratings yet
Unit .......
45 pages
Anirban Dutta - Aiml
No ratings yet
Anirban Dutta - Aiml
19 pages
Cyber Security Notes
No ratings yet
Cyber Security Notes
15 pages
Reasearch Methodology and Statistics
No ratings yet
Reasearch Methodology and Statistics
13 pages
Descriptive Statistics, Tables and Graphs 20
No ratings yet
Descriptive Statistics, Tables and Graphs 20
34 pages
AL - I (Unit - I)
No ratings yet
AL - I (Unit - I)
19 pages
Unit 3 DATA VISUAIZATION
No ratings yet
Unit 3 DATA VISUAIZATION
25 pages
L1 QM02 High Yield Notes
No ratings yet
L1 QM02 High Yield Notes
10 pages
EDA - Reviewer Midterm
No ratings yet
EDA - Reviewer Midterm
9 pages
Rashed
No ratings yet
Rashed
9 pages
Graphical Presentation of Data
No ratings yet
Graphical Presentation of Data
15 pages
EDA - Reviewer Midterm
No ratings yet
EDA - Reviewer Midterm
8 pages
I PPR Extracted
No ratings yet
I PPR Extracted
6 pages
DBBA2102
No ratings yet
DBBA2102
10 pages
11a Tabular & Graphical Presentation of Data
No ratings yet
11a Tabular & Graphical Presentation of Data
18 pages
Lesson 2 Notes
No ratings yet
Lesson 2 Notes
11 pages
Evolights Laser RGB 400mw Animation - Instrukcja Obs Ugi Manual Eng PL 16ch
No ratings yet
Evolights Laser RGB 400mw Animation - Instrukcja Obs Ugi Manual Eng PL 16ch
19 pages
Document
No ratings yet
Document
8 pages
Chapter 1 & 2 - Stats
No ratings yet
Chapter 1 & 2 - Stats
5 pages
Flashcard Statistics 1
No ratings yet
Flashcard Statistics 1
4 pages
1st Mid
No ratings yet
1st Mid
19 pages
Unit-5 Operator Overloading
No ratings yet
Unit-5 Operator Overloading
8 pages
Ia - Eda
No ratings yet
Ia - Eda
10 pages
June 2019 Pure Shadow Paper 2
No ratings yet
June 2019 Pure Shadow Paper 2
13 pages
Ae 9 Reviewer
No ratings yet
Ae 9 Reviewer
7 pages
11 em Acc Public MLM
No ratings yet
11 em Acc Public MLM
11 pages
Data Visualization Tech.
No ratings yet
Data Visualization Tech.
6 pages
5 Methods of Data Visualisation
No ratings yet
5 Methods of Data Visualisation
5 pages
Trellix Insights: Key Benefits
No ratings yet
Trellix Insights: Key Benefits
8 pages
Elective 3B
No ratings yet
Elective 3B
2 pages
Business Statistics: Qualitative or Categorical Data
No ratings yet
Business Statistics: Qualitative or Categorical Data
14 pages
Creative and Minimal Portfolio Presentation
No ratings yet
Creative and Minimal Portfolio Presentation
5 pages
Grey Minimalist Business Project Presentation
No ratings yet
Grey Minimalist Business Project Presentation
5 pages
CH 2 Notes Filled
No ratings yet
CH 2 Notes Filled
22 pages
Math Midterm
No ratings yet
Math Midterm
9 pages
Bustat Reviewer
No ratings yet
Bustat Reviewer
6 pages
Chap 7-1
No ratings yet
Chap 7-1
4 pages
Aln-V Ha-06-043-Analog Sensor Bases Installation Instructions
No ratings yet
Aln-V Ha-06-043-Analog Sensor Bases Installation Instructions
4 pages
Appendix 1: Apmoption Apm3Rdpar Csotpm
No ratings yet
Appendix 1: Apmoption Apm3Rdpar Csotpm
8 pages
Churn Analysis in Telecommunication Using Logistic Regression
No ratings yet
Churn Analysis in Telecommunication Using Logistic Regression
6 pages
GEA Marine Purifiers For Motor Yachts - tcm11-83673
No ratings yet
GEA Marine Purifiers For Motor Yachts - tcm11-83673
6 pages
Powin - SAMPLE Commissioning Schedule 22NOV2021
No ratings yet
Powin - SAMPLE Commissioning Schedule 22NOV2021
1 page
Chapter Six Methods of Describing Data
No ratings yet
Chapter Six Methods of Describing Data
20 pages
MAT211 Assignment - 1: Part - 1
No ratings yet
MAT211 Assignment - 1: Part - 1
10 pages
2/ Organizing and Visualizing Variables: Dcova
No ratings yet
2/ Organizing and Visualizing Variables: Dcova
4 pages
Double Skin Façade and Potential Integration With Other Building Environmental Technologies and Materials
No ratings yet
Double Skin Façade and Potential Integration With Other Building Environmental Technologies and Materials
8 pages
Unit 01 Statistics
No ratings yet
Unit 01 Statistics
10 pages
Business Statistics I Essentials
From Everand
Business Statistics I Essentials
Louise Clark
5/5 (5)
Illuminating Data: A hands on guide to data visualization in R
From Everand
Illuminating Data: A hands on guide to data visualization in R
Eman Ahmad
No ratings yet
Descriptive Statistics: Six Sigma Thinking, #3
From Everand
Descriptive Statistics: Six Sigma Thinking, #3
Sumeet Savant
No ratings yet

SMDS Unit 1

Uploaded by

SMDS Unit 1

Uploaded by

UNIT-1

Data Visualization and Distributions

You might also like