AIDS C04 Session 24

Uploaded by

likhitha bhagyasri yella

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views15 pages

AIDS C04 Session 24

Uploaded by

likhitha bhagyasri yella

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 15

21CS2213RA

AI for Data Science

Session -24

Contents: Density diagrams, Mean, Standard Deviation , Median,

Quantiles and correlations

1
The topics covered
• Representing statistical measures:
• Density diagrams
• Mean, Standard Deviation ,
• Median,
• Quantiles,
• and correlations
Density Plot
• A Density plot is a smoothed, continuous version of a histogram
estimated from the data.
• The most common form of estimation is known as kernel density
estimation.
• In this method, a continuous curve (the kernel) is drawn at every
individual data point and all of these curves are then added together
to make a single smooth density estimation.
Why Density Plot?
• It visualizes the distribution of data over a continuous interval or time
period.
• This chart is a variation of a Histogram that uses kernel smoothing to
plot values, allowing for smoother distributions by smoothing out the
noise.
• The peaks of a Density Plot help display where values are
concentrated over the interval.
• Density Plots have over Histograms is that they're better at
determining the distribution shape because they're not affected by the
number of bins used (each bar used in a typical histogram).
Example of Density Plot
import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import gaussian_kde

data = np.random.normal(10,3,100) #
Generate Data
density = gaussian_kde(data)

x_vals = np.linspace(0,20,200) #
Specifying the limits of our data
density.covariance_factor = lambda : .5
#Smoothing parameter

density._compute_covariance()
plt.plot(x_vals,density(x_vals))
plt.show()
5
Statistical measures
• Statistics, in general, is the method of collection of data, tabulation,
and interpretation of numerical data
• With statistics, we can see how data can be used to solve complex
problems.
Descriptive Statistics

• descriptive statistics generally means describing the data with the

help of some representative methods like charts, tables, Excel files,
etc.
• The data is described in such a way that it can express some
meaningful information that can also be used to find some future
trends.
• Describing and summarizing a single variable is called univariate
analysis.
• Describing a statistical relationship between two variables is
called bivariate analysis.
• Describing the statistical relationship between multiple variables is
called multivariate analysis.
Mean
• It is the sum of observations divided by the total number of observations. It
is also defined as average which is the sum divided by count.
• The mean() function returns the mean or average of the data passed in its
arguments. If passed argument is empty, Statistics Error is raised.
• Example:

# mean()
import statistics

# initializing list
li = [1, 2, 3, 3, 2, 2, 2, 1]

# using mean() to calculate average of list

# elements
print ("The average of list values is : ",end="")
print (statistics.mean(li))
median()
• median() function is used to calculate the median, i.e middle element
of data. If the passed argument is empty, StatisticsError is raised.
Caclulating Median
• Step 1:Arrange the data in the increasing order and then find the mid
value.
• Step 2:Calulate median using the function.
Mode
• Mode is the number which occur most often in the data set.Here 150
is occurring twice so this is our mode.
Co-relations and Heat map
• A correlation heatmap is a graphical representation of a correlation
matrix representing the correlation between different variables.
• The value of correlation can take any value from -1 to 1.
• Correlation between two random variables or bivariate data does not
necessarily imply a causal relationship.
How to create seaborn correlation heatmap
Steps:
• Install seaborn package
• Ex: pip install seaborn
• Import all required modules
• Import the file where your data is stored
• Plot a heatmap
• Display it using matplotlib
Example
• import matplotlib.pyplot as py
• import pandas as pd
• import seaborn as sb
•
• # import file with data
• data = pd.read_csv(“data.csv”)
• print(data.corr())
• dataplot = sb.heatmap(data.corr(), cmap="YlGnBu", annot=True)
•
• # displaying heatmap
• py.show()
Thank you

Exploratory Data Analysis (EDA) in Python
No ratings yet
Exploratory Data Analysis (EDA) in Python
6 pages
Unit 2 1
No ratings yet
Unit 2 1
54 pages
1.1 Univariate Analysis: 1.1.1 Categorical Data
No ratings yet
1.1 Univariate Analysis: 1.1.1 Categorical Data
10 pages
Chapter Two
No ratings yet
Chapter Two
36 pages
CS361 FA23 Lec2 Post
No ratings yet
CS361 FA23 Lec2 Post
67 pages
R For Data Exploration
No ratings yet
R For Data Exploration
52 pages
Introduction To The Practice of Basic Statistics (Textbook Outline)
100% (14)
Introduction To The Practice of Basic Statistics (Textbook Outline)
65 pages
Data Preprocessing Python Tome II
No ratings yet
Data Preprocessing Python Tome II
14 pages
Aphical Representation
No ratings yet
Aphical Representation
8 pages
Stats Lect
No ratings yet
Stats Lect
77 pages
Data Analysis and Visualisation With Python
No ratings yet
Data Analysis and Visualisation With Python
42 pages
Lecture Notes
No ratings yet
Lecture Notes
37 pages
Data Structure Mcqs
83% (6)
Data Structure Mcqs
45 pages
Descriptive Statistics and Exploratory Data Analysis
No ratings yet
Descriptive Statistics and Exploratory Data Analysis
36 pages
3-Data Description
No ratings yet
3-Data Description
91 pages
Unit 3
No ratings yet
Unit 3
45 pages
DS Chapter - 2
No ratings yet
DS Chapter - 2
73 pages
B Lab Manual Machine Learning SEM-7 CSE 2024
No ratings yet
B Lab Manual Machine Learning SEM-7 CSE 2024
49 pages
Content From Jose Portilla's Udemy Course Learning Python For Data Analysis and Visualization Notes by Michael Brothers, Available On
No ratings yet
Content From Jose Portilla's Udemy Course Learning Python For Data Analysis and Visualization Notes by Michael Brothers, Available On
13 pages
Lecture 4
No ratings yet
Lecture 4
60 pages
DSILYTC Session 5 - Descriptive Statistics
No ratings yet
DSILYTC Session 5 - Descriptive Statistics
99 pages
Principles of AI Laboratory Varshadr
No ratings yet
Principles of AI Laboratory Varshadr
54 pages
3 Data Description
No ratings yet
3 Data Description
87 pages
Data Mining and Predictive Modelling Assignment
No ratings yet
Data Mining and Predictive Modelling Assignment
34 pages
Topic 2 - Descriptive - Statistics
No ratings yet
Topic 2 - Descriptive - Statistics
36 pages
Week2 Modified
No ratings yet
Week2 Modified
43 pages
PRW Questions
No ratings yet
PRW Questions
31 pages
Sl-3 Assignment No.8
No ratings yet
Sl-3 Assignment No.8
21 pages
5 - Data Summaries and Visualization
No ratings yet
5 - Data Summaries and Visualization
97 pages
Part A Assignment - No - 8
No ratings yet
Part A Assignment - No - 8
19 pages
5 - Data Summaries and Visualization
No ratings yet
5 - Data Summaries and Visualization
87 pages
Ai&Ml Bail606 ML Lab Manual
No ratings yet
Ai&Ml Bail606 ML Lab Manual
50 pages
Data Visualization
No ratings yet
Data Visualization
35 pages
Data Analytics Summary
No ratings yet
Data Analytics Summary
80 pages
Mod2 Notes
No ratings yet
Mod2 Notes
72 pages
Advanced Plot Types With Matplotlib
No ratings yet
Advanced Plot Types With Matplotlib
8 pages
Data Mining:: Concepts and Techniques
100% (1)
Data Mining:: Concepts and Techniques
63 pages
Word File For Prob and Stats
No ratings yet
Word File For Prob and Stats
25 pages
Data Analysis
No ratings yet
Data Analysis
6 pages
Materi 1 B VDE
No ratings yet
Materi 1 B VDE
18 pages
03 UnderstandData
No ratings yet
03 UnderstandData
29 pages
Word File For Prob and Stats
No ratings yet
Word File For Prob and Stats
18 pages
Lecture3 Classnotes
No ratings yet
Lecture3 Classnotes
31 pages
Unit2 Modified
No ratings yet
Unit2 Modified
42 pages
Stats Notes
No ratings yet
Stats Notes
16 pages
STATS 7053: Statistics in Engineering
No ratings yet
STATS 7053: Statistics in Engineering
24 pages
Cheatsheetforstatistics
No ratings yet
Cheatsheetforstatistics
4 pages
Sections Revision Part 2
No ratings yet
Sections Revision Part 2
7 pages
Unit 5
No ratings yet
Unit 5
25 pages
Unit 3 DS
No ratings yet
Unit 3 DS
30 pages
Soft Computing
No ratings yet
Soft Computing
30 pages
Nummerical Summaries
No ratings yet
Nummerical Summaries
11 pages
Unit 5 Descriptive Statistics
No ratings yet
Unit 5 Descriptive Statistics
7 pages
Introduction To Data Science
No ratings yet
Introduction To Data Science
22 pages
Advanced Plot Types With Seaborn
No ratings yet
Advanced Plot Types With Seaborn
8 pages
SDA 3E Chapter 2
No ratings yet
SDA 3E Chapter 2
40 pages
Creating and Customizing Advanvced Plots
No ratings yet
Creating and Customizing Advanvced Plots
10 pages
Datavisualization Interview
No ratings yet
Datavisualization Interview
3 pages
Concepts and Techniques: - Chapter 2
No ratings yet
Concepts and Techniques: - Chapter 2
29 pages
Data Visualizations: Histograms
No ratings yet
Data Visualizations: Histograms
27 pages
Notes: Section 1: Exploratory Data Analysis
No ratings yet
Notes: Section 1: Exploratory Data Analysis
6 pages
Daa Unit1 PPT
No ratings yet
Daa Unit1 PPT
243 pages
Data Structure and Algorithm Lec 1
No ratings yet
Data Structure and Algorithm Lec 1
19 pages
Graph Traversal: Bfs & Dfs
No ratings yet
Graph Traversal: Bfs & Dfs
70 pages
Binary Multiplication and Division
No ratings yet
Binary Multiplication and Division
11 pages
Appunti pg8 1 PDF
100% (1)
Appunti pg8 1 PDF
86 pages
Unit 6.2 Indexing and Hashing
No ratings yet
Unit 6.2 Indexing and Hashing
37 pages
Signals, Continuous Time and Discrete Time
No ratings yet
Signals, Continuous Time and Discrete Time
27 pages
برمجة خوارزميات تشفير vb
No ratings yet
برمجة خوارزميات تشفير vb
21 pages
Lecture 1: Cryptography: 1.2.1 Symmetric Case
No ratings yet
Lecture 1: Cryptography: 1.2.1 Symmetric Case
3 pages
Exp 7 PDF
No ratings yet
Exp 7 PDF
11 pages
L-26 (Parametric Geometric Continuity Conditions)
No ratings yet
L-26 (Parametric Geometric Continuity Conditions)
10 pages
Aditya Engineering College (A) : M.Raja Babu
No ratings yet
Aditya Engineering College (A) : M.Raja Babu
96 pages
BA 502 (1) Introduction To Statistics and Statistical Inference
No ratings yet
BA 502 (1) Introduction To Statistics and Statistical Inference
34 pages
Lecture 09 - FUZZY LOGIC
No ratings yet
Lecture 09 - FUZZY LOGIC
57 pages
Travelling Salesman and Distribution Problems: Ik Ij JK
No ratings yet
Travelling Salesman and Distribution Problems: Ik Ij JK
11 pages
Transportation Problem
No ratings yet
Transportation Problem
3 pages
Aparta Mes12s2 Mathlab Simulation-Activity-2.1
No ratings yet
Aparta Mes12s2 Mathlab Simulation-Activity-2.1
4 pages
FPGA - Ch0 - Folding
No ratings yet
FPGA - Ch0 - Folding
84 pages
KMeansPP Soda
No ratings yet
KMeansPP Soda
9 pages
Naac Lesson Plan Subject-Wsn
No ratings yet
Naac Lesson Plan Subject-Wsn
6 pages
Quic03 Phase Estimation
No ratings yet
Quic03 Phase Estimation
6 pages
Ben Ulmer, Matt Fernandez, Predicting Soccer Results in The English Premier League
No ratings yet
Ben Ulmer, Matt Fernandez, Predicting Soccer Results in The English Premier League
5 pages
An Automatic Dermatology Detection System Based On Deep Learning and Computer Vision
No ratings yet
An Automatic Dermatology Detection System Based On Deep Learning and Computer Vision
10 pages
Monticelli 1985
No ratings yet
Monticelli 1985
7 pages
Canonical Forms and Transfer Function
No ratings yet
Canonical Forms and Transfer Function
9 pages
Decision Theory
No ratings yet
Decision Theory
6 pages
Soft-NMS - Improving Object Detection With One Line of Code
No ratings yet
Soft-NMS - Improving Object Detection With One Line of Code
9 pages
Offline Password Brute-Forcer
No ratings yet
Offline Password Brute-Forcer
2 pages
Mathematics for Data Science: Linear Algebra with Matlab
From Everand
Mathematics for Data Science: Linear Algebra with Matlab
César Pérez López
No ratings yet

AIDS C04 Session 24

Uploaded by

AIDS C04 Session 24

Uploaded by

21CS2213RA

AI for Data Science

Contents: Density diagrams, Mean, Standard Deviation , Median,

• descriptive statistics generally means describing the data with the

# using mean() to calculate average of list

You might also like