0% found this document useful (0 votes)

6 views7 pages

Descriptive - Statistics Data Discret chp2

The document provides an overview of descriptive statistics, detailing the types of data (quantitative and categorical) and their subcategories (ordinal, nominal, continuous, discrete). It explains key concepts such as measures of center (mean, median, mode), measures of spread (range, IQR, standard deviation, variance), and the importance of data shape and outliers in analysis. Additionally, it distinguishes between descriptive and inferential statistics, highlighting their roles in data analysis.

Uploaded by

mansoursuihal26

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views7 pages

Descriptive - Statistics Data Discret chp2

Uploaded by

mansoursuihal26

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Descriptive Statistics

qualitative data

Binary

Descriptive Statistics Summary Of Udacity Course

Descriptive Statistics Data Types Quantitative and Categorical. Quantitative data takes on numeric values that
allow us to perform mathematical operations (like the number of dogs) Categorical are used to label a group or
set of items (like dog breeds - Collies, Labs, Poodles, etc.
https://www.linkedin.com/pulse/descriptive-statistics-summary-udacity-course-engy-wahpa

Data Types
Quantitative and Categorical.
Quantitative data takes on numeric values that allow us to perform mathematical operations (like the number of dogs).

Categorical are used to label a group or set of items (like dog breeds - Collies, Labs, Poodles, etc.).

Categorical Ordinal vs Categorical Nominal

We can divide categorical data further into two types: Ordinal and Nominal.

Descriptive Statistics 1
Categorical Ordinal data take on a ranked ordering (like a ranked interaction on a scale from Very Poor to Very Good with the
dogs).
Categorical Nominal data do not have an order or ranking (like the breeds of the dog ‫)ﺳﻼﻻت ﻣﻦ اﻟﻜﺎﻻب‬.

Quantitative Continuous vs Quantitative Discrete

We can think of quantitative data as being either continuous or discrete.
Continuous data can be split into smaller and smaller units, and still a smaller unit exists. An example of this is the age of
the dog - we can measure the units of the age in years, months, days, hours, seconds, but there are still smaller units that could be
associated with the age.
Discrete data only takes on countable values. The number of dogs we interact with is an example of a discrete data type.

Quantitative: Examples
Continuous : Height, Age, Income
Discrete : Pages in a Book, Trees in Yard, Dogs at a Coffee Shop

Categorical: Examples
Ordinal : Letter Grade, Survey Rating
Nominal : Gender, Marital Status, Breakfast Items

Analyzing Quantitative Data

Four Aspects for Quantitative Data
There are four main aspects to analyzing Quantitative data.

1. Measures of Center

2. Measures of Spread

3. The Shape of the data.

4. Outliers

Analyzing Categorical Data

if we were looking at the breeds of the dogs, we would care about how many dogs are of each breed, or what proportion of
dogs are of each breed type.
Categorical data is analyzed usually be looking at the counts or proportion of individuals that fall into each group.

1-Measures of Center
There are three measures of center:

1. Mean

2. Median

3. Mode

1- The Mean
The mean is often called the average or the expected value in mathematics.

We calculate the mean by adding all of our values together, and dividing by the number of values in our dataset.

Descriptive Statistics 2
2- The Median
The median splits our data so that 50% of our values are lower and 50% are higher.

Median for Odd Values

If we have an odd number of observations, the median is simply the number in the direct middle.

Median for Even Values

If we have an even number of observations, the median is the average of the two values in the middle.

3-The Mode
The mode is the most frequently observed value in our dataset.

There might be multiple modes for a particular dataset, or no mode at all.

Notation
Notation : Think of notation as a universal language used by academic and industry professionals to convey mathematical
ideas. 5+3

Random Variables
A random variable is a placeholder for the possible values of some process

Aggregations
An aggregation is a way to turn multiple numbers into fewer numbers (commonly one number).
Summation is a common aggregation. The notation used to sum our values is a greek symbol called sigma Σ.

2- Measures of Spread
Measures of Spread are used to provide us an idea of how spread out our data are from one another. Common measures of
spread include:

1. Range

2. Interquartile Range (IQR)

3. Standard Deviation

4. Variance

Descriptive Statistics 3
Histograms ‫اﻟﻤﺪرج اﻟﺘﻜﺮارى‬
Histograms : are super useful to understanding the different aspects of quantitative data. In the upcoming concepts, you will see
histograms used all the time to help you understand the four aspects we outlined earlier regarding a quantitative variable:

center - Spread - Shape - Outliers

‫اﻟﻤﺪرج اﻟﺘﻜﺮارى ﻫﻰ ﻣﺠﻤﻮﻋﺔ ﻣﻦ اﻟﺒﻴﺎﻧﺎت ﺑﺘﺘﻘﺴﻢ ﻟﻔﺌﺎت و ﺑﺘﺘﺤﻮل ﻟﺸﻜﻞ ﺑﻴﺎﻧﻰ‬

Calculating the 5 Number Summary (Outliers Or Skewed )

The five number summary consist of 5 values:

1. Minimum: The smallest number in the dataset.

2. Q1: The value such that 25% of the data fall below.

3. Q2: (Median) The value such that 50% of the data fall below.

4. Q3: The value such that 75% of the data fall below.

5. Maximum: The largest value in the dataset.

1- The Range = ( Max - Min )

The range is then calculated as the difference between the maximum and the minimum.

2- Interquartile Range (IQR) Q3 - Q1

The interquartile range is calculated as the difference between Q3 and Q1.

3-The Standard Deviation

The standard deviation is one of the most common measures for talking about the spread of data. It is defined as the average
distance of each observation from the mean.

Descriptive Statistics 4
The standard deviation is associated with risk in finance, assists in determining the significance of drugs in medical
studies, and measures the error of our results for predicting anything from the amount of rainfall we can expect
tomorrow to your predicted commute time tomorrow.

4-The Variance ‫اﻟﺘﻔﺎوت‬

The Variance : is the average squared difference of each observation from the mean
.

The variance is used to compare the spread of two different groups. A set of data with higher variance is more spread
out than a dataset with lower variance. Be careful though, there might just be an outlier (or outliers) that is increasing the
variance, when most of the data are actually very close.

3- The Shape Of Data

From a histogram we can quickly identify the shape of our data, which helps influence all of the measures we learned in the
previous concepts. We learned that the distribution of our data is frequently associated with one of the three shapes:
1. Right-skewed
2. Left-skewed

3. Symmetric (frequently normally distributed)

1- Right skewed Mean > Median

Real World Applications
Amount of drug remaining in a blood stream,

Time between phone calls at a call center,

Time until light bulb dies

Descriptive Statistics 5
2 - Left skewed Median > Mean
Real World Applications
Grades as a percentage in many universities,

Age of death,

Asset price changes

3 - Symmetric (frequently normally distributed) Median = Mean

Real World Applications ( Mean And Standard Deviation)
Height,

Weight, Errors,

Precipitation

Descriptive Statistics 6
4- Outliers
outliers : are points that fall very far from the rest of our data points. This influences measures like the mean and standard
deviation much more than measures associated with the five number summary.

Outliers Advise
1. Plot your data to identify if you have outliers.
2. Handle outliers accordingly via the methods above.
3. If no outliers and your data follow a normal distribution - use the mean and standard deviation to describe your dataset, and
report that the data are normally distributed.
4. If you have skewed data or outliers, use the five number summary to summarize your data and report the outliers.

Descriptive Statistics
Descriptive statistics is about describing our collected data.

Inferential Statistics
Inferential Statistics is about using our collected data to draw conclusions to a larger population.
We looked at specific examples that allowed us to identify the

1. Population - our entire group of interest.

2. Parameter - numeric summary about a population

3. Sample - subset of the population

4. Statistic - numeric summary about a sample

Descriptive Statistics 7

Assessment Procedures For Counselors and Helping Professionals 7E TB
100% (2)
Assessment Procedures For Counselors and Helping Professionals 7E TB
82 pages
Consumer Perception Towards Bata
60% (15)
Consumer Perception Towards Bata
43 pages
Research Methodology PPT Module 1 Sociology
No ratings yet
Research Methodology PPT Module 1 Sociology
106 pages
Data Analysis Challenger PDF 1
No ratings yet
Data Analysis Challenger PDF 1
9 pages
Educ 201
No ratings yet
Educ 201
2 pages
Data Analyst
No ratings yet
Data Analyst
21 pages
Data Types
No ratings yet
Data Types
7 pages
Data Analysis Fundamentals
100% (9)
Data Analysis Fundamentals
56 pages
Data Types
No ratings yet
Data Types
38 pages
Data Analysis Challenger PDF 2
No ratings yet
Data Analysis Challenger PDF 2
15 pages
Recap: Categorical Quantitative Continuous Discrete Ordinal Nominal
No ratings yet
Recap: Categorical Quantitative Continuous Discrete Ordinal Nominal
3 pages
Introduction To Statistics
100% (1)
Introduction To Statistics
60 pages
Ch1 Prob&Stat NEW
No ratings yet
Ch1 Prob&Stat NEW
35 pages
Intro To Descriptive Statistics: By: Mahmoud Galal
No ratings yet
Intro To Descriptive Statistics: By: Mahmoud Galal
28 pages
Summarize Topic in Statistical
No ratings yet
Summarize Topic in Statistical
5 pages
Discriptive Statics
No ratings yet
Discriptive Statics
4 pages
MATH2203 Statistics I - Week 1
No ratings yet
MATH2203 Statistics I - Week 1
27 pages
Basic Statistical Concepts-2
No ratings yet
Basic Statistical Concepts-2
20 pages
Chapter1 Statistics
No ratings yet
Chapter1 Statistics
17 pages
Safari
No ratings yet
Safari
385 pages
Statistics - Imp Points
No ratings yet
Statistics - Imp Points
6 pages
Module 4
No ratings yet
Module 4
51 pages
Biostatics Course
No ratings yet
Biostatics Course
29 pages
Notes 3 Descriptive Statistics RJMurden 2021
No ratings yet
Notes 3 Descriptive Statistics RJMurden 2021
47 pages
Lesson 02 Probability and Statistics
No ratings yet
Lesson 02 Probability and Statistics
127 pages
Introduction and Descriptive Statistics
No ratings yet
Introduction and Descriptive Statistics
50 pages
Lecture 1 - Introduction To Statistics
No ratings yet
Lecture 1 - Introduction To Statistics
48 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
26 pages
Gr11 Statistics Notes
No ratings yet
Gr11 Statistics Notes
13 pages
Statistics
No ratings yet
Statistics
63 pages
Getting To Know Your Data
No ratings yet
Getting To Know Your Data
42 pages
Importance of Descriptive Statistics
No ratings yet
Importance of Descriptive Statistics
59 pages
Basic Concepts in Statistics
No ratings yet
Basic Concepts in Statistics
42 pages
Lesson 5 (Descriptive Statistics Part 1) - Oct 2024
No ratings yet
Lesson 5 (Descriptive Statistics Part 1) - Oct 2024
72 pages
Descriptive Statistics Lecture
No ratings yet
Descriptive Statistics Lecture
24 pages
Statistics Theory
No ratings yet
Statistics Theory
3 pages
Midterm Reviewer 1
No ratings yet
Midterm Reviewer 1
8 pages
Statistics For Data Science
100% (1)
Statistics For Data Science
27 pages
Research Presentation
No ratings yet
Research Presentation
29 pages
MMW (Data Management) - Part 1
No ratings yet
MMW (Data Management) - Part 1
26 pages
Article Review 1 Eng
No ratings yet
Article Review 1 Eng
30 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
13 pages
MS102
No ratings yet
MS102
9 pages
01 Introduction
No ratings yet
01 Introduction
50 pages
1 Introduction
No ratings yet
1 Introduction
9 pages
Lecture 1
No ratings yet
Lecture 1
32 pages
Statistics
100% (1)
Statistics
6 pages
Biostatistics 1
No ratings yet
Biostatistics 1
120 pages
CH 2 Lecture Notes
No ratings yet
CH 2 Lecture Notes
12 pages
Statistics
No ratings yet
Statistics
11 pages
Topic 2 - Descriptive - Statistics
No ratings yet
Topic 2 - Descriptive - Statistics
36 pages
Statistics
No ratings yet
Statistics
21 pages
Math
No ratings yet
Math
50 pages
Study Guide
No ratings yet
Study Guide
16 pages
C1S1 Statistics Packet
No ratings yet
C1S1 Statistics Packet
24 pages
Statiscal Method Using R
No ratings yet
Statiscal Method Using R
150 pages
Class 1
No ratings yet
Class 1
52 pages
Statistics For Data Science
No ratings yet
Statistics For Data Science
93 pages
Exploring Data: AP Statistics Unit 1: Chapters 1-4
No ratings yet
Exploring Data: AP Statistics Unit 1: Chapters 1-4
83 pages
Statistics
No ratings yet
Statistics
4 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
18 pages
Descriptive Statistics: Six Sigma Thinking, #3
From Everand
Descriptive Statistics: Six Sigma Thinking, #3
Sumeet Savant
No ratings yet
Machine Learning - A Complete Exploration of Highly Advanced Machine Learning Concepts, Best Practices and Techniques: 4
From Everand
Machine Learning - A Complete Exploration of Highly Advanced Machine Learning Concepts, Best Practices and Techniques: 4
Peter Bradley
No ratings yet
Main Title: Planning Data Analysis Using Statistical Data
100% (1)
Main Title: Planning Data Analysis Using Statistical Data
40 pages
Levels of Measurement of Variables
No ratings yet
Levels of Measurement of Variables
1 page
MODULE 03 Types of Data
No ratings yet
MODULE 03 Types of Data
12 pages
RTT Terminal Practice Question
No ratings yet
RTT Terminal Practice Question
14 pages
My Favourite Things Object Attachment Comparative PDF
No ratings yet
My Favourite Things Object Attachment Comparative PDF
18 pages
Assessment of Learning Ii: Dr. Louie D. Asuncion Instructor
No ratings yet
Assessment of Learning Ii: Dr. Louie D. Asuncion Instructor
73 pages
Basic Biostatistics For Post-Graduate Students: Indian Journal of Pharmacology July 2012
No ratings yet
Basic Biostatistics For Post-Graduate Students: Indian Journal of Pharmacology July 2012
9 pages
524-Business Research Methods
No ratings yet
524-Business Research Methods
9 pages
A-Level Maths Generic Sample Chapter Year 1 FINAL
No ratings yet
A-Level Maths Generic Sample Chapter Year 1 FINAL
46 pages
Lecture Slides 1 Introduction
No ratings yet
Lecture Slides 1 Introduction
61 pages
Module 1 Assessment 2
No ratings yet
Module 1 Assessment 2
5 pages
MIP2602 Student Guide
No ratings yet
MIP2602 Student Guide
157 pages
Integrating Risk Management in The Innovation Project
No ratings yet
Integrating Risk Management in The Innovation Project
17 pages
QM Prelim Finals - Usergen
No ratings yet
QM Prelim Finals - Usergen
157 pages
Iit M Qualifier An Exam Qdq2 4 Aug 2024
No ratings yet
Iit M Qualifier An Exam Qdq2 4 Aug 2024
43 pages
A Review of Machine Learning For The Optimization of Production Process
No ratings yet
A Review of Machine Learning For The Optimization of Production Process
14 pages
Math 7 Q4 Module 1
No ratings yet
Math 7 Q4 Module 1
24 pages
Statistical Techniques in Business & Q Economics: Professor: Mamdouh Hamza Ahmed
No ratings yet
Statistical Techniques in Business & Q Economics: Professor: Mamdouh Hamza Ahmed
16 pages
Psychological Assessment With Ratio
No ratings yet
Psychological Assessment With Ratio
6 pages
Data Preparation PDF
No ratings yet
Data Preparation PDF
71 pages
Chapter 1 Updated
No ratings yet
Chapter 1 Updated
14 pages
Review Question - C3 - SACR3080
No ratings yet
Review Question - C3 - SACR3080
10 pages
Business Research Methods: Measurement
No ratings yet
Business Research Methods: Measurement
23 pages
Chapter 4 Data Management
No ratings yet
Chapter 4 Data Management
56 pages
A Survey of Discretization Techniques Taxonomy and Empirical Analysis in Supervised Learning
No ratings yet
A Survey of Discretization Techniques Taxonomy and Empirical Analysis in Supervised Learning
17 pages
Data Sources Advance Data Handling
No ratings yet
Data Sources Advance Data Handling
23 pages
14 Attitude and Rating Scales by Sommer PDF
No ratings yet
14 Attitude and Rating Scales by Sommer PDF
19 pages

Descriptive - Statistics Data Discret chp2

Uploaded by

Descriptive - Statistics Data Discret chp2

Uploaded by

Descriptive Statistics

Descriptive Statistics Summary Of Udacity Course

Categorical Ordinal vs Categorical Nominal

Quantitative Continuous vs Quantitative Discrete

Analyzing Quantitative Data

3. The Shape of the data.

Analyzing Categorical Data

Median for Odd Values

Median for Even Values

There might be multiple modes for a particular dataset, or no mode at all.

2. Interquartile Range (IQR)

center - Spread - Shape - Outliers

‫اﻟﻤﺪرج اﻟﺘﻜﺮارى ﻫﻰ ﻣﺠﻤﻮﻋﺔ ﻣﻦ اﻟﺒﻴﺎﻧﺎت ﺑﺘﺘﻘﺴﻢ ﻟﻔﺌﺎت و ﺑﺘﺘﺤﻮل ﻟﺸﻜﻞ ﺑﻴﺎﻧﻰ‬

Calculating the 5 Number Summary (Outliers Or Skewed )

1. Minimum: The smallest number in the dataset.

5. Maximum: The largest value in the dataset.

1- The Range = ( Max - Min )

2- Interquartile Range (IQR) Q3 - Q1

3-The Standard Deviation

4-The Variance ‫اﻟﺘﻔﺎوت‬

3- The Shape Of Data

3. Symmetric (frequently normally distributed)

1- Right skewed Mean > Median

Time between phone calls at a call center,

Time until light bulb dies

Asset price changes

3 - Symmetric (frequently normally distributed) Median = Mean

1. Population - our entire group of interest.

2. Parameter - numeric summary about a population

3. Sample - subset of the population

4. Statistic - numeric summary about a sample

You might also like