0% found this document useful (0 votes)

14 views13 pages

Descriptive Statistics

The document provides an overview of descriptive statistics, including its definition, purpose, and key characteristics. It covers variables, data types, measures of central tendency and variability, data visualization techniques, and applications in various fields. Additionally, it discusses advanced topics such as data cleaning, exploratory data analysis, and common tools used for statistical analysis.

Uploaded by

jovelynortinez.pit

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views13 pages

Descriptive Statistics

Uploaded by

jovelynortinez.pit

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 13

Descriptive Statistics

Objectives

1. Define descriptive statistics and its purpose.

2. Differentiate between variables and data types.

3. Identify and explain measures of central tendency and variability.

4. Understand and utilize data visualization techniques.

5. Apply descriptive statistical methods to analyze datasets.

Discussion

1. Introduction to Descriptive Statistics

Descriptive statistics refers to the branch of statistics that deals with the summarization

and organization of data. It focuses on presenting data in a meaningful way to identify

patterns, trends, and relationships. Unlike inferential statistics, which seeks to draw

conclusions about a population from a sample, descriptive statistics describes and

summarizes data collected from a dataset.

Purpose of Descriptive Statistics

The primary purpose of descriptive statistics is to simplify large datasets into

understandable formats, enabling decision-makers and researchers to interpret

information efficiently. It is especially useful in:

 Identifying key features of data.

 Detecting anomalies or outliers.

 Comparing datasets.

Key Characteristics

 Descriptive statistics do not involve making predictions or testing hypotheses.

 They focus solely on presenting the current state of data.

 They use graphs, tables, and summary measures to convey findings.

2. Variables

Variables are characteristics that can vary among individuals or objects. They are

fundamental in statistical analysis.

Types of Variables

1. Independent Variable:

o The variable manipulated or changed to observe its effect.

o Example: Amount of fertilizer applied to plants.

2. Dependent Variable:

o The variable measured or observed to determine the effect of the independent

variable.

o Example: Growth of plants in response to fertilizer.

Examples of Variables in Different Fields

 Business: Revenue (dependent) influenced by marketing budget (independent).

 Healthcare: Blood pressure (dependent) affected by medication dosage

(independent).

 Education: Test scores (dependent) impacted by study hours (independent).

3. Data Types

Data can be broadly classified into qualitative and quantitative types:

1. Qualitative Data:

o Represents categorical information such as gender, color, or type.

o Examples: Eye color (blue, green, brown), marital status (single, married).

2. Quantitative Data:

o Represents numerical information.

o Discrete Data:

 Can only take specific values.

 Example: Number of students in a class.

o Continuous Data:

 Can take any value within a given range.

 Example: Height, weight, temperature.

Levels of Measurement

1. Nominal Scale: Categorical data without any order (e.g., colors).

2. Ordinal Scale: Categorical data with a specific order (e.g., rankings).

3. Interval Scale: Numerical data without a true zero (e.g., temperature in Celsius).

4. Ratio Scale: Numerical data with a true zero (e.g., height, weight).

4. Measures of Central Tendency

Central tendency provides a single value that represents the center of a dataset.

Mean:

 The arithmetic average, calculated by summing all data points and dividing by

the number of points.

 Formula: Mean=∑xn\text{Mean} = \frac{\sum x}{n}

 Example: For data points 2, 4, 6, the mean is 2+4+63=4\frac{2 + 4 + 6}{3} = 4.

Median:

 The middle value in a sorted dataset. If the dataset has an even number of

values, the median is the average of the two middle values.

 Example: For data points 3, 5, 7, the median is 5.

Mode:

 The most frequently occurring value in a dataset. Some datasets may have more

than one mode or no mode at all.

 Example: For data points 1, 2, 2, 3, 4, the mode is 2.

5. Measures of Variability

Measures of variability describe the spread or dispersion of a dataset.

Range:

 The difference between the highest and lowest values in a dataset.

 Formula: Range=Maximum value−Minimum value\text{Range} = \text{Maximum

value} - \text{Minimum value}

 Example: For data points 10, 15, 20, the range is 20−10=1020 - 10 = 10.

Standard Deviation:

 Measures the average distance of each data point from the mean. A higher

standard deviation indicates greater variability.

 Formula: σ=∑(xi−μ)2n\sigma = \sqrt{\frac{\sum (x_i - \mu)^2}{n}}

 Example: For data points 2, 4, 6, with a mean of 4, the standard deviation is

(2−4)2+(4−4)2+(6−4)23\sqrt{\frac{(2-4)^2 + (4-4)^2 + (6-4)^2}{3}}.

Variance:

 The square of the standard deviation, showing the degree of spread in the dataset.

 Formula: σ2=∑(xi−μ)2n\sigma^2 = \frac{\sum (x_i - \mu)^2}{n}

 Example: For the same dataset, variance is σ2=2.67\sigma^2 = 2.67.

Interquartile Range (IQR):

 The range of the middle 50% of data, calculated as the difference between the third

quartile (Q3) and the first quartile (Q1).

 Formula: IQR=Q3−Q1\text{IQR} = Q3 - Q1

6. Data Visualization

Visualizing data helps convey information effectively and identify patterns.

Types of Charts and Graphs:

1. Frequency Distributions: Shows how often each value occurs.

2. Histograms: A bar chart representing the frequency distribution of a dataset.

3. Bar Charts: Used for comparing categorical data.

4. Pie Charts: Represents proportions as segments of a circle.

5. Scatter Plots: Visualizes relationships between two variables on a Cartesian

plane.

Advanced Visualization Techniques:

 Box Plots: Show the distribution of data and identify outliers.

 Heat Maps: Represent data density or intensity through color.

 Line Graphs: Display trends over time.

7. Applications of Descriptive Statistics

Descriptive statistics are widely used in various fields:

Business:

 Analyzing sales trends.

 Studying customer preferences.

Healthcare:

 Monitoring patient demographics.

 Tracking disease patterns.

Education:

 Evaluating student performance.

 Assessing classroom diversity.

Research:

 Summarizing experimental data.

 Communicating results to stakeholders.

8. Advanced Topics in Descriptive Statistics

Data Cleaning and Preparation:

Before applying descriptive statistical methods, data must be cleaned to remove errors,

outliers, and inconsistencies. This involves:

1. Identifying missing values and deciding whether to fill, exclude, or analyze separately.

2. Removing duplicate entries.

3. Normalizing or standardizing data for uniformity.

Exploratory Data Analysis (EDA):

EDA is a complementary process that uses descriptive statistics and data visualization

to uncover insights and detect patterns. Techniques include:

 Correlation Analysis: Identifying relationships between variables.

 Clustering: Grouping similar data points for segmentation.

Multivariate Descriptive Statistics:

This involves summarizing and analyzing data with multiple variables:

1. Covariance: Measures how two variables change together.

2. Correlation Coefficient: Indicates the strength and direction of a relationship

between variables.

9. Common Tools and Software

Several tools and software simplify the process of descriptive statistics:

1. Microsoft Excel: Widely used for basic calculations, graph creation, and

summary statistics.

2. R Programming: Open-source software for advanced statistical analysis and

visualization.

3. Python: Libraries like Pandas, NumPy, and Matplotlib aid in statistical

computations.

4. SPSS (Statistical Package for the Social Sciences): Commonly used in social

science research.

References

1. Gravetter, F. J., & Wallnau, L. B. (2016). Statistics for the Behavioral Sciences.

Cengage Learning.

2. Moore, D. S., Notz, W. I., & Fligner, M. A. (2018). The Basic Practice of

Statistics. W.H. Freeman.

3. Siegel, A. F. (2016). Practical Business Statistics. Academic Press.

4. Field, A. (2017). Discovering Statistics Using SPSS. Sage Publications.

5. McKinney, W. (2017). Python for Data Analysis. O'Reilly Media.

Assessment Test

Multiple Choice Questions

1. What does descriptive statistics primarily focus on? a. Drawing conclusions about

a population b. Summarizing and organizing data c. Testing hypotheses d.

Predicting future trends

2. Which of the following is an example of qualitative data? a. Weight b. Age c. Eye

color d. Income

3. What measure of central tendency is the most frequently occurring value? a.

Mean b. Median c. Mode d. Range

4. The difference between the highest and lowest values is called: a. Mean b.

Range c. Standard deviation d. Variance

5. Which measure describes the spread of data around the mean? a. Mode b.

Median c. Standard deviation d. Range

6. In a normal distribution, the mean, median, and mode are: a. Different b. Equal c.

Unrelated d. Undefined
7. Continuous data can: a. Only take specific values b. Take any value within a

range c. Be qualitative d. Only be integers

8. A histogram is used to represent: a. Relationships between variables b.

Frequency distributions c. Categorical data d. Percentages

9. Variance is calculated as: a. Square root of the standard deviation b. Square of

the standard deviation c. Mean of data points d. Difference between maximum

and minimum values

10. Pie charts are best suited for: a. Displaying frequencies b. Showing proportions

c. Comparing continuous data d. Analyzing relationships

Enumeration

1. List the three measures of central tendency.

2. Identify two types of quantitative data.

3. Mention three common data visualization techniques.

4. State two fields where descriptive statistics are applied.

5. Enumerate three measures of variability.

Answer Key

Multiple Choice Answers:

1. b

2. c

3. c

4. b

5. c

6. b

7. b

8. b

9. b

10. b

Enumeration Answers:

1. Mean, Median, Mode

2. Discrete, Continuous

3. Frequency distributions, Histograms, Scatter plots

4. Business, Healthcare
5. Range, Standard Deviation, Variance

Football Betting Secrets
100% (2)
Football Betting Secrets
35 pages
MCQ Normal Distribution With Correct Answers
100% (18)
MCQ Normal Distribution With Correct Answers
6 pages
Unit 2 - Merged
No ratings yet
Unit 2 - Merged
17 pages
MS102
No ratings yet
MS102
9 pages
Contents UNIT 42
No ratings yet
Contents UNIT 42
21 pages
Research Report
No ratings yet
Research Report
47 pages
Chapter2-Statistical Analysis
No ratings yet
Chapter2-Statistical Analysis
86 pages
EDA - Reviewer Midterm
No ratings yet
EDA - Reviewer Midterm
9 pages
EDA - Reviewer Midterm
No ratings yet
EDA - Reviewer Midterm
8 pages
02data Edited v2
No ratings yet
02data Edited v2
43 pages
Iba Unit - Ii
No ratings yet
Iba Unit - Ii
31 pages
Bustat Reviewer
No ratings yet
Bustat Reviewer
6 pages
Ahsan Stats
No ratings yet
Ahsan Stats
9 pages
Introduction To Descriptive Statistics I: Sanju Rusara Seneviratne Mbpss
No ratings yet
Introduction To Descriptive Statistics I: Sanju Rusara Seneviratne Mbpss
35 pages
Statistics and Its Types (v1.0)
No ratings yet
Statistics and Its Types (v1.0)
6 pages
Quantitative Data Analysis Thru Descriptive Statistics
No ratings yet
Quantitative Data Analysis Thru Descriptive Statistics
6 pages
Business Analytics
No ratings yet
Business Analytics
44 pages
Unit .......
No ratings yet
Unit .......
45 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
26 pages
Article Review 1 Eng
No ratings yet
Article Review 1 Eng
30 pages
Pointers To Review Statistics
No ratings yet
Pointers To Review Statistics
6 pages
Advanced Statistics1
No ratings yet
Advanced Statistics1
19 pages
Session 1 On Descriptive Statistics
No ratings yet
Session 1 On Descriptive Statistics
24 pages
Statistics
No ratings yet
Statistics
152 pages
02 Exploratory Data Analytics
No ratings yet
02 Exploratory Data Analytics
41 pages
Stats 1 Module Updated
No ratings yet
Stats 1 Module Updated
53 pages
Unit 2
No ratings yet
Unit 2
20 pages
Module 3 Data Analysis Techniques
No ratings yet
Module 3 Data Analysis Techniques
55 pages
Statistical Analysis - Descriptive Stat
No ratings yet
Statistical Analysis - Descriptive Stat
6 pages
Module 1
No ratings yet
Module 1
64 pages
Deck 1 - Data Types, Data Display, and Summary 2024F
No ratings yet
Deck 1 - Data Types, Data Display, and Summary 2024F
42 pages
Descriptive Statistics and Exploratory Data Analysis
No ratings yet
Descriptive Statistics and Exploratory Data Analysis
36 pages
Day 01-Basic Statistics
No ratings yet
Day 01-Basic Statistics
36 pages
C4 Descriptive Statistics
No ratings yet
C4 Descriptive Statistics
34 pages
Statistics Notes Self Made
100% (1)
Statistics Notes Self Made
41 pages
Week 8 Quantitative Data Analysis - Descriptive Statistics
No ratings yet
Week 8 Quantitative Data Analysis - Descriptive Statistics
59 pages
Creative and Minimal Portfolio Presentation
No ratings yet
Creative and Minimal Portfolio Presentation
5 pages
Notes Stats
No ratings yet
Notes Stats
21 pages
UNIT II - Statistics For Data Science - New
No ratings yet
UNIT II - Statistics For Data Science - New
153 pages
Ge8 Statistics
No ratings yet
Ge8 Statistics
2 pages
Guiang Mamow Paper 1 Statistical Terms
No ratings yet
Guiang Mamow Paper 1 Statistical Terms
5 pages
SCA - Module 4
No ratings yet
SCA - Module 4
49 pages
1.9 Data and Data Analysis
No ratings yet
1.9 Data and Data Analysis
31 pages
CS822 DataMining Week2
No ratings yet
CS822 DataMining Week2
28 pages
614 Descriptive Statistcs
No ratings yet
614 Descriptive Statistcs
56 pages
Statistics
No ratings yet
Statistics
21 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
63 pages
Research Method Lecture Notes
No ratings yet
Research Method Lecture Notes
32 pages
Unit 2 Fod
No ratings yet
Unit 2 Fod
32 pages
Ssmda End Sem
No ratings yet
Ssmda End Sem
152 pages
Descriptive Statistics Summary (Session 1-5) : Types of Data - Two Types
No ratings yet
Descriptive Statistics Summary (Session 1-5) : Types of Data - Two Types
4 pages
Chapter 2 - Understand Data
No ratings yet
Chapter 2 - Understand Data
63 pages
Lesson 5 (Descriptive Statistics Part 1) - Oct 2024
No ratings yet
Lesson 5 (Descriptive Statistics Part 1) - Oct 2024
72 pages
02 Kinds of Data
No ratings yet
02 Kinds of Data
41 pages
Data Analysis
No ratings yet
Data Analysis
43 pages
8409 Statistics
No ratings yet
8409 Statistics
17 pages
02 Data
No ratings yet
02 Data
36 pages
ch-2 Data Analysis and Interpritaion
No ratings yet
ch-2 Data Analysis and Interpritaion
40 pages
IT326 - Ch2
No ratings yet
IT326 - Ch2
44 pages
Introduction To Non Parametric Methods Through R Software
From Everand
Introduction To Non Parametric Methods Through R Software
Editor IJSMI
No ratings yet
Overview Of Bayesian Approach To Statistical Methods: Software
From Everand
Overview Of Bayesian Approach To Statistical Methods: Software
Vinaitheerthan Renganathan
No ratings yet
Machine Learning - A Complete Exploration of Highly Advanced Machine Learning Concepts, Best Practices and Techniques: 4
From Everand
Machine Learning - A Complete Exploration of Highly Advanced Machine Learning Concepts, Best Practices and Techniques: 4
Peter Bradley
No ratings yet
Chapter 7 Binomial, Normal, and Poisson Distributions
No ratings yet
Chapter 7 Binomial, Normal, and Poisson Distributions
2 pages
SmartPLS Report
No ratings yet
SmartPLS Report
201 pages
Algebra 1 Unit 6 Describing Data Notes
No ratings yet
Algebra 1 Unit 6 Describing Data Notes
13 pages
Simplified Field Table - Lenght For Age - Boys
100% (1)
Simplified Field Table - Lenght For Age - Boys
4 pages
Data Analytics Syllabus
No ratings yet
Data Analytics Syllabus
15 pages
2 - Excel Example With 3 Assets
No ratings yet
2 - Excel Example With 3 Assets
13 pages
Measurement of Variability
No ratings yet
Measurement of Variability
11 pages
Assignment # 1
No ratings yet
Assignment # 1
3 pages
Regression Metrics
No ratings yet
Regression Metrics
26 pages
Classification and Tabulation
No ratings yet
Classification and Tabulation
75 pages
SPSS Answers (Chapter 5)
No ratings yet
SPSS Answers (Chapter 5)
6 pages
AppliedStatistics PDF
No ratings yet
AppliedStatistics PDF
401 pages
Biostatistics End of Semester Exam Notes
No ratings yet
Biostatistics End of Semester Exam Notes
26 pages
HW - Session 3 1
No ratings yet
HW - Session 3 1
3 pages
Marikina Polytechnic College Graduate School Exercises 3 EDUC 602 - Statistics in Education Name of Students: - and - Score
0% (1)
Marikina Polytechnic College Graduate School Exercises 3 EDUC 602 - Statistics in Education Name of Students: - and - Score
2 pages
Chapter 2
No ratings yet
Chapter 2
37 pages
MATH1041 Assignment
No ratings yet
MATH1041 Assignment
8 pages
Sampling
No ratings yet
Sampling
50 pages
Variability
100% (1)
Variability
20 pages
5 1 Representation of Data Hard
No ratings yet
5 1 Representation of Data Hard
29 pages
LAS in PRACTICAL RESEARCH 2 QUARTER 2 Week 3-4
No ratings yet
LAS in PRACTICAL RESEARCH 2 QUARTER 2 Week 3-4
17 pages
Lecture Notes in Financial Econometrics (MSC Course) : Paul Söderlind 13 June 2013
No ratings yet
Lecture Notes in Financial Econometrics (MSC Course) : Paul Söderlind 13 June 2013
348 pages
Pengaruh Kompensasi Dan Disiplin Kerja Terhadap Kinerja Karyawan PT Pama Persada Nusantara Di Sangatta-Kutai Timur
No ratings yet
Pengaruh Kompensasi Dan Disiplin Kerja Terhadap Kinerja Karyawan PT Pama Persada Nusantara Di Sangatta-Kutai Timur
14 pages
S3 - Measures of Central Tendency of Grouped Data
No ratings yet
S3 - Measures of Central Tendency of Grouped Data
24 pages
Uji VALIDITAS
No ratings yet
Uji VALIDITAS
3 pages
Statistical Inference
No ratings yet
Statistical Inference
69 pages
Unit Pengolahan Data: Lampiran Hasil Olah Data
No ratings yet
Unit Pengolahan Data: Lampiran Hasil Olah Data
4 pages
6.1-6.4 Review
No ratings yet
6.1-6.4 Review
5 pages