0% found this document useful (0 votes)
5 views25 pages

Lecture 1 STA201

The document is a lecture on the elements of statistics and probability, covering topics such as the history of statistics, basic statistical terms, and the branches of statistics. It emphasizes the importance of data science as an interdisciplinary field that combines skills from statistics, computer science, and domain expertise. Additionally, it discusses various types of data, sampling methods, and the applications of statistics across different fields.

Uploaded by

anasamin820
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views25 pages

Lecture 1 STA201

The document is a lecture on the elements of statistics and probability, covering topics such as the history of statistics, basic statistical terms, and the branches of statistics. It emphasizes the importance of data science as an interdisciplinary field that combines skills from statistics, computer science, and domain expertise. Additionally, it discusses various types of data, sampling methods, and the applications of statistics across different fields.

Uploaded by

anasamin820
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 25

Elements of Statistics and Probability

Lecture 1
Introduction to Statistics and Data Science

Dipu Ray
Adjunct Lecturer

BRAC University

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 1 / 25
Contents

1 Introduction

2 Data Science

3 Basic Statistical terms

4 Branches of Statistics

5 Scope and Application of Statistics

6 References

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 2 / 25
History of Statistics

Statistics is the art of learning from data.

Statistics was designed for the systematic collection of demographic


and economic data by states in 18th century.[3]

Origin of the word


the Latin word “Status” or

Italian word “Statista” or

German word “Statistik”

each of which means a political state or data analysis of a state.

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 3 / 25
What is Statistics?

Statistics is a branch of Applied Mathematics dealing with collecting,


organizing, summarizing, analyzing and making decisions from data.

“A set of methods for making decisions or predictions in the presence


of uncertainty.”-David S. Moore.

The science of organizing and summarizing numerical or nonnumerical


information.

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 4 / 25
Data Science

Data Science is an interdisciplinary subject.

The best existing definition of data science is illustrated by Drew


Conway’s Data Science Venn Diagram.

Figure: Drew Conway’s Data Science Venn Diagram

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 5 / 25
Data Science

Three distinct and overlapping areas:

the skills of a Statistician who knows how to model and summarize


datasets.

the skills of a computer scientist who can design and use algorithms
to efficiently store, process, and visualize this data.

the domain expertise— what we might think of as classical training


in a subject— necessary both to formulate the right questions and to
put their answers in context.

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 6 / 25
Basic Statistical terms

! Variable: The characteristics about which information can be


collected.

! Distribution: The way a variable’s values are spread over the possible
values.

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 7 / 25
Continue...

Figure: Types of Variables


Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 8 / 25
Continue...

Quantitative Variable
Discrete variables assume values that can be counted.

Example: Number of accidents in a city, where we have 1, 2, 3, ...ork


accidents.

Continuous variables assume all values between any two specific


values, i.e. they take all values in an interval.

They often include fractions and decimals.

Example: The height of a student in a Country is between 110 cm


(person elf) and 226 cm (person giant).

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 9 / 25
Continue...

Qualitative Variable
A variable that cannot assume a numerical value but can be classified
into two or more non-numeric categories.

The categories are mutually exclusive (disjoint).

Example: Eye color: black, brown, blue, green, ..., the brand of a
computer, opinion of people etc.

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 10 / 25
Continue...

Exercise
1 What are the differences between discrete variable and continuous
variable?

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 11 / 25
Continue...

Basis
Discrete Variable Continuous Variable
for Comparison
Meaning Variable assumes a finite number of isolated values Variable assumes infinite number of different values
Values Values are obtained by counting Values are obtained by measuring
Classification Non-overlapping Overlapping
Assumes Distinct or separate values Any value between the two values

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 12 / 25
Continue...

Data
⋄ Data: Values of a variable.

⋄ Qualitative data: Values of a qualitative variable.

⋄ Quantitative data: Values of a quantitative variable.

⋄ Discrete data: Values of a discrete variable.

⋄ Continuous data: Values of a continuous variable.

⋄ Observation: Each individual piece of data.

⋄ Dataset: The collection of all observations for a particular variable or


a set of variables.

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 13 / 25
Continue...

Cross-section data versus Time series data


[2]

Cross-section data: Data collected on different elements at the


same point in time or for the same period of time.

Time-series data: Data that give the values of the same variable for
the same element at different points in time or for different periods of
time.

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 14 / 25
Continue...

Population and Sample


Population consists of all elements-individuals, items, or
objects—whose characteristics are being studied.

Sample is a subset of the population.

Parameter and Statistic


Parameter is a characteristic of the population.

Statistic is a characteristic of the sample.

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 15 / 25
Continue...

Census and Sample Survey


Survey: Collection of data on the elements of a population or sample.

Census is a survey that includes every member of the population.

Sample survey is the technique of collecting information from a


portion of the population.

Example: If we collect information on the 2024 incomes of all families in


Dhaka, it will be referred to as a census. On the other hand, if we collect
information on the 2024 incomes of 50 families from Dhaka, it will be
called a sample survey.

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 16 / 25
Continue...

Representative Sample and Random Sample


Representative Sample: A sample that represents the
characteristics of the population as closely as possible.

Random Sample: A sample drawn in such a way that each element


of the population has a chance of being selected is called a random
sample.

A sample may be selected with or without replacement.

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 17 / 25
Exercise

1 We are interested in determining the average height of 4th-semester


students of BRAC University. Find out the population, sample,
parameter and statistic.
2 Classify each variable as qualitative or quantitative.
(i) Time needed to finish the exam.
(ii) Colors of basketball team T-shirts.
(iii) Weights of luggage of passengers.
(iv) Classification of children in a day care center according to gender.

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 18 / 25
Continue...

3 Classify each variable as discrete or continuous


(i) Lifetime (in hours) of table lamps.
(ii) Number of cars rented each week.
(iii) Number of cups sold each day by coffee shop.
(iv) Weights of boys in a school

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 19 / 25
Sources of Data

Sources of Data
Primary sources: Internal sources

Secondary sources: External sources

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 20 / 25
Branches of Statistics

Descriptive Statistics: Descriptive Statistics consists of methods for


organizing, displaying, and describing data by using tables, graphs,
and summary measures.

Inferential Statistics: Inferential Statistics consists of methods that


use sample results to help make decisions or predictions about a
population.[2]

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 21 / 25
Continue...

Figure: The relationship between probability and inferential statistics

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 22 / 25
Scope and Applications of Statistics [1]
Economics Astronomy

Business Geology

Actuary Geography
Computer Science Social science
Ecology Politics
Medicine Psychology
Engineering Environmental studies

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 23 / 25
Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 24 / 25
References

[1] Jay L Devore et al. Probability and statistics for engineering and the
sciences/Jay Devore. 2013.

[2] Prem S Mann. Introductory statistics. John Wiley & Sons, 2010.

[3] Sheldon Ross. Probability and statistics for engineers and scientists.
Vol. 16. Elsevier, New Delhi, 2009.

Dipu Ray Adjunct Lecturer (BRAC University) Elements of Statistics and Probability October 27, 2024 25 / 25

You might also like