Statistics
Statistics is a mathematical science that includes methods for collecting, organizing, analyzing
and visualizing data in such a way that meaningful conclusions can be drawn.
Statistics is also a field of study that summarizes data, interprets it, and supports decisions
based on the data.
Statistics is composed of two broad categories:
1. Descriptive Statistics
2. Inferential Statistics
1. Descriptive Statistics
Descriptive statistics describes the characteristics or properties of the data. It helps to summarize
the data in a meaningful way and allows important patterns to emerge from the data. Data
summarization techniques are used to identify the properties of the data and are helpful in
understanding its distribution. Descriptive statistics does not involve generalizing beyond the data
at hand.
1.1 Two types of descriptive statistics
1. Measures of central tendency (mean, median, mode)
2. Measures of data spread or dispersion (range, quartiles, variance, and standard deviation)
Measures of spread are the ways of summarizing a group of data by describing how scores are
spread out. To describe this spread, a number of statistics are available to us, including the range,
quartiles, absolute deviation, variance and standard deviation.
• The degree to which numerical data tend to spread is called the dispersion, or variance, of the data.
The common measures of data dispersion are the range, quartiles, outliers, and boxplots.
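As a concrete illustration (not part of the original notes), the measures above can be computed with Python's built-in statistics module on a small made-up sample:

```python
import statistics

data = [4, 8, 6, 5, 3, 8, 7, 6, 8, 5]  # hypothetical scores

# Measures of central tendency
mean = statistics.mean(data)      # arithmetic average
median = statistics.median(data)  # middle value of the sorted data
mode = statistics.mode(data)      # most frequent value

# Measures of spread (dispersion)
data_range = max(data) - min(data)
q1, q2, q3 = statistics.quantiles(data, n=4)  # quartiles
variance = statistics.variance(data)          # sample variance
std_dev = statistics.stdev(data)              # sample standard deviation

print(mean, median, mode, data_range, variance, std_dev)
```

For this sample the mean and median are both 6, while the mode is 8, showing that the three measures of central tendency need not agree.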
2. Inferential Statistics
Inferential statistics is generally used when the user needs to draw a conclusion about a whole
population, and this is done using the various types of statistical tests available. It is a technique
used to understand trends and draw conclusions about a large population by taking and analyzing a
sample from it. Descriptive statistics, on the other hand, is concerned only with the smaller data set
at hand and does not usually involve large populations. Using the variables in the sample and the
relationships between them, we can make generalizations and predict relationships within the whole
population, regardless of how large it is.
With inferential statistics, data is taken from samples and generalizations are made about a
population. Inferential statistics use statistical models to compare sample data to other samples or
to previous research.
1. Estimating parameters:
This means taking a statistic from the sample data (for example, the sample mean) and using it to
infer a population parameter (i.e., the population mean). There may be sampling variations because
of chance fluctuations, variations in sampling techniques, and other sampling errors, and estimates
of population characteristics may be influenced by such factors. Therefore, the important point in
estimation is the extent to which our estimate is close to the true value.
Characteristics of a Good Estimator: A good statistical estimator should have the following
characteristics: (i) unbiased, (ii) consistent, (iii) accurate.
i) Unbiased
An unbiased estimator is one for which, if we were to obtain an infinite number of random samples of
a certain size, the mean of the statistic would be equal to the parameter. The sample mean x̄ is
an unbiased estimate of the population mean μ because, if we look at all possible random samples of
size N from the population, the mean of those sample means would be equal to μ.
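A small simulation can illustrate this (the population values below are made up): averaging the means of many random samples recovers the population mean μ.

```python
import random

random.seed(0)  # for reproducibility
population_mean = 50  # hypothetical μ
sample_size = 10

# Draw many random samples and record each sample mean.
sample_means = []
for _ in range(20000):
    sample = [random.gauss(population_mean, 10) for _ in range(sample_size)]
    sample_means.append(sum(sample) / sample_size)

# The mean of the sample means is very close to μ.
mean_of_means = sum(sample_means) / len(sample_means)
print(round(mean_of_means, 2))
```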
ii) Consistent
A consistent estimator is one for which, as the sample size increases, the probability that the
estimate has a value close to the parameter also increases. Because the sample mean is a consistent
estimator, a sample mean based on 20 scores has a greater probability of being close to μ than does a
sample mean based on only 5 scores.
iii) Accuracy
The sample mean is an unbiased and consistent estimator of the population mean μ. But we should not
overlook the fact that an estimate is just a rough or approximate calculation. In any given sample it
is unlikely that x̄ will be exactly equal to μ. Whether or not x̄ is a good estimate of μ depends upon
the representativeness of the sample, the sample size, and the variability of scores in the
population.
2. Hypothesis tests:
This is where sample data can be used to answer research questions. For example, we might be
interested in knowing whether a new cancer drug is effective, or whether eating breakfast helps
children perform better in school.
Inferential statistics is closely tied to the logic of hypothesis testing. We hypothesize that a
particular value characterizes the population of observations; the question is whether that
hypothesis is reasonable given the evidence from the sample. For this reason, hypothesis testing is
sometimes referred to as a statistical decision-making process, of the kind we carry out in
day-to-day situations.
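As a minimal sketch of this logic, here is a two-sided one-sample z-test, one of the simplest hypothesis tests; it assumes the population standard deviation is known, and the sample values, μ0, and σ below are made up for illustration.

```python
import math

def z_test(sample, mu0, sigma):
    """Test H0: the population mean equals mu0, assuming known sigma."""
    n = len(sample)
    x_bar = sum(sample) / n
    z = (x_bar - mu0) / (sigma / math.sqrt(n))
    # Two-sided p-value from the standard normal CDF (via math.erf).
    p_value = 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))
    return z, p_value

sample = [52, 55, 49, 58, 51, 54, 53, 56, 50, 57]  # hypothetical scores
z, p = z_test(sample, mu0=50, sigma=5)
print(round(z, 3), round(p, 4))
# A small p-value (e.g. below 0.05) is evidence against H0.
```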
Descriptive Statistics                                         | Inferential Statistics
Concerned with describing the target population                | Makes inferences from the sample and generalizes them to the population
Organizes, analyzes, and presents the data in a meaningful way | Compares, tests, and predicts future outcomes
The analyzed results are in the form of graphs, charts, etc.   | The analyzed results are probability scores
Describes data that are already known                          | Tries to draw conclusions about the population beyond the available data
Tools: measures of central tendency and measures of spread     | Tools: hypothesis tests, analysis of variance, etc.
Random Variables
A random variable, X, is a variable whose possible values are numerical outcomes of a random
phenomenon. There are two types of random variables, discrete and continuous.
A discrete random variable is one which may take on only a countable number of distinct values such
as 0, 1, 2, 3, 4, ... Discrete random variables are usually counts. If a random variable can take only a
finite number of distinct values, then it must be discrete. Examples of discrete random variables
include the number of children in a family, the Friday night attendance at a cinema, the number of
patients in a doctor's surgery, the number of defective light bulbs in a box of ten, and the length
(in characters) of a tweet.
The probability distribution of a discrete random variable is a list of probabilities associated with
each of its possible values. It is also sometimes called the probability function or the probability
mass function.
Suppose a random variable X may take k different values, with the probability that X = xi defined to
be P(X = xi) = pi. The probabilities pi must satisfy the following:
1: 0 ≤ pi ≤ 1 for each i;
2: p1 + p2 + ... + pk = 1.
Example
Suppose a variable X can take the values 1, 2, 3, or 4. The probabilities associated with each outcome
are described by the following table:
Outcome      1    2    3    4
Probability  0.1  0.3  0.4  0.2
The probability that X is equal to 2 or 3 is the sum of the two probabilities:
P(X = 2 or X = 3) = P(X = 2) + P(X = 3) = 0.3 + 0.4 = 0.7. Similarly, the probability that X is
greater than 1 is equal to 1 - P(X = 1) = 1 - 0.1 = 0.9, by the complement rule.
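These calculations can be sketched in Python by storing the table as a dictionary mapping outcomes to probabilities:

```python
# The table above as a probability mass function.
pmf = {1: 0.1, 2: 0.3, 3: 0.4, 4: 0.2}

# Sanity check: probabilities are non-negative and sum to 1.
assert all(p >= 0 for p in pmf.values())
assert abs(sum(pmf.values()) - 1.0) < 1e-9

p_2_or_3 = pmf[2] + pmf[3]  # P(X = 2 or X = 3) = 0.3 + 0.4
p_greater_1 = 1 - pmf[1]    # P(X > 1) = 1 - 0.1, by the complement rule
print(p_2_or_3, p_greater_1)
```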
A continuous random variable is one which takes an infinite number of possible values. Continuous
random variables are usually measurements. Examples include height, weight, the amount of sugar
in an orange, the time required to run a mile.
A continuous random variable is not defined at specific values. Instead, it is defined over an interval
of values, and is represented by the area under a curve (known as an integral). The probability of
observing any single value is equal to 0, since the number of values which may be assumed by the
random variable is infinite.
Suppose a random variable X may take all values over an interval of real numbers. Then the
probability that X is in the set of outcomes A, P(A), is defined to be the area above A and under a
curve. The curve, which represents a function p(x), must satisfy the following:
1: The curve has no negative values (p(x) ≥ 0 for all x);
2: The total area under the curve is equal to 1.
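The standard requirements are that the curve never dips below zero (p(x) ≥ 0 for all x) and that the total area under it equals 1. A numerical sketch, using the hypothetical density p(x) = 2x on [0, 1] and zero elsewhere:

```python
def p(x):
    """Hypothetical density: p(x) = 2x on [0, 1], zero elsewhere."""
    return 2 * x if 0 <= x <= 1 else 0.0

def area(f, a, b, steps=100_000):
    """Approximate the integral of f over [a, b] with the midpoint rule."""
    h = (b - a) / steps
    return sum(f(a + (i + 0.5) * h) for i in range(steps)) * h

total = area(p, 0, 1)   # total area under the curve: approximately 1
prob = area(p, 0.5, 1)  # P(0.5 <= X <= 1): the area above [0.5, 1]
print(round(total, 4), round(prob, 4))
```

Note that P(X = 0.5) itself is 0, since the area above a single point is zero; only intervals carry probability.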
All random variables (discrete and continuous) have a cumulative distribution function. It is a
function giving the probability that the random variable X is less than or equal to x, for every
value x. For a discrete random variable, the cumulative distribution function is found by summing up
the probabilities of all outcomes less than or equal to x.
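Using the pmf from the earlier example, the discrete cumulative distribution function can be sketched as a running sum of probabilities:

```python
pmf = {1: 0.1, 2: 0.3, 3: 0.4, 4: 0.2}  # pmf from the earlier example

def cdf(x):
    """F(x) = P(X <= x): sum the probabilities of outcomes not exceeding x."""
    return sum(p for value, p in pmf.items() if value <= x)

# F is a non-decreasing step function that rises from 0 to 1.
print(cdf(0), cdf(1), cdf(2), cdf(3), cdf(4))
```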