0% found this document useful (0 votes)
220 views66 pages

Statistics Probability

The document discusses introductory concepts in statistics including what statistics is, why it is important, where it is used, sampling methods, measures of central tendency, measures of variation, correlation, and the use of tables and charts. Key concepts covered include mean, median, mode, range, percentiles, quartiles, variance, standard deviation, positive correlation, negative correlation, and no correlation.

Uploaded by

bizzpy n
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
220 views66 pages

Statistics Probability

The document discusses introductory concepts in statistics including what statistics is, why it is important, where it is used, sampling methods, measures of central tendency, measures of variation, correlation, and the use of tables and charts. Key concepts covered include mean, median, mode, range, percentiles, quartiles, variance, standard deviation, positive correlation, negative correlation, and no correlation.

Uploaded by

bizzpy n
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 66

Copyright Intellipaat. All rights reserved.

01 Intro to Statistics 02 Sampling

03 Central Tendencies 04 Variation

05 Correlation 06 Importance of Tables and charts

Hypothesis Testing , Estimation


07 And Goodness of Fit 08 Probability

Copyright Intellipaat. All rights reserved.


Introduction to
Statistics

Copyright Intellipaat. All rights reserved.


What is Statistics?
Copyright Intellipaat. All rights reserved.
What is Statistics?

Statistics is a branch of Mathematics that deals with collection, analyzing, and


interpreting large amounts of data.

Copyright IntelliPaat, All rights reserved


Why is Statistics important?
Copyright Intellipaat. All rights reserved.
Why is Statistics important?

Statistics allows us to derive knowledge from large datasets and this knowledge can
then be used to make predictions, decisions, classifications etc.

Copyright IntelliPaat, All rights reserved


Where is Statistics used?

Copyright Intellipaat. All rights reserved.


Where is Statistics used?

Statistics are used in various fields, some of them are:

Sales Weather
Medical Research Stock Market
Projection Forecasting

Copyright IntelliPaat, All rights reserved


Sampling

Copyright Intellipaat. All rights reserved.


Sampling

Sampling is the process of collecting data to perform analysis on

Copyright IntelliPaat, All rights reserved


Sample vs Population

Copyright Intellipaat. All rights reserved.


Sample vs Population

Population is the entire dataset such as the whole population of a country, Sample is subset of that
population which is analyzed to make inferences

Copyright IntelliPaat, All rights reserved


Random Sampling

Copyright Intellipaat. All rights reserved.


Random Sampling

Random Sampling is the process of selecting a subset / sample from a population in


such a way that every data point is equally likely to be included in the sample

Copyright IntelliPaat, All rights reserved


Stratified Sampling

Copyright Intellipaat. All rights reserved.


Stratified Sampling

Stratified Sampling is the process of dividing your samples into layers or groups and
then performing random sampling for each group

Copyright IntelliPaat, All rights reserved


Central Tendencies

Copyright Intellipaat. All rights reserved.


Central Tendencies

Central Tendency is used to indicate where does the middle or center of the
distribution of our data lies

Copyright IntelliPaat, All rights reserved


Mean

Copyright Intellipaat. All rights reserved.


Mean

Mean is the average of the data. In simpler terms it’s the sum of values divided by total
number of values. It’s represented by Greek letter Sigma

Mean

Copyright IntelliPaat, All rights reserved


Mode

Copyright Intellipaat. All rights reserved.


Mode

Mode is used to indicate the most frequent data point, in other words the one which
occurs most number of times

Mode

Copyright IntelliPaat, All rights reserved


Median

Copyright Intellipaat. All rights reserved.


Median

Median is the middle of the data. If the data is arranged in ascending order then the
data element which occurs right at the center is the median

Median

Copyright IntelliPaat, All rights reserved


Variation

Copyright Intellipaat. All rights reserved.


Variation

Variation in statistics is used to show how data is dispersed, or spread out. Several
measures of variation are used in statistics.

Range Quartiles Variance

Copyright IntelliPaat, All rights reserved


Range
Copyright Intellipaat. All rights reserved.
Range

Range is the difference between the highest and the lowest values in our dataset. Range
tells us the distance between the lowest and highest values in our data

Copyright IntelliPaat, All rights reserved


Percentiles

Copyright Intellipaat. All rights reserved.


Percentiles

Percentiles are scores that are used to describe a value below which some Observations fall.
E.g.: If X is at 70th Percentile it mean 70% of other data points from our sample are below X

Copyright IntelliPaat, All rights reserved


Quartiles
Copyright Intellipaat. All rights reserved.
Quartiles

Quartiles are used to break the data into 4 parts so as to better find the spread of data in a
way that is less influenced by outliers.

Quartiles are expressed in percentiles. 1st Quartile is 25th Percentile, 2nd Quartile is 50th
Percentile (Median) and 3rd Quartile is 75th Percentile

Copyright IntelliPaat, All rights reserved


Interquartile Range (IQR)
Copyright Intellipaat. All rights reserved.
Interquartile Range (IQR)

Interquartile Range (IQR) is the difference between the lower and upper quartile. This gives
us a better idea of the range of data.

Copyright IntelliPaat, All rights reserved


Standard Variance and
Standard Deviation
Copyright Intellipaat. All rights reserved.
Standard Variance and Standard Deviation

Standard Variance measures how far a set of numbers are spread out from their average
value.

Standard Deviation is used to express the magnitude by which the members of a group differ
from the mean value for the group.

Standard Deviation is the square root of Standard Variance.

Copyright IntelliPaat, All rights reserved


Correlation
Copyright Intellipaat. All rights reserved.
Correlation

Correlation is a term that is a measure of the strength of a linear relationship between two
quantitative variables

Copyright IntelliPaat, All rights reserved


Positive Correlation
Copyright Intellipaat. All rights reserved.
Positive Correlation

Positive Correlation is a term that is used to describe a positive linear relationship between
two quantitative variables

Positive
Correlation

Copyright IntelliPaat, All rights reserved


No Correlation
Copyright Intellipaat. All rights reserved.
No Correlation

No Correlation is a term used to describe no linear relationship between two quantitative


variables

No
Correlation

Copyright IntelliPaat, All rights reserved


Negative Correlation
Copyright Intellipaat. All rights reserved.
Negative Correlation

Negative Correlation is a term that is used to describe the strength of a Negative linear
relationship between two quantitative variables

Negative
Correlation

Copyright IntelliPaat, All rights reserved


Tables

Copyright Intellipaat. All rights reserved.


Tables

A way of presenting statistical data through a systematic arrangement of the numbers


describing some mass phenomenon or process

A statistical table may be regarded as representing a subject and predicate. The meaning of each
number is indicated by the headings of the corresponding row and column.

Copyright IntelliPaat, All rights reserved


Charts
Copyright Intellipaat. All rights reserved.
Charts

A statistical graph or chart is defined as the pictorial representation of statistical data in


graphical form. The statistical graphs are used to represent a set of data to make it easier to
understand and interpret statistical information.

Copyright IntelliPaat, All rights reserved


Charts

A statistical graph or chart is defined as the pictorial representation of statistical data in


graphical form. The statistical graphs are used to represent a set of data to make it easier to
understand and interpret statistical information.

Lets list down the types of charts

Copyright IntelliPaat, All rights reserved


Charts

A statistical graph or chart is defined as the pictorial representation of statistical data in


graphical form. The statistical graphs are used to represent a set of data to make it easier to
understand and interpret statistical information.

Lets list down the types of charts

Types Of Charts
1.Bar chart
2.Histogram
3.Pie chart
4.Box chart
5.Line Graph
6.Scatter plot

Copyright IntelliPaat, All rights reserved


1. Bar chart

Bar charts are among the most frequently used chart types. As the name suggests
a bar chart is composed of a series of bars illustrating a variable’s development.
Given that bar charts are such a common chart type, people are generally familiar
with them and can understand them easily

Copyright IntelliPaat, All rights reserved


2. Histogram

A series of bins showing us the frequency of observations of a given variable. The


definition of histogram charts is short and easy..

Copyright IntelliPaat, All rights reserved


3. Box chart

Box plot, also called the box-and-whisker plot: a way to show the distribution of
values based on the five-number summary: minimum, first quartile, median, third
quartile, and maximum.

Copyright IntelliPaat, All rights reserved


4. Pie chart

A pie chart is a circular graph divided into slices. The larger a slice is the bigger
portion of the total quantity it represents.

Copyright IntelliPaat, All rights reserved


5. Line chart

A line chart is, as one can imagine, a line or multiple lines showing how single, or
multiple variables develop over time. It is a great tool because we can easily
highlight the magnitude of change of one or more variables over a period.

Copyright IntelliPaat, All rights reserved


7. Scatter plot

A scatter plot is a type of chart that is often used in the fields of statistics and data
science. It consists of multiple data points plotted across two axes. Each variable
depicted in a scatter plot would have multiple observations. If a scatter plot includes
more than two variables, then we would use different colors to signify that.

Copyright IntelliPaat, All rights reserved


Hypothesis Testing and Estimation

Hypothesis testing refers to the process of making inferences or educated guesses about a
particular parameter. This can either be done using statistics and sample data, or it can be
done on the basis of an uncontrolled observational study.

Estimation, in statistics, any of numerous procedures used to calculate the value of some
property of a population from observations of a sample drawn from the population.

Copyright IntelliPaat, All rights reserved


Goodness of Fit

The goodness-of-fit test is a statistical hypothesis test to see how well sample data fit a
distribution from a population.
his test shows if your sample data represents the data you would expect to find in the actual
population

Copyright IntelliPaat, All rights reserved


Goodness of Fit

The goodness-of-fit test is a statistical hypothesis test to see how well sample data fit a
distribution from a population.
his test shows if your sample data represents the data you would expect to find in the actual
population

Goodness-of-fit establishes the discrepancy between the observed values and those that
would be expected of the model in a normal distribution case.

Copyright IntelliPaat, All rights reserved


Probability
Copyright Intellipaat. All rights reserved.
Introduction to Probability

Probability defines the likelihood of occurrence of an event

Copyright IntelliPaat, All rights reserved


Introduction to Probability

Probability defines the likelihood of occurrence of an event

Probability can be defined as the ratio of the number of favorable outcomes to the total
number of outcomes of an event.

Copyright IntelliPaat, All rights reserved


Introduction to Probability

Probability defines the likelihood of occurrence of an event

Probability can be defined as the ratio of the number of favorable outcomes to the total
number of outcomes of an event.

Probability can be defined as the ratio of the number of favorable outcomes to the total
number of outcomes of an event.

Copyright IntelliPaat, All rights reserved


Probability

For an experiment having 'n' number of outcomes, the number of favorable outcomes
can be denoted by x. The formula to calculate the probability of an event is as follows.

Probability(Event) = Favorable Outcomes


Total Outcomes

Copyright IntelliPaat, All rights reserved


India: +91-7847955955

US: 1-800-216-8930 (TOLL FREE)

[email protected]
[email protected]

24/7 Chat with Our Course Advisor

Copyright Intellipaat. All rights reserved.

You might also like