SS ZG536
ADVANCED STATISTICAL TECHNIQUES
FOR ANALYTICS
BITS Pilani Dr Y V K Ravi Kumar, BITS- Pilani
Pilani|Dubai|Goa|Hyderabad
BITS Pilani
Hyderabad Campus
L- 2: Descriptive Statistics
Today…..
Recall the past for a while_ Simple tools
Visualization of data
Basics of probability
Discussion & Problems on probability
Conditional probability
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 3
Visualization
Summary gives an idea about the data
summary(income)
Min 1st QU. Median Mean 3rd Qu Max
- 7.8 12.5 32.0 52.03 67.2 585
Visualization – why
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 4
Data Visualisation
Line chart
Bar chart
Histogram
Pie chart
Scatter plot
Box plot
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 5
Line Chart
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 6
Bar Chart
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 7
Histograms
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 8
Histograms
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 9
Histograms
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 10
Pie charts
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 11
Scatter Plot
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 12
Box plot
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 13
To conclude _ Visualization
Visualization gives a sense of data distribution and
relationship among variables
Visualization is an iterative process and helps answer
questions about the data. Time spent is not wasted during the
modelling process and helps to find the optimal model to fit
the data
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 14
!!!!
A famous statistician would never travel by airplane, because she had
studied air travel and estimated the probability of there being a bomb on
any given flight was 1 in a million, and she was not prepared to accept
these odds.
One day a colleague met her at a conference far from home.
"How did you get here, by train?"
"No, I flew"
"What about the possibility of a bomb?"
"Well, I began thinking that if the odds of one bomb are 1:million, then the
odds of TWO bombs are (1/1,000,000) x (1/1,000,000) = 10-12. This is a
very, very small probability, which I can accept. So, now I bring my own
bomb along!"
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 15
Random Experiment
Term "random experiment" is used to describe any action
whose outcome is not known in advance. Here are some
examples of experiments dealing with statistical data:
Tossing a coin
Counting how many times a certain word or a combination of
words appears in the text of the “King Lear” or in a text of
Confucius
counting occurrences of a certain combination of amino
acids in a protein database.
pulling a card from the deck
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 16
Sample Space
Discrete sample spaces.
Continuous sample spaces
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 17
Event
Independent events
Dependent events
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 18
Probability
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 19
Axioms of Probability
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 20
1
The sales manager of an e commerce company
says that 80% of those who visit their
website for the first time do not buy any
mobile. If a new customer visits the website,
what is the probability that the customer
would buy mobile
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 21
2
Blue Black Brown Total
Software prog
35 25 20 80
Project Mgrs
7 8 5 20
Total
42 33 25 100
If an employee is selected at random , what is the
probability that he is a software prog?
………………………………………………………..,what
is the probability that he is wearing a blue trouser
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 22
3
A Survey conducted by a bank revealed that 40% of the accounts are
savings accounts and 35% of the accounts are current accounts and
the balance are loan accounts.
What is the probability that an account taken at random is a loan
account ?
What is the probability that an account taken at random is NOT
savings account ?
What is the probability that an account taken at random is NOT a
current account
What is the probability that an account taken at random is a current
account or a loan account?
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 23
Thanks
Advanced Statistical Techniques for Analytics 01-12-2018 Slide 24