Sample Intro Statistics Intuitive Guide
Sample Intro Statistics Intuitive Guide
Statistics
AN INTUITIVE GUIDE FOR ANALYZING DATA
AND UNLOCKING DISCOVERIES
Jim Frost
Copyright © 2019 by Jim Frost.
Ordering Information:
3
Contents
i
iii
v
INTRODUCTION
Prepare for an
Adventure!
NOTE: This sample contains only the introduction and first chapter.
Please buy the full ebook for all the content listed in the Table of Con-
tents. You can buy it in My Store.
I love statistics and analyzing data! I also love talking and writing
about it. I was a researcher at a major university. Then, I spent over a
decade working at a major statistical software company. During my
time at the software company, I learned how to present statistics in a
manner that makes it more intuitive. I want you to understand the es-
sential concepts, practices, and knowledge for statistics so you can an-
alyze your data confidently. That’s the goal of my book.
Over the years, I've literally received thousands of requests from as-
piring data scientists for statistical guidance. This book is my answer -
years of knowledge and thousands of hours of hard work distilled into
a thorough, practical guide for understanding statistics.
I hope this book helps you see statistics through my eyes―as a key that
can unlock discoveries trapped in your data. Discovery is a bumpy
road. There might be statistical challenges en route, but they can be
7
Jim Frost
I hope you’ll view this book as a voyage. We start with the practical
nuts and bolts about the field of statistics and data analysis. Then we
move to the broader context of the process and challenges of using
statistics to expand human knowledge. Now, that’s an adventure in
statistics!
In this book, you'll learn about the many facets of the field of statistics.
We’ll start with an overview of statistics and why it is so important.
Then, we’ll move on to the necessary skills and knowledge about the
different types of data, and how to summarize data both graphically
and numerically. Importantly, I’ll show you how to discover relation-
ships between your variables. After that, we go to the next level and
learn about populations and how to estimate their properties using
samples.
Finally, we look at how statistics are part of the scientific method and
help expand the scope of human knowledge. This process includes de-
signing experiments that identify causation, rather than mere correla-
tion, by accounting for challenges that all studies face. The book closes
with criteria you can use to evaluate the quality of both the data and
the design of the experiment itself. These criteria are useful for cri-
tiquing your research or other researchers' experiments.
Reading this book will allow you to take your skills to the next level.
You’ll be able to proceed onto more advanced statistical studies such
as hypothesis testing, regression analysis, and experimental design.
Alternatively, you’ll have a solid foundation for a career in data sci-
ence and machine learning.
You’ll notice that there are not many equations in this book. After all,
you should let your statistical software handle the calculations while
8
Introduction to Statistics: An Intuitive Guide
Please note that throughout this book I use Minitab statistical soft-
ware. However, this book is not about teaching particular software but
rather how to perform basic graphing and analysis. All common sta-
tistical software packages should be able to perform the analyses that
I show. There is nothing in here that is unique to Minitab.
9
CHAPTER 1
The Importance of
Statistics
Statistics are everywhere today. You’ll run across them in your every-
day life. Four of five dentists prefer this toothpaste! On the news. This
scientific poll used random sampling and has a margin of error of 3%.
In school. Your grades were averages of scores. Perhaps you were
graded on a curve? Then, you took the SAT, ACT, or GRE and received
your results with percentiles. You’ll find statistics in almost every as-
pect of life: Nielsen TV ratings, surveys, political polls, insurance
rates, and so on. And then, of course, there are the ubiquitous sports
statistics, batting averages, third-down conversions, and countless
more! Politicians and policy wonks use statistics to make their cases.
Companies use statistics to use data to make decisions and increase
their profits.
10
Introduction to Statistics: An Intuitive Guide
Let’s start by covering two main reasons why studying the field of sta-
tistics is crucial in modern society. First, statisticians are guides for
learning from data and navigating common problems that can lead you
to incorrect conclusions. Second, given the growing importance of de-
cisions and opinions based on data, it’s crucial that you can critically
assess the quality of analyses that others present to you.
11
Jim Frost
12
Introduction to Statistics: An Intuitive Guide
13
Jim Frost
“The best thing about being a statistician is that you get to play in eve-
ryone else’s backyard.”
The world today produces more data and more analyses designed to
influence you than ever before. Are you ready for it?
14
Introduction to Statistics: An Intuitive Guide
• Wow! I took this supplement and lost a lot of weight! This pill
must work!
• I know someone who smoked for decades, and it never pro-
duced any significant illness. Those claims about smoking are
exaggerated!
• This anti-aging cream took years off. It must be the best!
I’m sure you’ve heard that you can’t trust anecdotal evidence. How-
ever, we still ask our friends for recommendations about restaurants,
travel destinations, auto mechanics, and so on. The tricky thing about
anecdotal evidence is that even when an individual story is true, it can
still be entirely misleading. How does that work?
15
Jim Frost
A quick look at the table should be enough to convince you that anec-
dotal evidence is not trustworthy! However, it’s even worse thanks to
psychological factors that prime us for believing these stories.
16
Introduction to Statistics: An Intuitive Guide
ten people who took the supplement and didn’t lose weight. We also
don’t know what else Fred might be doing to lose weight.
Next, I’ll illustrate the problems graphically and explain how statistics
and the scientific method deals with them.
The graph below displays the results from anecdotal stories of people
who took a hypothetical weight loss supplement. Think of this chart
as a summary of the results presented in a TV commercial. We’ll even
assume that these people are telling the complete truth. The supple-
ment looks effective, right? They’ve lost a lot of weight! When you
see the individuals and hear their emotional stories about weight loss,
we want to believe that the supplement worked.
Regrettably, the graph doesn’t provide the full story. Remember, an-
ecdotal evidence uses small non-random samples that aren’t general-
izable beyond the sample. The individuals might have been cherry-
picked for their narratives, or perhaps they presented the tales on
17
Jim Frost
In this graph, blue dots represent the supplement takers, and red dots
represent those who didn’t take the supplement.
These results are not as impressive as the other graph. Some who took
the supplement lost the weight shown in the TV ad, but many more
lost much less weight. Those people didn’t come forward with their
less exciting stories!
18
Introduction to Statistics: An Intuitive Guide
Furthermore, those who did not take the supplement fit the same pat-
tern as those who did. Collectively, taking the supplement didn’t pro-
duce more significant weight loss than the control group.
19
Jim Frost
weight loss) and being able to generalize the results beyond the origi-
nal sample requires the researchers to use various data collecting pro-
cedures and experimental designs.
The first portion covers the essential tools of the trade, including data
types, summarizing the data, and identifying relationships between
variables. I’ll describe the types of data and how to present them
graphically. Learn how summary statistics represent an entire dataset
and describe where an observation falls within it. These statistics in-
clude measures of central tendency, measures of variability, percen-
tiles, and correlation. Then, we’ll move onto probability distributions.
Probability distributions help you understand the distribution of val-
ues and calculate probabilities. We’ll pay extra attention to the crucial
Normal Distribution.
20
Introduction to Statistics: An Intuitive Guide
The second portion covers the practices and procedures for inferen-
tial statistics and using statistics in the scientific process. Inferential
statistics allow you to use a relatively small sample to learn about an
entire population. However, making this leap from a sample to the
population requires additional procedures and methodologies. I’ll also
cover how the field of statistics fits in with the scientific method along
with the essentials of designing experiments to answer questions.
NOTE: This sample contains only the introduction and first chapter.
Please buy the full ebook for all the content listed in the Table of Con-
tents. You can buy it in My Store.
21