Regression Outliers

Uploaded by

Ross Zhou

REGRESSION OUTLIERS

1. Identification of Outliers
An outlier is an extreme observation. Typically, points further than, say, three or four
standard deviations from the mean are considered outliers. In regression, however,
the situation is somewhat more complex, in the sense that some outlying points will have
more influence on the regression than others. In JMPIN there is one diagnostic that can
be used to identify possibly influential outliers, known as Cook's Distance, or simply
Cook's D. Given a regression of $Y$ on $(x_1, \ldots, x_k)$ using the data set $(y_j, x_{1j}, \ldots, x_{kj})$, $j = 1, \ldots, n$, if

$s$ = estimated root mean square error,

$\hat{y}_j$ = regression estimate of the conditional mean $E(Y_j \mid x_{1j}, \ldots, x_{kj})$,

$\hat{y}_{j(i)}$ = regression estimate of the conditional mean $E(Y_j \mid x_{1j}, \ldots, x_{kj})$ with the
$i$th data point $(y_i, x_{1i}, \ldots, x_{ki})$ removed,

then Cook's Distance for point $i$ is given by

$$D_i = \frac{\sum_{j=1}^{n} \left( \hat{y}_j - \hat{y}_{j(i)} \right)^2}{(k+1)\, s^2}, \quad i = 1, \ldots, n$$
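The definition above can be checked numerically by refitting the regression once per deleted point. The following is a minimal sketch in plain Python, not the computation JMPIN performs internally. The helper names (ols_fit, predict, cooks_distance) and the small data set are hypothetical, and it assumes that s^2 is the residual sum of squares divided by n - (k + 1).

```python
def ols_fit(X, y):
    """Least-squares coefficients; each row of X starts with 1.0 for the intercept."""
    p = len(X[0])
    # Augmented normal equations [X'X | X'y].
    A = [[sum(r[a] * r[b] for r in X) for b in range(p)]
         + [sum(r[a] * yi for r, yi in zip(X, y))] for a in range(p)]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(p):
        piv = max(range(c, p), key=lambda r: abs(A[r][c]))
        A[c], A[piv] = A[piv], A[c]
        for r in range(p):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [u - f * v for u, v in zip(A[r], A[c])]
    return [A[c][p] / A[c][c] for c in range(p)]


def predict(X, beta):
    """Fitted conditional means for every row of X."""
    return [sum(b * v for b, v in zip(beta, row)) for row in X]


def cooks_distance(X, y):
    """Cook's D for each point, computed directly from the leave-one-out definition."""
    n, p = len(X), len(X[0])              # p = k + 1
    y_hat = predict(X, ols_fit(X, y))
    # Assumption: s^2 = SSE / (n - (k + 1)).
    s2 = sum((yi - fi) ** 2 for yi, fi in zip(y, y_hat)) / (n - p)
    d = []
    for i in range(n):
        X_wo = X[:i] + X[i + 1:]          # data with point i removed
        y_wo = y[:i] + y[i + 1:]
        # Predictions for ALL n points from the fit without point i.
        y_hat_i = predict(X, ols_fit(X_wo, y_wo))
        d.append(sum((a - b) ** 2 for a, b in zip(y_hat, y_hat_i)) / (p * s2))
    return d


# Hypothetical data: y is roughly 2x, except for the last observation.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1, 12.2, 13.8, 30.0]
X = [[1.0, xi] for xi in x]               # k = 1 predictor plus intercept

d = cooks_distance(X, y)
for i, di in enumerate(d, start=1):
    print(f"point {i}: D = {di:.3f}")
```

Refitting n times is wasteful for large data sets, but it mirrors the formula term by term, which makes it a convenient cross-check on values saved from a statistics package.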

Intuitively, $D_i$ is a normalized measure of the influence of point $i$ on all predicted mean
values, $\hat{y}_j$, $j = 1, \ldots, n$. Cook's D can be obtained using Fit Model in JMPIN as follows:
(i) Right-click on the heading of the Parameter Estimates table.
(ii) Select the Save Columns option, and click on Cook's D Influence.
(iii) A new data column will appear, containing the Cook's D Influence values.
To identify potential outliers, one Rule of Thumb is to treat point i as an outlier when:

$$D_i > \frac{4}{n - (k+1)}$$

As with all Rules of Thumb, this provides only a rough guideline (and often tends to
identify too many points as potential outliers). The best strategy is to look at the
distribution of Cook's D values and see whether there are any conspicuously large values
relative to the others. If these values are roughly of the magnitude $4/(n - k - 1)$ or larger,
then they are worth investigating further.
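As a rough illustration of this screening step, the sketch below ranks a column of Cook's D values and marks those above the 4/(n - k - 1) guideline. The function name flag_influential and the D values are hypothetical, made up for illustration only. In practice the values would come from the saved Cook's D Influence column.

```python
def flag_influential(cooks_d, k):
    """Rank Cook's D values and flag those above the 4 / (n - k - 1) guideline."""
    n = len(cooks_d)
    threshold = 4.0 / (n - (k + 1))
    # Largest values first, so conspicuous points stand out when printed.
    ranked = sorted(enumerate(cooks_d), key=lambda t: t[1], reverse=True)
    flagged = [i for i, d in ranked if d > threshold]
    return threshold, ranked, flagged


# Hypothetical Cook's D values for n = 8 points from a regression with k = 1.
cooks_d = [0.18, 0.02, 0.00, 0.02, 0.05, 0.15, 0.32, 2.14]

threshold, ranked, flagged = flag_influential(cooks_d, k=1)
print(f"guideline threshold: {threshold:.3f}")
for i, d in ranked:
    mark = "  <-- worth investigating" if i in flagged else ""
    print(f"point {i + 1}: D = {d:.2f}{mark}")
```

Printing the full ranking, rather than only the flagged points, keeps the emphasis on how far the largest values sit from the rest and not on the cutoff alone.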
2. Treatment of Outliers

The key point to stress here is that the above procedure can only serve to identify points
that are suspicious from a statistical perspective. It does not mean that these points should
automatically be eliminated! The removal of data points can be dangerous. While this
will always improve the fit of your regression, it may end up destroying some of the
most important information in your data.
Hence the first question that should be asked is whether there exists some substantive
information about these points that suggests that they should be removed. Do they
involve special properties or circumstances not relevant for the situation under
investigation? Do they involve possible measurement errors? If no such distinguishing
features can be found, then there are no clear grounds for eliminating outliers.
An alternative approach is to perform the regression both with and without these outliers,
and examine their specific influence on the results. If this influence is minor, then it may
not matter whether or not they are omitted. On the other hand, if their influence is
substantial, then it is probably best to present the results of both analyses, and simply
alert the reader to the fact that these points may be questionable.
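The with-and-without comparison can be sketched as follows for the simplest case of one predictor (k = 1). The function names, the data, and the choice of which point is treated as suspect are all hypothetical. The sketch only shows the mechanics of the comparison, not a rule for deciding what to drop.

```python
def fit_line(x, y):
    """Ordinary least squares for y = a + b*x; returns (intercept, slope)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope


def compare_fits(x, y, suspect):
    """Fit once on all points and once with the suspect indices left out."""
    keep = [j for j in range(len(x)) if j not in suspect]
    full = fit_line(x, y)
    reduced = fit_line([x[j] for j in keep], [y[j] for j in keep])
    return full, reduced


# Hypothetical data; the last point (index 7) is treated as the suspect one.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1, 12.2, 13.8, 30.0]

full, reduced = compare_fits(x, y, suspect={7})
print(f"all points:      intercept = {full[0]:.3f}, slope = {full[1]:.3f}")
print(f"without suspect: intercept = {reduced[0]:.3f}, slope = {reduced[1]:.3f}")
print(f"change in slope: {full[1] - reduced[1]:.3f}")
```

If the two sets of estimates differ substantially, as they do for this made-up data, both can be reported side by side.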
