0% found this document useful (0 votes)

7 views

Exploratory_data_analysis

Exploratory data analysis (EDA) is a statistical approach for analyzing data sets to summarize their main characteristics, often through visualization methods, and contrasts with traditional hypothesis testing. Promoted by John Tukey since 1970, EDA encourages the exploration of data to formulate hypotheses and improve data collection. It encompasses various techniques and tools, including graphical methods and statistical programming languages, to facilitate unexpected discoveries and support further analysis.

Uploaded by

celikcelik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views

Exploratory_data_analysis

Uploaded by

celikcelik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Exploratory data analysis

In statistics, exploratory data analysis (EDA) is an approach of analyzing data sets to summarize their
main characteristics, often using statistical graphics and other data visualization methods. A statistical
model can be used or not, but primarily EDA is for seeing what the data can tell beyond the formal
modeling and thereby contrasts with traditional hypothesis testing, in which a model is supposed to be
selected before the data is seen. Exploratory data analysis has been promoted by John Tukey since 1970
to encourage statisticians to explore the data, and possibly formulate hypotheses that could lead to new
data collection and experiments. EDA is different from initial data analysis (IDA),[1][2] which focuses
more narrowly on checking assumptions required for model fitting and hypothesis testing, and handling
missing values and making transformations of variables as needed. EDA encompasses IDA.

Overview
Tukey defined data analysis in 1961 as: "Procedures for analyzing data, techniques for interpreting the
results of such procedures, ways of planning the gathering of data to make its analysis easier, more
precise or more accurate, and all the machinery and results of (mathematical) statistics which apply to
analyzing data."[3]

Exploratory data analysis is a technique to analyze and investigate a dataset and summarize its main
characteristics. A main advantage of EDA is providing the visualization of data after conducting analysis.

Tukey's championing of EDA encouraged the development of statistical computing packages, especially
S at Bell Labs.[4] The S programming language inspired the systems S-PLUS and R. This family of
statistical-computing environments featured vastly improved dynamic visualization capabilities, which
allowed statisticians to identify outliers, trends and patterns in data that merited further study.

Tukey's EDA was related to two other developments in statistical theory: robust statistics and
nonparametric statistics, both of which tried to reduce the sensitivity of statistical inferences to errors in
formulating statistical models. Tukey promoted the use of five number summary of numerical data—the
two extremes (maximum and minimum), the median, and the quartiles—because these median and
quartiles, being functions of the empirical distribution are defined for all distributions, unlike the mean
and standard deviation. Moreover, the quartiles and median are more robust to skewed or heavy-tailed
distributions than traditional summaries (the mean and standard deviation). The packages S, S-PLUS, and
R included routines using resampling statistics, such as Quenouille and Tukey's jackknife and Efron 's
bootstrap, which are nonparametric and robust (for many problems).

Exploratory data analysis, robust statistics, nonparametric statistics, and the development of statistical
programming languages facilitated statisticians' work on scientific and engineering problems. Such
problems included the fabrication of semiconductors and the understanding of communications networks,
both of which were of interest to Bell Labs. These statistical developments, all championed by Tukey,
were designed to complement the analytic theory of testing statistical hypotheses, particularly the
Laplacian tradition's emphasis on exponential families.[5]
Development
John W. Tukey wrote the book Exploratory
Data Analysis in 1977.[6] Tukey held that too
much emphasis in statistics was placed on
statistical hypothesis testing (confirmatory
data analysis); more emphasis needed to be
placed on using data to suggest hypotheses to
test. In particular, he held that confusing the
two types of analyses and employing them on
the same set of data can lead to systematic
bias owing to the issues inherent in testing
hypotheses suggested by the data.

The objectives of EDA are to:

Data science process flowchart
Enable unexpected discoveries in the
data
Suggest hypotheses about the causes of observed phenomena
Assess assumptions on which statistical inference will be based
Support the selection of appropriate statistical tools and techniques
Provide a basis for further data collection through surveys or experiments[7]
Many EDA techniques have been adopted into data mining. They are also being taught to young students
as a way to introduce them to statistical thinking.[8]

Techniques and tools

There are a number of tools that are useful for EDA, but EDA is characterized more by the attitude taken
than by particular techniques.[9]

Typical graphical techniques used in EDA are:

Box plot
Histogram
Multi-vari chart
Run chart
Pareto chart
Scatter plot (2D/3D)
Stem-and-leaf plot
Parallel coordinates
Odds ratio
Targeted projection pursuit
Heat map
Bar chart
Horizon graph
Glyph-based visualization methods such as PhenoPlot[10] and Chernoff faces
Projection methods such as grand tour, guided tour and manual tour
Interactive versions of these plots
Dimensionality reduction:

Multidimensional scaling
Principal component analysis (PCA)
Multilinear PCA
Nonlinear dimensionality reduction (NLDR)
Iconography of correlations
Typical quantitative techniques are:

Median polish
Trimean
Ordination

History
Many EDA ideas can be traced back to earlier authors, for example:

Francis Galton emphasized order statistics and quantiles.

Arthur Lyon Bowley used precursors of the stemplot and five-number summary (Bowley
actually used a "seven-figure summary", including the extremes, deciles and quartiles, along
with the median—see his Elementary Manual of Statistics (3rd edn., 1920), p. 62[11]– he
defines "the maximum and minimum, median, quartiles and two deciles" as the "seven
positions").
Andrew Ehrenberg articulated a philosophy of data reduction (see his book of the same
name).
The Open University course Statistics in Society (MDST 242), took the above ideas and merged them
with Gottfried Noether's work, which introduced statistical inference via coin-tossing and the median test.

Example
Findings from EDA are orthogonal to the primary analysis task. To illustrate, consider an example from
Cook et al. where the analysis task is to find the variables which best predict the tip that a dining party
will give to the waiter.[12] The variables available in the data collected for this task are: the tip amount,
total bill, payer gender, smoking/non-smoking section, time of day, day of the week, and size of the party.
The primary analysis task is approached by fitting a regression model where the tip rate is the response
variable. The fitted model is

(tip rate) = 0.18 - 0.01 × (party size)

which says that as the size of the dining party increases by one person (leading to a higher bill), the tip
rate will decrease by 1%, on average.

However, exploring the data reveals other interesting features not described by this model.
Histogram of tip amounts Histogram of tip amounts Scatterplot of tips Scatterplot of tips
where the bins cover $1 where the bins cover vs. bill. Points vs. bill separated
increments. The $0.10 increments. An below the line by payer gender
distribution of values is interesting phenomenon correspond to tips and smoking
skewed right and is visible: peaks occur at that are lower than section status.
unimodal, as is common the whole-dollar and half- expected (for that Smoking parties
in distributions of small, dollar amounts, which is bill amount), and have a lot more
non-negative quantities. caused by customers points above the variability in the
picking round numbers as line are higher tips that they
tips. This behavior is than expected. We give. Males tend
common to other types of might expect to to pay the (few)
purchases too, like see a tight, higher bills, and
gasoline. positive linear the female non-
association, but smokers tend to
instead see be very
variation that consistent
increases with tip tippers (with
amount. In three
particular, there conspicuous
are more points exceptions
far away from the shown in the
line in the lower sample).
right than in the
upper left,
indicating that
more customers
are very cheap
than very
generous.

What is learned from the plots is different from what is illustrated by the regression model, even though
the experiment was not designed to investigate any of these other trends. The patterns found by exploring
the data suggest hypotheses about tipping that may not have been anticipated in advance, and which
could lead to interesting follow-up experiments where the hypotheses are formally stated and tested by
collecting new data.

Software
JMP, an EDA package from SAS Institute.
KNIME, Konstanz Information Miner – Open-Source data exploration platform based on
Eclipse.
Minitab, an EDA and general statistics package widely used in industrial and corporate
settings.
Orange, an open-source data mining and machine learning software suite.
Python, an open-source programming language widely used in data mining and machine
learning.
R, an open-source programming language for statistical computing and graphics. Together
with Python one of the most popular languages for data science.
TinkerPlots an EDA software for upper elementary and middle school students.
Weka an open source data mining package that includes visualization and EDA tools such
as targeted projection pursuit.

See also
Anscombe's quartet, on importance of exploration
Data dredging
Predictive analytics
Structured data analysis (statistics)
Configural frequency analysis
Descriptive statistics

References
1. Chatfield, C. (1995). Problem Solving: A Statistician's Guide (2nd ed.). Chapman and Hall.
ISBN 978-0412606304.
2. Baillie, Mark; Le Cessie, Saskia; Schmidt, Carsten Oliver; Lusa, Lara; Huebner, Marianne;
Topic Group "Initial Data Analysis" of the STRATOS Initiative (2022). "Ten simple rules for
initial data analysis" (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870512). PLOS
Computational Biology. 18 (2): e1009819. Bibcode:2022PLSCB..18E9819B (https://ui.adsab
s.harvard.edu/abs/2022PLSCB..18E9819B). doi:10.1371/journal.pcbi.1009819 (https://doi.or
g/10.1371%2Fjournal.pcbi.1009819). PMC 8870512 (https://www.ncbi.nlm.nih.gov/pmc/artic
les/PMC8870512). PMID 35202399 (https://pubmed.ncbi.nlm.nih.gov/35202399).
3. John Tukey-The Future of Data Analysis-July 1961 (http://projecteuclid.org/download/pdf_1/
euclid.aoms/1177704711)
4. Becker, Richard A., A Brief History of S (https://web.archive.org/web/20150723044213/http://
www2.research.att.com/areas/stat/doc/94.11.ps), Murray Hill, New Jersey: AT&T Bell
Laboratories, archived from the original (http://www2.research.att.com/areas/stat/doc/94.11.
ps) (PS) on 2015-07-23, retrieved 2015-07-23, "... we wanted to be able to interact with our
data, using Exploratory Data Analysis (Tukey, 1971) techniques."
5. Morgenthaler, Stephan; Fernholz, Luisa T. (2000). "Conversation with John W. Tukey and
Elizabeth Tukey, Luisa T. Fernholz and Stephan Morgenthaler" (https://doi.org/10.1214%2Fs
s%2F1009212675). Statistical Science. 15 (1): 79–94. doi:10.1214/ss/1009212675 (https://d
oi.org/10.1214%2Fss%2F1009212675).
6. Tukey, John W. (1977). Exploratory Data Analysis. Pearson. ISBN 978-0201076165.
7. Behrens-Principles and Procedures of Exploratory Data Analysis-American Psychological
Association-1997 (https://web.archive.org/web/20170808064326/cll.stanford.edu/~willb/cour
se/behrens97pm.pdf)
8. Konold, C. (1999). "Statistics goes to school". Contemporary Psychology. 44 (1): 81–82.
doi:10.1037/001949 (https://doi.org/10.1037%2F001949).
9. Tukey, John W. (1980). "We need both exploratory and confirmatory". The American
Statistician. 34 (1): 23–25. doi:10.1080/00031305.1980.10482706 (https://doi.org/10.1080%
2F00031305.1980.10482706).
10. Sailem, Heba Z.; Sero, Julia E.; Bakal, Chris (2015-01-08). "Visualizing cellular imaging data
using PhenoPlot" (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4354266). Nature
Communications. 6 (1): 5825. Bibcode:2015NatCo...6.5825S (https://ui.adsabs.harvard.edu/
abs/2015NatCo...6.5825S). doi:10.1038/ncomms6825 (https://doi.org/10.1038%2Fncomms
6825). ISSN 2041-1723 (https://search.worldcat.org/issn/2041-1723). PMC 4354266 (http
s://www.ncbi.nlm.nih.gov/pmc/articles/PMC4354266). PMID 25569359 (https://pubmed.ncbi.
nlm.nih.gov/25569359).
11. Elementary Manual of Statistics (3rd edn.,
1920)https://archive.org/details/cu31924013702968/page/n5
12. Cook, D. and Swayne, D.F. (with A. Buja, D. Temple Lang, H. Hofmann, H. Wickham, M.
Lawrence) (2007) "Interactive and Dynamic Graphics for Data Analysis: With R and GGobi"
Springer, 978-0387717616

Bibliography
Andrienko, N & Andrienko, G (2005) Exploratory Analysis of Spatial and Temporal Data. A
Systematic Approach. Springer. ISBN 3-540-25994-5
Cook, D. and Swayne, D.F. (with A. Buja, D. Temple Lang, H. Hofmann, H. Wickham, M.
Lawrence) (2007-12-12). Interactive and Dynamic Graphics for Data Analysis: With R and
GGobi. Springer. ISBN 9780387717616.
Cook, D. and Swayne, D.F. (with A. Buja, D. Temple Lang, H. Hofmann, H. Wickham, M.
Lawrence) (2007-12-12). Interactive and Dynamic Graphics for Data Analysis: With R and
GGobi. Springer. ISBN 9780387717616.
Hoaglin, D C; Mosteller, F & Tukey, John Wilder (Eds) (1985). Exploring Data Tables, Trends
and Shapes. ISBN 978-0-471-09776-1.
Hoaglin, D C; Mosteller, F & Tukey, John Wilder (Eds) (1983). Understanding Robust and
Exploratory Data Analysis. ISBN 978-0-471-09777-8.
Young, F. W. Valero-Mora, P. and Friendly M. (2006) Visual Statistics: Seeing your data with
Dynamic Interactive Graphics. Wiley ISBN 978-0-471-68160-1 Jambu M. (1991) Exploratory
and Multivariate Data Analysis. Academic Press ISBN 0123800900
S. H. C. DuToit, A. G. W. Steyn, R. H. Stumpf (1986) Graphical Exploratory Data Analysis.
Springer ISBN 978-1-4612-9371-2
Hoaglin, D C; Mosteller, F & Tukey, John Wilder (Eds) (1985). Exploring Data Tables, Trends
and Shapes (https://archive.org/details/exploringdatatab0000unse). Wiley. ISBN 978-0-471-
09776-1.
Hoaglin, D C; Mosteller, F & Tukey, John Wilder (Eds) (1983). Understanding Robust and
Exploratory Data Analysis. Wiley. ISBN 978-0-471-09777-8.
Inselberg, Alfred (2009). Parallel Coordinates:Visual Multidimensional Geometry and its
Applications. London New York: Springer. ISBN 978-0-387-68628-8.
Leinhardt, G., Leinhardt, S., Exploratory Data Analysis: New Tools for the Analysis of
Empirical Data (https://journals.sagepub.com/doi/pdf/10.3102/0091732X008001085),
Review of Research in Education, Vol. 8, 1980 (1980), pp. 85–157.
Martinez, W. L.; Martinez, A. R. & Solka, J. (2010). Exploratory Data Analysis with MATLAB,
second edition. Chapman & Hall/CRC. ISBN 9781439812204.
Theus, M., Urbanek, S. (2008), Interactive Graphics for Data Analysis: Principles and
Examples, CRC Press, Boca Raton, FL, ISBN 978-1-58488-594-8
Tucker, L; MacCallum, R. (1993). Exploratory Factor Analysis (http://www.unc.edu/~rcm/boo
k/factornew.htm).
Tukey, John Wilder (1977). Exploratory Data Analysis (https://archive.org/details/exploratory
dataa00tuke_0). Addison-Wesley. ISBN 978-0-201-07616-5.
Velleman, P. F.; Hoaglin, D. C. (1981). Applications, Basics and Computing of Exploratory
Data Analysis (https://archive.org/details/applicationsbasi00vell). Duxbury Press. ISBN 978-
0-87150-409-8.
Young, F. W. Valero-Mora, P. and Friendly M. (2006) Visual Statistics: Seeing your data with
Dynamic Interactive Graphics (http://www.uv.es/visualstats/Book). Wiley ISBN 978-0-471-
68160-1
Jambu M. (1991) Exploratory and Multivariate Data Analysis (http://www.sciencedirect.com/
science/book/9780123800909). Academic Press ISBN 0123800900
S. H. C. DuToit, A. G. W. Steyn, R. H. Stumpf (1986) Graphical Exploratory Data Analysis (ht
tps://link.springer.com/book/10.1007%2F978-1-4612-4950-4). Springer ISBN 978-1-4612-
9371-2

External links
Carnegie Mellon University – free online course on Probability and Statistics, with a module
on EDA (https://oli.cmu.edu/courses/free-open/statistics-course-details/)
• Exploratory data analysis chapter: engineering statistics handbook (https://www.itl.nist.gov/
div898/handbook/eda/eda.htm)

Retrieved from "https://en.wikipedia.org/w/index.php?title=Exploratory_data_analysis&oldid=1269626762"

Time Series with Python: How to Implement Time Series Analysis and Forecasting Using Python
From Everand
Time Series with Python: How to Implement Time Series Analysis and Forecasting Using Python
Bob Mather
3/5 (1)
E Ticket Receipt
No ratings yet
E Ticket Receipt
1 page
Credit Transactions
100% (4)
Credit Transactions
12 pages
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
7 pages
Exploratory Data Analysis Stephan Morgenthaler (2009)
100% (2)
Exploratory Data Analysis Stephan Morgenthaler (2009)
12 pages
Datamining and Analytics Unit V
No ratings yet
Datamining and Analytics Unit V
102 pages
DSV Module-2
No ratings yet
DSV Module-2
23 pages
Module 2
No ratings yet
Module 2
81 pages
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
9 pages
What Is EDA
No ratings yet
What Is EDA
2 pages
Lecture W6 EDA
No ratings yet
Lecture W6 EDA
28 pages
Eda 2022 04 11 09352244
No ratings yet
Eda 2022 04 11 09352244
35 pages
Exploratory Data Analysis in ML
No ratings yet
Exploratory Data Analysis in ML
7 pages
John W. Tukey y El Análisis de Datos - Descarga Gratuita de PDF
No ratings yet
John W. Tukey y El Análisis de Datos - Descarga Gratuita de PDF
8 pages
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
23 pages
Unit 3 Ids Notes
No ratings yet
Unit 3 Ids Notes
31 pages
Unit-1
No ratings yet
Unit-1
52 pages
EDA
No ratings yet
EDA
3 pages
Chapter 7 SQQS1033
No ratings yet
Chapter 7 SQQS1033
37 pages
Unit 5 Exploratory Data Analysis (EDA)
100% (1)
Unit 5 Exploratory Data Analysis (EDA)
41 pages
07_EDA
No ratings yet
07_EDA
5 pages
1 Eda
100% (2)
1 Eda
822 pages
Data Sciecnce
No ratings yet
Data Sciecnce
16 pages
CHAPTER 1 (Exploratory Data Analysis)
No ratings yet
CHAPTER 1 (Exploratory Data Analysis)
789 pages
Exploratory Data Analysis unit 2
No ratings yet
Exploratory Data Analysis unit 2
39 pages
Exploratory Data Analysis
100% (3)
Exploratory Data Analysis
791 pages
DS Lecture 15
No ratings yet
DS Lecture 15
44 pages
Exploratory Data Analysis PDF
100% (4)
Exploratory Data Analysis PDF
791 pages
FDS Unit 2
No ratings yet
FDS Unit 2
15 pages
John W. Tukey and Data Analysis: David C. Hoaglin
No ratings yet
John W. Tukey and Data Analysis: David C. Hoaglin
8 pages
Exploratory Data Analysis-Engineering Statistics Handbook
100% (2)
Exploratory Data Analysis-Engineering Statistics Handbook
790 pages
Exploratory Data Analysis Engineering Statistics Handbook PDF
100% (1)
Exploratory Data Analysis Engineering Statistics Handbook PDF
790 pages
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
13 pages
1 Exploratory Data Analysis
100% (1)
1 Exploratory Data Analysis
786 pages
Group-7
No ratings yet
Group-7
19 pages
Exploratory Data Analysis-Engineering Statistics Handbook NIST 2002
100% (1)
Exploratory Data Analysis-Engineering Statistics Handbook NIST 2002
804 pages
Exploratory Data Analysis (Eda) : Niraj Poudyal, PHD Kathmandu University School of Arts
No ratings yet
Exploratory Data Analysis (Eda) : Niraj Poudyal, PHD Kathmandu University School of Arts
54 pages
Role of Statistics in Data Science
No ratings yet
Role of Statistics in Data Science
8 pages
Data Science Process
No ratings yet
Data Science Process
30 pages
Engineering Statistics Handbook 2003
No ratings yet
Engineering Statistics Handbook 2003
1,522 pages
827b551be7606030c4c1ca693fb54a0ed875
No ratings yet
827b551be7606030c4c1ca693fb54a0ed875
12 pages
Exploratory data analysis in the context of data mining and resampling
No ratings yet
Exploratory data analysis in the context of data mining and resampling
6 pages
DOC-20250125-WA0000.
No ratings yet
DOC-20250125-WA0000.
15 pages
eda1
No ratings yet
eda1
25 pages
EDA
No ratings yet
EDA
9 pages
Exploratory Data Analysis & Data Preprocessing
No ratings yet
Exploratory Data Analysis & Data Preprocessing
16 pages
C21_SMA_EXP4[1]
No ratings yet
C21_SMA_EXP4[1]
12 pages
EDA Lecture notes
No ratings yet
EDA Lecture notes
205 pages
Module 2 ExploratoryDataAnalysis
No ratings yet
Module 2 ExploratoryDataAnalysis
22 pages
IOT Domain
No ratings yet
IOT Domain
70 pages
Unit 3
No ratings yet
Unit 3
47 pages
Statistic Course
No ratings yet
Statistic Course
37 pages
Fundamentals of Data Source and Preparation For ML v31
No ratings yet
Fundamentals of Data Source and Preparation For ML v31
45 pages
Data Analysis
No ratings yet
Data Analysis
17 pages
Module 2 PPT
No ratings yet
Module 2 PPT
78 pages
Data Exploration and Visualization
100% (1)
Data Exploration and Visualization
281 pages
EDA Module1 Full Answers
No ratings yet
EDA Module1 Full Answers
5 pages
Machine Learning - A Complete Exploration of Highly Advanced Machine Learning Concepts, Best Practices and Techniques: 4
From Everand
Machine Learning - A Complete Exploration of Highly Advanced Machine Learning Concepts, Best Practices and Techniques: 4
Peter Bradley
No ratings yet
Descriptive Statistics: Six Sigma Thinking, #3
From Everand
Descriptive Statistics: Six Sigma Thinking, #3
Sumeet Savant
No ratings yet
Data Analytics
From Everand
Data Analytics
Jeffery Short
1/5 (1)
Statistics: Practical Concept of Statistics for Data Scientists
From Everand
Statistics: Practical Concept of Statistics for Data Scientists
John Slavio
No ratings yet
Gale Researcher Guide for: Econometric Models
From Everand
Gale Researcher Guide for: Econometric Models
Chupp
No ratings yet
maths U II ( PART 2)
No ratings yet
maths U II ( PART 2)
17 pages
Catalogo PC4000 PDF
100% (1)
Catalogo PC4000 PDF
8 pages
Database Systems Concepts: Taiz University, 2020
No ratings yet
Database Systems Concepts: Taiz University, 2020
29 pages
Aftershock - Beyond The Civil War
No ratings yet
Aftershock - Beyond The Civil War
4 pages
Pragmatics & Discourse Studies 2024
No ratings yet
Pragmatics & Discourse Studies 2024
65 pages
RRL Partial
No ratings yet
RRL Partial
9 pages
Online Answers To Homework Questions
100% (1)
Online Answers To Homework Questions
7 pages
Nabakem Mega Check Cleaner MSDS
No ratings yet
Nabakem Mega Check Cleaner MSDS
8 pages
MODULE 3 (EDPM)-GOKUL
No ratings yet
MODULE 3 (EDPM)-GOKUL
12 pages
Flipkart-Labels-10-Dec-2024-06-41
No ratings yet
Flipkart-Labels-10-Dec-2024-06-41
8 pages
Eric Helms - Nutrition - Calculator
No ratings yet
Eric Helms - Nutrition - Calculator
16 pages
unit 10 1-2
No ratings yet
unit 10 1-2
6 pages
12 Image Reconstruction
No ratings yet
12 Image Reconstruction
20 pages
Abator Mic
No ratings yet
Abator Mic
70 pages
Intro Macro Tutorial 9 (Week 10) : Review Questions
No ratings yet
Intro Macro Tutorial 9 (Week 10) : Review Questions
2 pages
KOSO Kent Introl Subsea Chokes and Control Valves For Well Control and Seafloor Processing Applications
100% (1)
KOSO Kent Introl Subsea Chokes and Control Valves For Well Control and Seafloor Processing Applications
16 pages
SLF103 ApplicationRefundExcessOverpaymentSTL V02
67% (3)
SLF103 ApplicationRefundExcessOverpaymentSTL V02
1 page
functional-movement-assessment-lower-extremity
No ratings yet
functional-movement-assessment-lower-extremity
3 pages
Reflective Synthesis Paper
No ratings yet
Reflective Synthesis Paper
2 pages
Comparative Analysis NVMe vs SATA
No ratings yet
Comparative Analysis NVMe vs SATA
3 pages
PRPA - Trucking Company List
No ratings yet
PRPA - Trucking Company List
13 pages
Be A Pioneer - Script
No ratings yet
Be A Pioneer - Script
11 pages
Data Communications
No ratings yet
Data Communications
30 pages
Learning Plan: Great Shepherd Christian Academy Asian Foundation, Inc
No ratings yet
Learning Plan: Great Shepherd Christian Academy Asian Foundation, Inc
4 pages
Luv N' Care v. Regent - Order Granting MSJ
No ratings yet
Luv N' Care v. Regent - Order Granting MSJ
19 pages
Dynamic ARP Inspection: DAI in A DHCP Environment
No ratings yet
Dynamic ARP Inspection: DAI in A DHCP Environment
2 pages
Atpg Question Answer: Name: Meet Zankat
100% (2)
Atpg Question Answer: Name: Meet Zankat
7 pages
MPPT Charge Controller For Solar Stand Alone System: Project Guide Team Members
No ratings yet
MPPT Charge Controller For Solar Stand Alone System: Project Guide Team Members
13 pages

Exploratory_data_analysis

Uploaded by

Exploratory_data_analysis

Uploaded by

Exploratory data analysis

The objectives of EDA are to:

Techniques and tools

Typical graphical techniques used in EDA are:

Francis Galton emphasized order statistics and quantiles.

(tip rate) = 0.18 - 0.01 × (party size)

Retrieved from "https://en.wikipedia.org/w/index.php?title=Exploratory_data_analysis&oldid=1269626762"

You might also like