Exploratory Data Analysis (EDA) is an approach to analyzing data sets to summarize their main characteristics with visual methods. The primary goal of EDA is to provide insight into the underlying structure of a data set. EDA involves using techniques like graphs to uncover patterns, extract important variables, detect outliers, and test assumptions. EDA aims to maximize understanding of a data set without relying on formal modeling or hypothesis testing.
Download as PPTX, PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
194 views
Exploratory Data Analysis
Exploratory Data Analysis (EDA) is an approach to analyzing data sets to summarize their main characteristics with visual methods. The primary goal of EDA is to provide insight into the underlying structure of a data set. EDA involves using techniques like graphs to uncover patterns, extract important variables, detect outliers, and test assumptions. EDA aims to maximize understanding of a data set without relying on formal modeling or hypothesis testing.
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 9
CHAPTER III
Exploratory Data Analysis
In short EDA. Exploratory Data Analysis • In statistics, exploratory data analysis (EDA) is an approach to analyzing data sets to summarize their main characteristics, often with visual methods. A statistical model can be used or not, but primarily EDA is for seeing what the data can tell us beyond the formal modeling or hypothesis testing task. Exploratory data analysis was promoted by John Tukey to encourage statisticians to explore the data, and possibly formulate hypotheses that could lead to new data collection and experiments. EDA is different from initial data analysis (IDA), which focuses more narrowly on checking assumptions required for model fitting and hypothesis testing, and handling missing values and making transformations of variables as needed. What are the EDA Goals? • The primary goal of EDA is to maximize the analyst's insight into a data set and into the underlying structure of a data set, while providing all of the specific items that an analyst would want to extract from a data set EDA FUNCTIONS: Exploratory Data Analysis (EDA) is an approach/philosophy for data analysis that employs a variety of techniques (mostly graphical) tomaximize insight into a data set; • uncover underlying structure; • extract important variables; • detect outliers and anomalies; • test underlying assumptions; • develop parsimonious models; and • determine optimal factor settings. EDA FOCUS: • The EDA approach is precisely that--an approach--not a set of techniques, but an attitude/philosophy about how a data analysis should be carried out. PIVOT TABLE…