Exploratory Data Analysis
- refers to the critical process of performing initial investigations on data so as to discover
patterns, spot anomalies, test hypotheses, and check assumptions with the help of
summary statistics and graphical representations.
- a bit like taking the vital signs of your data set in order to tell what you are working with. EDA
can be an explicit step you take during (or before) your analysis, or it can be a more organic
process that changes in quantity and quality with each data set. EDA can help to familiarize you
with the data (especially if it is not yours) or help you gain a deeper understanding of possible
features and relationships in the data.
- a statistical approach for analyzing data sets in order to summarize their main
characteristics, generally with the help of visual aids. The EDA approach can be used to
gather knowledge about the following aspects of data:
  - Main characteristics or features of the data.
  - The variables and their relationships.
  - The important variables that can be used in our problem.
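One way to look at "the variables and their relationships" is to compute the correlation of each variable with the outcome of interest and rank the results. The sketch below is only an illustration: the data set, the variable names (`hours_studied`, `hours_slept`, `exam_score`) and the helper `pearson` are all made up for this example, and it uses plain Python instead of pandas or R so that it runs without extra packages.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical toy data: each key is a variable, each list holds its observations.
data = {
    "hours_studied": [1, 2, 3, 4, 5, 6],
    "hours_slept": [8, 6, 7, 5, 7, 6],
    "exam_score": [52, 58, 65, 70, 78, 85],
}
target = "exam_score"  # the variable we would like to explain

# Correlation of every other variable with the target.
correlations = {
    name: pearson(values, data[target])
    for name, values in data.items()
    if name != target
}

# Rank by absolute correlation: strong negative relationships matter too.
ranked = sorted(correlations.items(), key=lambda kv: abs(kv[1]), reverse=True)
for name, r in ranked:
    print(f"{name}: r = {r:+.2f}")
```

A high absolute correlation only suggests that a variable may be useful. It does not show causation, and it misses non-linear relationships, so a scatter plot of each pair is still worth a look.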
Exploratory Data Analysis is an essential tool for summarizing the main characteristics of a data set,
often through data visualization. It is the key to understanding the data better: it reveals data
patterns, relationships between variables, outliers, and any special trends, all of which help the
data tell a story.
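As a rough sketch of the "summary statistics" and "outliers" parts of EDA, the snippet below summarizes one numeric variable and flags values outside the 1.5 x IQR fences. The IQR rule is just one common rule of thumb chosen for this illustration. The `readings` values and the helpers `summarize` and `iqr_outliers` are hypothetical names, and only Python's standard `statistics` module is used.

```python
import statistics


def summarize(values):
    """Basic summary statistics for a list of numbers."""
    q1, median, q3 = statistics.quantiles(values, n=4)  # quartile cut points
    return {
        "n": len(values),
        "min": min(values),
        "q1": q1,
        "median": median,
        "mean": statistics.mean(values),
        "q3": q3,
        "max": max(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
    }


def iqr_outliers(values, k=1.5):
    """Values lying more than k * IQR below Q1 or above Q3."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical measurements with one suspicious value.
readings = [12, 14, 13, 15, 14, 13, 12, 15, 14, 48]

for name, value in summarize(readings).items():
    print(f"{name}: {value}")
print("possible outliers:", iqr_outliers(readings))
```

Note how the single extreme value pulls the mean well above the median. Comparing the two is a quick first hint that a variable is skewed or contains anomalies.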
https://www.indeed.com/career-advice/career-development/how-to-conduct-exploratory-data-analysis
Sample Analysis: https://towardsdatascience.com/exploratory-data-analysis-8fc1cb20fd15 - (pandas)
https://bookdown.org/steve_midway/DAR/exploratory-data-analysis.html - Peng's 10 steps (R)
https://towardsdatascience.com/exploratory-data-analysis-in-r-for-beginners-fe031add7072 - (R)
https://cran.r-project.org/web/packages/dlookr/vignettes/EDA.html - (R)
https://www.pluralsight.com/guides/exploratory-data-analysis-in-r - (R)
https://www.geeksforgeeks.org/exploratory-data-analysis-in-r-programming/ - (R)
https://www.youtube.com/watch?v=Z76AWVc6E60 - Using ggplot in R (from the saved code)
https://www.statology.org/exploratory-data-analysis-in-r/
*https://www.statology.org/what-is-a-strong-correlation/
https://arshren.medium.com/exploratory-data-analysis-in-r-3c32df481aad
https://statsandr.com/blog/outliers-detection-in-r/ - outlier detection, Rosner's test
Predictive Data Analysis
- analysis which applies statistical operations to historical facts in order to predict future events.
Predictive analytics can help many businesses because it identifies relationships in the
collected data and, based on those relationships, predicts a pattern.
- a general approach to analyzing historical data and attempting to predict what will happen next.
Many organizations use this type of analysis to predict future sales, expected costs, equipment
failures or any number of other potential events.
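A minimal sketch of the idea, assuming the simplest possible model: fit a straight line to historical values by ordinary least squares and extend it one period ahead. The monthly `sales` figures and the helper `fit_line` are invented for illustration, and real predictive work would normally compare several model types and validate them on held-out data.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical historical data: sales for months 1 through 6.
months = [1, 2, 3, 4, 5, 6]
sales = [100, 108, 121, 129, 142, 150]

slope, intercept = fit_line(months, sales)

# Predict the next, not yet observed, month by extending the fitted line.
next_month = 7
forecast = slope * next_month + intercept
print(f"trend: {slope:.2f} per month, forecast for month {next_month}: {forecast:.1f}")
```

The forecast is only as good as the assumption that the past trend continues, which is why the relationship found in the historical data should be checked before it is relied on.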
https://www.educba.com/predictive-analysis-in-r/ - (R)
https://www.youtube.com/watch?v=DPAsYtUHkkM - (R)
https://pestleanalysis.com/predictive-analysis/ - Types of Predictive Models
https://www2.insightsoftware.com/definitive-guide-to-predictive-analytics/7-steps-to-start-your-predictive-analytics-project/ - Steps to start predictive analysis
https://www.geeksforgeeks.org/predictive-analysis-in-r-programming/ - (R) additional predictive analysis material