Adv Data Analysis
Adv Data Analysis
2 Options:
The key to the profit of the business is rely on the data the company provides.
STATISTICS:
Statistics is a branch of mathematics, which deals with processing, analysing and summarizing the
data.
DATA ANALYST is responsible for data collection, data cleaning, data pre processing, data analysis,
data visualization, EDA.
NOTE:
1. Descriptive statistics
2. Inferential statistics.
DESCRIPTIVE STATISTICS:
Statistical methods like mean/avg, median, mode, variance, standard deviation, correlation.
INFERENTIAL STATISTICS:
TYPES OF DATA:
1.NUMERICAL DATA:
1.CONTINUOUS:
If every real number between the picked data points is also a valid value for data, then the data is
called as continuous data.
2.DISCRETE:
In discrete, If we pick any 2 valid points. The real numbers between the 2 valid points is not a data
point in the dataset.
2.CATEGORICAL DATA:
1.ORDINAL:
Ordinal data is categorical data where order is present among categories / classes.
Ex-grade of apple.
2.NOMINAL:
Nominal data is the categorical data with no specified order among the classes / cateegories.
Ex- colours
3.BINARY:
Ex-True / False
NOTE:
Before applying any statistical method, 1st understand the data type.
STATISTICAL METHOD:
1.MEAN / AVERAGE:
APPLICATIONS OF MEAN:
2.MEDIAN:
When outliers are present in the data, then median is more reliable metric then mean.
Outliers present in data, effects the mean value which is not good for proper data analysis.
MODE:
Mode gives the data point/term which is most frequent in the data.
Mode is the term that occurs most number of times in the data.
NOTE:
A data set can have multiple modes but the mean and median of the data set will be only one.