Statistics and Data Analytics Notes
Statistics is the science of collecting, organizing, analyzing, and interpreting data to make informed decisions.
Descriptive statistics summarize data using numerical measures and graphical tools.
- Median: The middle value of the sorted data, separating the higher half from the lower half.
- Standard Deviation: The square root of the variance; it measures how far values typically lie from the mean.
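The two measures above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function names and the `sample` data are made up for the example, the input is assumed to be a non-empty list of numbers, and the standard deviation uses the population form (dividing by n rather than n - 1).

```python
import math
import statistics

def median(values):
    """Middle value of the sorted data; mean of the two middle values if the count is even."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2

def population_std(values):
    """Square root of the population variance (average squared deviation from the mean)."""
    mean = sum(values) / len(values)
    variance = sum((x - mean) ** 2 for x in values) / len(values)
    return math.sqrt(variance)

# Hypothetical data, chosen only for illustration.
sample = [2, 4, 4, 4, 5, 5, 7, 9]

print(median(sample))          # 4.5
print(population_std(sample))  # 2.0

# The standard library provides the same measures.
print(statistics.median(sample), statistics.pstdev(sample))
```

For a sample rather than a full population, `statistics.stdev` divides by n - 1 instead.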
Data Visualization
Data visualization involves presenting data in graphical format to identify patterns, trends, and outliers.
- Histograms
- Boxplots
- Scatter plots
- Bar charts
- Line graphs
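As a rough sketch of what a histogram does, the snippet below bins values and draws the counts as text bars. It assumes non-negative integer data and a fixed bin width; the function names and the `scores` list are hypothetical, and empty bins are simply omitted. A plotting library would normally be used instead.

```python
from collections import Counter

def histogram_counts(values, bin_width):
    """Count the values falling in each bin [start, start + bin_width)."""
    counts = Counter((v // bin_width) * bin_width for v in values)
    return dict(sorted(counts.items()))

def render(counts, bin_width):
    """Draw one text bar per bin, labelled with the bin's integer range."""
    lines = []
    for start, n in counts.items():
        label = f"{start}-{start + bin_width - 1}"
        lines.append(f"{label:<7}{'#' * n}")
    return "\n".join(lines)

# Hypothetical exam scores, chosen only for illustration.
scores = [52, 55, 61, 63, 64, 67, 70, 71, 72, 75, 78, 81, 99]

counts = histogram_counts(scores, 10)
print(render(counts, 10))
```

Even in this crude form the picture shows the cluster in the 60s and 70s and the isolated value of 99, which is the kind of pattern and outlier a chart is meant to expose.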
Probability distributions describe how probabilities are distributed over values of a random variable.
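A small example may make this concrete. The sketch below uses the binomial distribution, picked here only as an illustration since the notes do not name one; `binomial_pmf` and the parameter values are assumptions for the example. It lists the probability of each possible value and checks that they sum to 1.

```python
from math import comb

def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n independent trials with success probability p."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

# Hypothetical setting: 4 fair coin flips, counting heads.
n, p = 4, 0.5
pmf = {k: binomial_pmf(k, n, p) for k in range(n + 1)}

for k, prob in pmf.items():
    print(k, prob)

# The probabilities over all possible values sum to 1.
total = sum(pmf.values())
print(total)
```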
Hypothesis Testing
Linear algebra involves vectors, matrices, and linear transformations used in population modeling and
statistical analysis.
- Population statistics: Mean, variance, and correlation structures modeled using matrices.
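One way to see how matrices carry these statistics: store each observation as a row, then compute a mean vector, a covariance matrix, and a correlation matrix. The sketch below uses plain Python lists to stay self-contained. The function names and the `data` rows are illustrative assumptions, population formulas (dividing by n) are used, and every variable is assumed to have nonzero variance.

```python
def mean_vector(rows):
    """Column-wise means of a data matrix given as a list of rows."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def covariance_matrix(rows):
    """Population covariance matrix: entry (i, j) averages the products of deviations of variables i and j."""
    n = len(rows)
    mu = mean_vector(rows)
    centered = [[x - m for x, m in zip(row, mu)] for row in rows]
    d = len(mu)
    return [[sum(r[i] * r[j] for r in centered) / n for j in range(d)]
            for i in range(d)]

def correlation_matrix(cov):
    """Rescale covariances by the standard deviations taken from the diagonal."""
    d = len(cov)
    sd = [cov[i][i] ** 0.5 for i in range(d)]
    return [[cov[i][j] / (sd[i] * sd[j]) for j in range(d)] for i in range(d)]

# Hypothetical data: two variables, the second exactly twice the first.
data = [[1.0, 2.0], [2.0, 4.0], [3.0, 6.0]]

mu = mean_vector(data)
cov = covariance_matrix(data)
corr = correlation_matrix(cov)
print(mu)
print(cov)
print(corr)
```

Because the second variable is an exact multiple of the first, the off-diagonal correlation comes out at 1 up to floating-point rounding.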
Quantitative Analysis
Quantitative analysis uses mathematical and statistical modeling, measurement, and research to understand behavior.
- Optimization
- Gauss-Markov Theorem: In a linear model whose errors have zero mean, constant variance, and no correlation, OLS estimators are BLUE (Best Linear Unbiased Estimators)
- Vector Space: Collection of vectors closed under vector addition and scalar multiplication
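To make the OLS estimator in the Gauss-Markov item concrete, here is a minimal sketch of ordinary least squares for a single predictor, using the closed-form slope and intercept. It shows only how the estimates are computed and does not demonstrate the theorem itself. The name `ols_fit` and the data are hypothetical, and the x values are assumed not to be all identical.

```python
def ols_fit(x, y):
    """Return (intercept, slope) minimizing the sum of squared residuals for y = a + b*x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    # Slope: co-variation of x and y divided by the variation of x.
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    slope = sxy / sxx
    # The fitted line passes through the point of means.
    intercept = y_bar - slope * x_bar
    return intercept, slope

# Hypothetical noise-free data lying exactly on y = 1 + 2x.
x = [0.0, 1.0, 2.0, 3.0]
y = [1.0, 3.0, 5.0, 7.0]

intercept, slope = ols_fit(x, y)
print(intercept, slope)  # 1.0 2.0
```

With noisy data the estimates would differ from the true coefficients. The theorem says that, among linear unbiased estimators, these have the smallest variance.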