Statistical Evaluation of Big Data
Statistical Evaluation of Big Data
Random samples are used to avoid bias and other unwanted effects.
1 2 3
2 5
4 5 6
7
7 8 9
Classical Statistical Comparison and Evaluation
• Most widely used evaluation technique is Hypothesis
Testing
Mean of sample A, B
Var, S.D
Hypothesis testing
Sig=2
Predicting true – or- false: classification
Classification is the most common application of
computer based prediction.
The typical problem is to distinguish between two
classes.
Error rates
Performance is measured by keeping track of the number
of mistakes that are made on sample cases.
The sample error rate (erate) is the percent of
classifications that are incorrect. It is given by:
erate =*100
Line of Best fit: minimizes the distance between each individual point
and the regression line.
Distance Measures
The objective of regression is to minimize the distance between the
true/observed value for case i, yi and the predicted value yi’.
Two measures of distance are commonly used. The classical regression
measure is mean squared error (mse) and Mean absolute distance
(mad) as:
The mean absolute distance is the more intuitive measure and is less
sensitive to outliers.
Square root of mse (rmse) is slightly larger than mad.
Example
Calculate the distance measures for the following data:
No of cases (Half yearly Observed value (in Lakhs) Predicted Value (in Lakhs)
sales)
January 125 128
February 132 117
March 115 105
April 137 125
May 122 126
June 130 138
mse = {+………….}
mad = {|125-128|+|132-117|………….}
• Computing error measures and Moving Average
Week Sales 3MA Error= Actual – |Error| |%Error|
Forecast
1 39
2 44
3 40
4 45 41 45-41=4 |4|=4 4^2=16 4/45=8.89%
5 38 43 -5 |-5|=5 25 5/38=13.16%
6 43 41 2 2 4 2/43=4.65%
7 39 42 -3 3 9 3/39=7.69%
8 40 14/4=3.5 54/4 = 13.5 34.39%/4 =
MAD= MSE= 13.5 8.60%
MAD=3.5 MAPE=8.60%
Technically, the error rate for classification is a proportion, but in large samples the
error rate is equivalent to a mean.
For regression, we extrapolate this analysis of sample mean error and variance.
Where merr is the mean error measure, either mad or mse, and erri is the error
distance for case i, either |yi- yi‘| or (yi- yi‘)2.
Comparing Results for Error Measures
Performance is measured in terms of mean error(merr) on independent test
cases(A and B)
To compare two results, for example The results for two different prediction
methods and the standard hypothesis testing model can determine whether
differences between merr(A) and merr(B) are significant.