0% found this document useful (0 votes)
29 views

Descriptive Statistics: Hypothesis Testing - Means

This document defines key statistical formulas used for descriptive statistics, hypothesis testing, means, proportions, regression lines, and confidence intervals. It explains formulas for calculating the mean, variance, standard deviation, standard error, z-score, least squares regression line, residuals, sum of squares, coefficient of determination, correlation coefficient, slope, intercept, confidence levels, and prediction intervals. The document provides statistical formulas and the symbols used to represent variables and parameters in the formulas.

Uploaded by

ponku75
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
29 views

Descriptive Statistics: Hypothesis Testing - Means

This document defines key statistical formulas used for descriptive statistics, hypothesis testing, means, proportions, regression lines, and confidence intervals. It explains formulas for calculating the mean, variance, standard deviation, standard error, z-score, least squares regression line, residuals, sum of squares, coefficient of determination, correlation coefficient, slope, intercept, confidence levels, and prediction intervals. The document provides statistical formulas and the symbols used to represent variables and parameters in the formulas.

Uploaded by

ponku75
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

STATISTICS FORMULAS

DESCRIPTIVE STATISTICS:

HYPOTHESIS TESTING MEANS:

MEAN:

STANDARD ERROR:

VARIANCE:

MARGIN OF ERROR: m =

STANDARD DEVIATION:

CONFIDENCE INTERVAL: C.I. =

STANDARD ERROR:

SAMPLE SIZE FOR A GIVEN m:

Z-SCORE:

ONE SAMPLE Z-TEST:

REGRESSION LINES:

or m =

T-TEST:

TWO SAMPLE Z-TEST:

For a data set


, where (
) are the centroids (means)
of the data set, and is the correlation coefficient:
LEAST-SQUARES REGRESSION LINE:

TWO SAMPLE T-TEST:

RESIDUALS:
SSM

SSE

SST = SSM+SSE
2

PROPORTION:

, where X= number of successes

COEFFICIENT OF DETERMINATION: r =

STANDARD ERROR:

CORRELATION COEFFICIENT: r =

MARGIN OF ERROR: m =

SLOPE:

Z-TEST, ONE-SAMPLE PROPORTION:

INTERCEPT:
STD ERR, 2-SAMP PROP:
VARIANCE:

ST DEV:
MARGIN OF ERR, 2-SAMP PROP: m =

STANDARD ERROR b1: SEb1 =


PLUS FOUR PROPORTIONS:
STANDARD ERROR bo: SEb0 =

EST DIFF BTWN PROPS:

CONFIDENCE LEVEL FOR THE INTERCEPT


CONFIDENCE LEVEL FOR THE SLOPE:
PREDICTION INTERVAL:

:
:

t*SEb0

STD DEV:

t*SEb1
POOLED PROPORTION:
POOLED STD ERR:
TWO SAMPLE Z-SCORE:

th

Reference: Moore DS, McCabe GP & Craig BA. Introduction to the Basic Practice of Statistics. New York: W.H. Freeman & Co, 5 edition.

DESCRIPTIONS OF STATISTICS FORMULAS


MEAN: The mean, symbolized by x-bar, equals one divided by the number of samples multiplied by the sum of all data points,
symbolized by x-sub-i.
VARIANCE: Variance, symbolized by s squared, equals 1 divided by the number of samples minus one, multiplied by the sum of each
data point subtracted by the mean then squared.
STANDARD DEVIATION: Standard deviation, symbolized by s, equals the square root of the variance s-squared.
STANDARD ERROR: The standard error of the mean equals the standard deviation divided by the square root of the number of
samples.
Z-SCORE: Z equals the test data minus the population mean, then divided by the population standard deviation.
REGRESSION LINES:
LEAST-SQUARES REGRESSION LINE: The predicted value, symbolized by y-hat, equals the intercept, symbolized by b-sub-o, plus the
slope, symbolized by b-sub-1, times the data point x.
RESIDUALS: The residual, symbolized by e-sub-I, equals the data point y, symbolized by y-sub-I, minus the predicted value from the
least-squares regression line, symbolized y-hat.
SSM, SSE, SST: Sum of square means equals the sum of the centriod, symbolized by y-bar, minus the predicted value of each x data
point, symbolized by y-hat sub I. Sum of square errors equal the sum of each y data point, symbolized by y-sub-I, minus the
predicted value of each data point, symbolized by y-hat-sub-I, then squared. The Sum of Square Total = Sum of Square Means plus
Sum of Square Errors.
COEFFICIENT OF DETERMINATION: The coefficient of determination, symbolized r-squared, equals the sum of square means divided
by the sum of squares total.
CORRELATION COEFFICIENT: The correlation coefficient r equals the square root of the coefficient of determination, symbolized by
r-squared.
SLOPE: Slope, symbolized b-sub-one, equals the correlation coefficient r multiplied by the ratio of the standard deviation of the x
data points to the standard deviation of the y data points.
INTERCEPT: Intercept, symbolized by b-sub-zero, equals the mean of the y data points, symbolized by y-bar, minus the slope,
symbolized by b-sub-one multiplied by the mean of the x data points, symbolized by x-bar.
VARIANCE: Mean of Square Errors, symbolized s-squared or MSE, is equal to the sum of the residuals, symbolized by e-sub-I,
squared then divided by the number of data points subtracted by two. STANDARD DEVIATION, symbolized by s, equals the square
root of variance.
STANDARD ERROR: The standard error of the slope, symbolized by SE-sub-b1, equals the standard deviation, symbolized by s,
divided by the square root of the sum of each data point, symbolized by x-sub-I, subtracted from the mean of all x data points,
symbolized by s-bar, then squared.
The STANDARD ERROR of the intercept, symbolized by SE-sub-bo, equals the standard deviation, symbolized by s, multiplied by the
square root of one divided by the number of data points plus the mean of all xs squared, symbolized by x-bar squared, divided by
the sum of all x data points, symbolized by x-sub-I minus the mean of all x data points, symbolized by x-bar, squared.

Reference: Moore DS, McCabe GP & Craig BA. Introduction to the Basic Practice of Statistics. Ne w York: W.H. Freeman & Co, 5th edition.

CONFIDENCE LEVEL FOR THE INTERCEPT: The confidence level for the intercept, symbolized beta-sub-zero, equals the sample
intercept, symbolized by b-sub-zero, plus or minus the t-score for the interval, symbolized by t, multiplied by the standard error of
the intercept.
CONFIDENCE LEVEL FOR THE SLOPE: The confidence level for the slope, symbolized by beta-sub-one, equals the sample slope,
symbolized by b-sub-one, plus or minus the t-score for the interval, symbolized by t, multiplied by the standard error of the slope.
PREDICTION INTERVAL: The prediction interval equals the predicted value of y, symbolized by y-hat, plus or minus the t-score for
the interval, symbolized by t, multiplied by the standard error.

Reference: Moore DS, McCabe GP & Craig BA. Introduction to the Basic Practice of Statistics. Ne w York: W.H. Freeman & Co, 5th edition.

You might also like