Ana Assignment 5019
Ana Assignment 5019
Binomial test
For an exact test used in place of the 2 x 1 chi-squared test for goodness of fit, see Binomial
test.
Likelihood-ratio tests in general statistical modelling, for testing whether there is evidence of
the need to move from a simple model to a more complicated one (where the simple model is
nested within the complicated one).
Using the chi-squared distribution to interpret Pearson’s chi-squared statistic requires one to
assume that the discrete probability of observed binomial frequencies in the table can be
approximated by the continuous chi-squared distribution. This assumption is not quite correct
and introduces some error.
To reduce the error in approximation, Frank Yates suggested a correction for continuity that
adjusts the formula for Pearson’s chi-squared test by subtracting 0.5 from the absolute
difference between each observed value and its expected value in a 2 × 2 contingency table.
This reduces the chi-squared value obtained and thus increases its p-value.
Applications
In cryptanalysis, the chi-squared test is used to compare the distribution of plaintext and
(possibly) decrypted ciphertext. The lowest value of the test means that the decryption
was successful with high probability. This method can be generalized for solving modern
cryptographic problems.
In bioinformatics, chi-squared test is used to compare the distribution of certain
properties of genes (e.g., genomic content, mutation rate, interaction network
clustering, etc.) belonging to different categories (e.g., disease genes, essential genes,
genes on a certain chromosome etc.).
Conclusion:
If chi-square calculated value is greater than Chi-square critical value then null-
hypothesis will be rejected .
but if Chi-square calculated value is less than Chi-square critical value then our null
hypothesis will be fail to reject.
T-Test:
A t-test is a type of inferential statistic used to determine if there is a significant
difference between the means of two groups, which may be related in certain features.
It is mostly used when the data sets, like the data set recorded as the outcome from
flipping a coin 100 times, would follow a normal distribution and may have unknown
variances. A t-test is used as a hypothesis testing tool, which allows testing of an
assumption applicable to a population.
A t-test looks at the t-statistic, the t-distribution values, and the degrees of freedom to
determine the statistical significance. To conduct a test with three or more means, one
must use an analysis of variance.
A t-test is a type of inferential statistic used to determine if there is a significant
difference between the means of two groups, which may be related in certain features.
The t-test is one of many tests used for the purpose of hypothesis testing in statistics.
Calculating a t-test requires three key data values. They include the difference between
the mean values from each data set (called the mean difference), the standard deviation
of each group, and the number of data values of each group.
There are several different types of t-test that can be performed depending on the data
and type of analysis required.
Calculating T-Tests
Calculating a t-test requires three key data values. They include the difference between the
mean values from each data set (called the mean difference), the standard deviation of each
group, and the number of data values of each group.
The outcome of the t-test produces the t-value. This calculated t-value is then compared
against a value obtained from a critical value table (called the T-Distribution Table). This
comparison helps to determine the effect of chance alone on the difference, and whether the
difference is outside that chance range. The t-test questions whether the difference between
the groups represents a true difference in the study or if it is possibly a meaningless random
difference.
o A large t-score indicates that the groups are different.
o A small t-score indicates that the groups are similar.
There are three types of t-tests, and they are categorized as dependent and independent t-
tests.
T-Test Assumptions
1. The first assumption made regarding t-tests concerns the scale of measurement. The
assumption for a t-test is that the scale of measurement applied to the data collected
follows a continuous or ordinal scale, such as the scores for an IQ test.
2. The second assumption made is that of a simple random sample, that the data is
collected from a representative, randomly selected portion of the total population.
3. The third assumption is the data, when plotted, results in a normal distribution, bell-
shaped distribution curve.
4. The final assumption is the homogeneity of variance. Homogeneous, or equal, variance
exists when the standard deviations of samples are approximately equal.
Conclusion
Mathematically, the t-test takes a sample from each of the two sets and establishes the
problem statement by assuming a null hypothesis that the two means are equal. Based
on the applicable formulas, certain values are calculated and compared against the
standard values, and the assumed null hypothesis is accepted or rejected accordingly.
If the null hypothesis qualifies to be rejected, it indicates that data readings are strong
and are probably not due to chance. The t-test is just one of many tests used for this
purpose. Statisticians must additionally use tests other than the t-test to examine more
variables and tests with larger sample sizes.