Categorical Assign

Uploaded by

kenenichali06
Copyright
© All Rights Reserved

ODA BULTUM UNIVERSITY

COLLEGE OF NATURAL AND COMPUTATIONAL SCIENCE

DEPARTMENT OF STATISTICS

Individual assignment for introduction to categorical data analysis

Weight: 20%

1. In the following examples, identify the response variable and the explanatory variables.
a) Attitude toward gun control (favor, oppose), Gender (female, male), Mother’s education
(high school, college).
b) Heart disease (yes, no), Blood pressure, Cholesterol level.
c) Race (white, nonwhite), Religion (Catholic, Jewish, Protestant), Vote for president
(Democrat, Republican, Other), Annual income.
d) Marital status (married, single, divorced, widowed), Quality of life (excellent, good, fair,
poor).
2. Compare and contrast the logistic regression model with the ordinary (linear) regression model.

3. Each of the 40 students in a class was asked whether he/she is a vegetarian, and 18 students
answered "yes".
a) Write the likelihood function
b) Estimate the probability of a student being vegetarian.
c) Test H0: π= 0.5 against the alternative H1: π≠0.5.
d) Find a 95% confidence interval for π.
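The quantities asked for in question 3 can be sketched numerically as follows. This is only one possible approach, assuming the 40 answers are independent so that the count of "yes" answers is binomial. It uses a large-sample z test with the standard error evaluated under H0, and a Wald interval for part d). Other tests and intervals (exact binomial, likelihood-ratio, score interval) are equally valid choices. The names `likelihood`, `pi_hat`, and `ci` are illustrative and not part of the assignment.

```python
from math import comb, sqrt
from statistics import NormalDist

n, y = 40, 18  # class size and number of "yes" answers (from the question)

def likelihood(pi):
    """Binomial likelihood L(pi) = C(n, y) * pi^y * (1 - pi)^(n - y)."""
    return comb(n, y) * pi**y * (1 - pi) ** (n - y)

# b) ML estimate of pi: the sample proportion
pi_hat = y / n

# c) Large-sample z test of H0: pi = 0.5 (standard error computed under H0)
pi0 = 0.5
z = (pi_hat - pi0) / sqrt(pi0 * (1 - pi0) / n)
p_value = 2 * (1 - NormalDist().cdf(abs(z)))  # two-sided P-value

# d) 95% Wald interval (standard error computed at pi_hat)
z_crit = NormalDist().inv_cdf(0.975)
half_width = z_crit * sqrt(pi_hat * (1 - pi_hat) / n)
ci = (pi_hat - half_width, pi_hat + half_width)

print(f"pi_hat = {pi_hat:.3f}, z = {z:.3f}, P-value = {p_value:.3f}")
print(f"95% Wald CI: ({ci[0]:.3f}, {ci[1]:.3f})")
```

H0 is rejected at the 5% level only if the printed P-value is below 0.05.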
4. Suppose it is hypothesized that a given six-sided die is unbiased. To test this hypothesis,
the die is rolled 300 times and the frequency of occurrence of each face is observed.
Under the hypothesis that the die is unbiased, each face is expected to occur 50 times.
However, the following frequencies of occurrence are observed.

Value        1    2    3    4    5    6
Frequency   42   55   38   57   64   44

Is the die biased, or can the difference be attributed to random fluctuation? Use α = 0.05.
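One way to approach question 4 is Pearson's chi-squared goodness-of-fit test, sketched below with the observed frequencies from the table. The critical value 11.07 is the upper 5% point of the chi-squared distribution with 5 degrees of freedom, taken from a standard table. It is an assumption brought in from outside the assignment text, as are the variable names.

```python
observed = [42, 55, 38, 57, 64, 44]  # frequencies for faces 1..6 (from the table)
n = sum(observed)                    # total number of rolls
expected = [n / 6] * 6               # equal expected counts under H0: die is unbiased

# Pearson statistic: sum over faces of (observed - expected)^2 / expected
x2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
df = len(observed) - 1               # number of categories minus one

# Upper 5% point of chi-squared with 5 df (value from a standard table)
CRITICAL_VALUE = 11.07

reject_h0 = x2 > CRITICAL_VALUE
print(f"X^2 = {x2:.2f}, df = {df}, reject H0 at 0.05: {reject_h0}")
```

If the statistic does not exceed the critical value, the observed differences are consistent with random fluctuation at the 5% level.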

5. Assume that the number of cases of tetanus reported during a single month in 2014 has a
Poisson distribution with parameter µ. The numbers of cases reported in January and February
are 1 and 3, respectively.
a) Find and plot the likelihood function over the space of potential values for µ.
b) What is the maximum likelihood estimate (MLE) of µ?
c) Give an estimate of the probability that there is no case of tetanus reported for a given month.
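For question 5, the likelihood can be evaluated on a grid of candidate values of µ and then plotted with any plotting tool. The sketch below assumes the two monthly counts are independent Poisson observations with a common µ. The grid limits (0.01 to 8.00) and all names are illustrative choices, not part of the assignment.

```python
from math import exp, factorial

counts = [1, 3]  # cases reported in January and February (from the question)

def likelihood(mu):
    """Product of Poisson probabilities exp(-mu) * mu^y / y! over the observed counts."""
    value = 1.0
    for y in counts:
        value *= exp(-mu) * mu**y / factorial(y)
    return value

# a) Likelihood evaluated on a grid of mu values; (grid, values) are the points to plot
grid = [i / 100 for i in range(1, 801)]
values = [likelihood(mu) for mu in grid]
grid_argmax = grid[values.index(max(values))]

# b) The Poisson MLE is the sample mean of the counts
mu_hat = sum(counts) / len(counts)

# c) Estimated probability of no cases in a month: P(Y = 0) = exp(-mu_hat)
p_zero = exp(-mu_hat)

print(f"grid maximiser = {grid_argmax:.2f}, MLE = {mu_hat:.2f}, P(Y=0) = {p_zero:.4f}")
```

The grid maximiser should agree with the sample mean, which gives a quick check on the analytical derivation of the MLE.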

6. Consider a data set in which the response is whether the subject achieved a three-year
disease-free interval.

a) Show that each predictor has a significant effect when used individually without the
others.
b) Try to fit a main-effects logistic regression model containing all three predictors. Explain
why the ML estimate for the effect of lymphocytic infiltration is infinite.
c) Using conditional logistic regression,
i. Conduct an exact test for the effect of lymphocytic infiltration, controlling for the
other variables; and,
ii. Find a 95% confidence interval for the effect. Interpret results.
7. Find the mean and variance of the multinomial distribution. Show all necessary steps.
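A possible outline for question 7 writes each cell count as a sum of indicator variables. The notation is assumed here and is not given in the assignment: n independent trials, c categories with probabilities π_1, ..., π_c summing to 1, and Y_ij = 1 if trial i falls in category j (0 otherwise).

```latex
% Cell count as a sum of indicators over the n independent trials
N_j = \sum_{i=1}^{n} Y_{ij}, \qquad P(Y_{ij} = 1) = \pi_j .

% Moments of one indicator (note Y_{ij}^2 = Y_{ij})
E(Y_{ij}) = \pi_j, \qquad
\operatorname{Var}(Y_{ij}) = E(Y_{ij}^2) - \pi_j^2 = \pi_j (1 - \pi_j).

% A single trial falls in only one category, so Y_{ij} Y_{ik} = 0 for j \neq k
\operatorname{Cov}(Y_{ij}, Y_{ik}) = E(Y_{ij} Y_{ik}) - \pi_j \pi_k = -\pi_j \pi_k .

% Summing over the n independent trials
E(N_j) = n \pi_j, \qquad
\operatorname{Var}(N_j) = n \pi_j (1 - \pi_j), \qquad
\operatorname{Cov}(N_j, N_k) = -n \pi_j \pi_k \quad (j \neq k).
```

The same results can also be reached by differentiating the multinomial moment generating function, which may be the route intended by "all necessary steps".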
