ISYE 6644 Summer 2024 Final Exam Graded A
ISYE 6644 Summer 2024 Final Exam Graded A
ISYE 6644 Summer 2024 Final Exam Graded A
TRUE or FALSE? The Bechhofer procedure for selecting the normal population with
the largest mean specifies the appropriate number of observations to take from each
competing population, and simply selects the competitor having the largest sample
mean. - ANSWER True
Suppose that a Bernoulli selection procedure tells you to take 100 observations from
each of two populations, A and B. It turns out that A gets 85 successes and B gets
46 successes. What do you think? - ANSWER 1) A almost certainly has a higher
success probability than B.
2) We could've probably stopped sampling a bit earlier (i.e., with fewer than 100
observations) because A was so far ahead of B.
Suppose that we want to know which of Coke, Pepsi and Dr.pepper is the most
popular. We would like to make the correct selection with probability of at least
P*=0.90 in the event that the ration of the highest-to-second-highest preference
probabilities happens to be at least 0*=1.4. How many people does the single-stage
procedure Mbem require us to interview? - ANSWER 126
Go to the table in the notes and pick off the entry for k=3, P^⋆=0.90, and θ^⋆=1.4.
Which of the following parameters can you get confidence intervals for?
Means
Variances
Quantiles
Differences between the means of two systems - ANSWER All of the above
If we have an iid normal sample of observations, X1, X2,...Xn, what probability
distribution is most-commonly used to obtain cofidence intervals for the mean? -
ANSWER t
TRUE or FALSE? The paired CI for the differences in two means is designed to work
especially well if all of the observations from the first population are completely
independent of all of the observations from the second population. - ANSWER
FALSE. {In fact, it's easier to distinguish between the two means if Xi is positively
correlated with Yi. Think about my parallel parking example in the class notes.}
Statistical ranking and selection techniques have been designed to address a variety
of comparison problems. Which ones from the following list?
Find the population having the largest mean.
Find the system with the smallest variance.
Find the alternative with the highest success probability.
Find the most-popular candidate.
All of the above. - ANSWER All of the above.
Suppose we are dealing with i.i.d. normal observations with unknown variance.
Which of the following is true about a 95% confidence interval for the mean μ? -
ANSWER We are 95% sure that our CI will actually contain the unknown value of μ.
We are studying the waiting times arising from two queueing systems. Suppose we
make 4 independent replications of both systems, where the systems are simulated
independently of each other.
replication system 1 system2
1 10 25
2 20 10
3 5 40
4 30 30
Assuming that the average waiting time results from each replication are
approximately normal, find a two-sided 95% CI for the difference in the means of the
two systems. - ANSWER This is a two-sample CI problem assuming unknown and
unequal variances. We have
[-29.76, 9.76]
This is sort of the same as Question 2, except we have now used common random
numbers to induce positive correlation between the results of the two systems. Again
find a two-sided 95% CI for the difference in the means of the two systems. -
ANSWER This is a paired-t CI problem assuming unknown variance of the
differences.
[-16.5, -3.5]
Suppose A and B are two identically distributed, unbiased, antithetic estimators for
the mean μ of some random variable, and let C = ( A + B ) / 2. Which of the following
is true? - ANSWER E [ C ] = μ and V a r ( C ) < V a r ( A ) / 2.
Suppose that you want to pick that one of three normal populations having the
largest mean. We'll assume that the variances of the three competitors are all known
to be equal to σ 2 = 4. (Ya, I know that this is a crazy, unrealistic assumption, but
let's go with it anyway, okey dokey?) I want to choose the best of the three
populations with probability of correct selection of 95% whenever the best
population's mean happens to be at least δ ⋆ = 1 larger than the second-best
population's. How many observations from each population does Bechhofer's
procedure N B tell me to take before I can make such a conclusion? - ANSWER
Using the notation of the notes, we want to make sure to get the right answer with
probability of P ⋆ = 0.95 whenever μ [ k ] − μ [ k − 1 ] ≥ δ ⋆ = 1.
We simply go to NB's table with k = 3 and δ ⋆ / σ = 1 / 2 to obtain a sample size of n
= 30 from each population.
In the above problem, suppose that we take the necessary observations and we
come up with the following sample means: X ¯ 1 = 7.6, X ¯ 2 = 11.1, and X ¯ 3 = 3.6.
What do we do? - ANSWER Pick population 2 and say that we are right with
probability at least 95%
Suppose that we want to know which of Coke, Pepsi, and Dr. Pepper is the most
popular. We would like to make the correct selection with probability of at least P ⋆ =
0.90 in the event that the ratio of the highest-to-second-highest preference
probabilities happens to be at least θ ⋆ = 1.4. If we use procedure M B E M, then the
corresponding table in the notes (with k = 3) tells us to take 126 samples (taste
tests). Suppose we take those samples sequentially and after 100 have been taken it
turns out that 65 people prefer Coke, 25 love Pepsi, and 10 like Dr. Pepper. What to
do? - ANSWER Stop the test now and declare with confidence of at least 90% that
Coke is the most-preferred.
Suppose we want to estimate the expected average waiting time for the first m = 100
customers at a bank. We make r = 4 independent replications of the system, each
initialized empty and idle and consisting of 100 waiting times. The resulting replicate
means are:
i 1 2 3 4 Z i 5.2 4.3 3.1 4.2
Find a 90% confidence interval for the mean average waiting time for the first 100
customers. - ANSWER [3.188,5.212]
Consider a particular data set of 100,000 stationary waiting times obtained from a
large queueing system. Suppose your goal is to get a confidence interval for the
unknown mean. Would you rather use (a) 50 batches of 2000 observations or (b)
10000 batches of 10 observations each? - ANSWER 50 batches of 2000
observations
because the method of batch means requires a very large batch size
Consider the output analysis method of non overlapping batch means. Assuming
that you have a sufficiently large batch size, it can be shown that when the number
of batches b is even, the expected width of the 90% two-sided confidence interval for
μ is proportional tot 0.05 , b − 1 b − 1 ( b − 1 2 ) ( b − 3 2 ) ⋯ 1 2 ( b − 2 2 ) ! .Using
the above equation, determine which of the following values of b gives the smallest
expected width. - ANSWER b=6
Let h ( b ) denote the value of the above expression as a function of b. Then easy
calculations reveal that h ( b ) = 3.157, h ( 4 ) = 1.019, and h ( 6 ) = 0.845. So the
answer is b = 6
We often distinguish between two general types of simulations with regard to output
analysis. What are they called? - ANSWER Finite-horizon and steady-state
TRUE or FALSE? Suppose that X 1 , X 2 , . . . , X n are consecutive waiting times,
and we define the sample mean X ¯ = ∑ i = 1 n X i / n. Then V a r ( X ¯ ) = V a r ( X
i ) / n. - ANSWER Very FALSE! (The issue is that correlation between the
observations messes up the variance of the sample mean. In fact, this is one of the
main reasons why output analysis is difficult!)
TRUE or FALSE? The main method of attack for terminating simulations is via
independent replications. - ANSWER TRUE (Independent replications help to
mitigate issues with simulation outputs not being i.i.d.)
TRUE or FALSE? You can also conduct finite-horizon estimation for quantities other
than expected values, e.g., simulate a bank from 8:00 a.m. to 5:00 p.m., and find a
confidence interval for the 95th quantile of customer waiting times. - ANSWER
TRUE (the method of independent replications allows us to determine sample
variances, which subsequently allows us to determine confidence intervals)
Find the sample variance of -3, -2, -1, 0,1,2,3 - ANSWER 14/3
S^2 formula
If X 1 , ... , X 10 are i.i.d. Pois(6), what is the expected value of the sample variance
S^2? - ANSWER 6
S^2 is always unbiased for the variance of X i. Thus, we have E [ S^2 ] = V a r ( X i )
=λ=6
Suppose that estimator A has bias = 3 and variance = 12, while estimator B has bias
-2 and variance = 14. Which estimator (A or B) has the lower mean squared error? -
ANSWER MSE = Bias^ 2 + Var, so
M S E ( A ) = 9 + 12 = 21 and M S E ( B ) = 4 + 14 = 18.
What's a good distribution for modeling the number of random customer arrivals to a
store? - ANSWER The Poisson distribution is used for counting the number of
arrivals over some interval of time.
TRUE or FALSE? If the expected value of your estimator equals the parameter that
you're trying to estimate, then your estimator is unbiased. - ANSWER TRUE. (This
is the definition of unbiasedness in words.)
TRUE or FALSE? If X 1 , X 2 , . . . , X n are i.i.d. with mean μ, then the sample mean
X ¯ is unbiased for μ. - ANSWER TRUE
TRUE or FALSE? It's possible to estimate two MLEs simultaneously, e.g., for the N o
r ( μ , σ 2 ) distribution. - ANSWER TRUE (it's possible based on taking the partial
derivatives of the likelihood function with respect to each parameter).
TRUE or FALSE? Sometimes it might be difficult to obtain an MLE in closed form. -
ANSWER TRUE. (Think of the gamma example that we did.)
Suppose that the MLE for a parameter θ is θ ^ = 4. Find the MLE for θ. - ANSWER
Invariance immediately implies that the MLE of sqrt(θ) is simply sqrt( θ ^) = 2θ ^ = 2
If you're in this course, then you should always love your. . . - ANSWER Method of
Moments !
Suppose H 0 is true, but you've just rejected it! What have you done? - ANSWER
You've committed a Type I error.
Suppose that you are testing 100 observations to see if they are exponential (with
unknown rate parameter λ). You decide to break the hypothesized p.d.f. into 5
intervals. How many degrees of freedom will your resulting chi-square goodness-of-
fit test statistic have? - ANSWER The degrees of freedom is k−1−s=5−1−1=3
Which of the following problematic issues can arise in input data analysis? -
ANSWER Not enough data
Data coming from strange-looking, "non-standard" distributions
Nonstationary data (in which the distribution appears to change over time)
Correlated data
TRUE or FALSE? Arena has an Input Analyzer that automates distribution fitting for
certain well-known distributions. - ANSWER TRUE. (It's often a great time-saver!
TRUE or FALSE? The times between Poisson( λ ) arrivals are i.i.d. Exp( λ ). -
ANSWER TRUE. (... as long as ( λ ) doesn't change over time.)
Consider the time series Y i = ϕ Y i − 1 + ϵ i , i = 1 , 2 , ..., where the ϵ i's are i.i.d.
normal. What is this time series called? (Give the best answer.) - ANSWER first-
order autoregressive process
If 𝑋 is a continuous random variable with c.d.f. 𝐹(𝑥), what's the distribution of 𝐹(𝑋)? -
ANSWER Unif(0,1)
TRUE or FALSE? You can find the inverse c.d.f. Φ^−1(. ) of the standard normal
distribution in closed form. - ANSWER FALSE. You need to use an approximation.
If 𝑈 is Unif(0,1), what is ⌈6𝑈⌉? (Note that ⌈.⌉ is the round-up function.) - ANSWER A
6-side die toss
TRUE or FALSE? If you can't find a good theoretical distribution to model a certain
random variable, you might want to use the empirical distribution of the data to do
so. - ANSWER TRUE. That's the point of this lesson!
Support that U1 and U2 are PRNs. What's the distribution of U1 + U2? - ANSWER
Triangular (0,1,2)
Suppose that I want to generate a simple Unif(2/3, 1) via A-R. Support I generate a
PRN U1=0.16. Do I accept U1 as my Unif(2/3, 1)? - ANSWER NO. In this example,
we only accept if U1 ≥ 2/3; so we reject and try again until we meet that condition.
TRUE or FALSE? The proof that A-R works is really easy. - ANSWER FALSE.
Super false, in fact!
True or False? The A-R algorithm for X~Pois(λ) tells us to generate PRNs until e^-
λ .... Ui for the first time, and then set X=n. - ANSWER TRUE. Yup - that's how you
do it (though sometimes it's a bit tedious if λ is large).
Unif(0,1) PRNs can be used to generate which of the following random entities? -
ANSWER All of the above --- and just about anything else!
Suppose that U 1 , U 2 , ... , U 5000 are i.i.d. Unif(0,1) random variables. Using Excel
(or your favorite programming language), simulate X i = − ln ( U i ) for i = 1 , 2 , ... ,
5000. Draw a histogram of the 5000 numbers. What p.d.f. does the histogram look
like? - ANSWER Exponential
How would you simulate the sum of two 6-sided dice tosses? (Note that ⌈ ⋅ ⌉ is the
round-up function; and all of the U's denote PRNs.) - ANSWER ⌈ 6 U 1 ⌉ + ⌈ 6 U 2 ⌉
⌈ ln ( 1 − U ) / ln ( 0.4 ) ⌉
Suppose that U and V are PRNs. Let X = U + V. Simulate this 5000 times, and draw
a histogram of the 5000 numbers. What p.d.f. does the histogram look like? -
ANSWER Triangular
By the lesson notes, we know that the 5000 Xi's are all Triangular(0,1,2). Since we
have a histogram of 5000 of these, it really ought to look like a triangular p.d.f. Thus,
the answer is (c).