Data Analysis Assignment Help
Data Analysis Assignment Help
y = ax + b
where x and are independent random variables with means equal to 0.0
and variances equal to 1.0 and a and b are specified constants.
Solution:
Solution:
a2 + b2 = 2
Correl(x,y) = 0.5 = Cov(x,y)/(Std[x]Std[y])
Cov(x,y) = E[(x- x )(y- y )]
y- y = a(x- x )+b(- )
so: Cov(x,y) = E[a(x- x )2+b(x- x ) (- )] = aVar[x] = a
→ a = .5* 2 = 0.707 ; b = 1.22
Problem 2
Solution:
This estimator is both unbiased and consistent. Estimators that were not
unbiased and consistent were also given credit (since the problem does
not specify that they should be), but only estimators that actually
estimate x10 legitimately were given full credit.
Solution:
Solution:
Because the estimator in this case is x10 and not mx, the approximation
SD( aˆ ) ~ SD(x)/ N
CANNOT be used!
Problem 3
Solution:
Using this expression (as the problem states), a good estimator is:
aˆ 3sx
Solution:
where F-1 is the inverse of the unit normal CDF (large sample
assumption)
Solution:
function test(actual_data)
% This program will simulate replicates to determine the
% SD[ahat], which is the unknown quantity in Part c.
%
% Actual data vector is input as a function argument
N=length(actual_data) ; % number of data points per sample
nrep=1000;
ahat=sqrt(3)*std(actual_data) ; % estimate of ahat from the
sx of the data.
% Assume unknown true value of uniform distribution limit
% a is equal to ahat
% generate nrep replicates from unifrnd
sim_data=unifrnd(-ahat,ahat,N,nrep);
% compute estimate for each replicate
simahats=sqrt(3)*std(sim_data);
% find standard deviation over estimate
replicates
sdahat=std(simahats);
Problem 4
With PEs: 66 41 77 80 52 98 99 74 81 78
Using a large sample assumption and this data set, perform a two-sided
test of the hypothesis that the mean salaries of engineers with and
without PEs are the same. Summarize your results by reporting the p
value for the test. When picking the two groups of engineers how could
you minimize the impact of factors other than PE registration on your
conclusions? You may wish to
use the unit normal CDF plot provided at the end of this quiz.
Solution:
P=0.6