Dat QB
Dat QB
Unit 1
1. What is purpose of Using C in Statistical Computing? Or what are the features of C programming
Language that can be utilized in data analysis.
2. Write a C program to the likelihood that a student shares first persons birthday & the likelihood that
any 2 students share a birthday.
3. Write a note on declaring types & initializing in C.
4. How is a function defined in C & explain call by value.
5. Explain the concept of pointers & usage in C.
6. Compare call by address & call by value.
7. Explain memory allocation in C.
8. Explain malloc & calloc with an example.
9. Using pointer as an array to fill it with square numbers.
10. Explain the following string functions in C
Strlen,strncpy,strncat,snprintf,asprintf,strcmp
11. Explain assert macro with an example.
12. How to test functions using assert.
13. Explain select,from,where clause & distinct keyword.
14. Explain use of group by & having keyword.
15. Write a note on aggregate functions.
16. How to carry out sorting in databases.
17. How to limit the no. of records in the display.
18. Write a note on DDL commands in databases.
19. How to delete a table from databases. What is the use of apop_table_exists function from apophenia
library?
20. How to insert, delete & update records in table.
21. Write a note on joins.
22. Explain subqueries in detail.
23. Explain apophenia functions used to fold queries into C code.
24. Explain the following apophenia functions
apop_db_to_crosstab
apop_db_merge
apop_crosstab_to_db
Unit 2
Unit 3
Part II statistics
Unit 5
1. Explain the purpose of log likelihood test. Also state the log likelihood function,score S & information
matrix.
2. State the cramer Rao lowerbound lemma.
3. State the Neyman Pearson Lemma.
4. Discuss the methods of finding optima. OR
5. Explain the Nelder-Mead Simplex method of finding optima.
6. Explain the conjugate gradient method of finding optima.
7. Explain the root finding method of finding optima.
8. Explain the simulated annealing method of finding optima.
9. Compare global & local optima.
10. Explain the use of apop_estimate_restart function.
11. What are Monte carlo methods ?
12. List the gsl function to generate random numbers.
13. How to plot a distribution using random numbers.Explain with an example.
14. Explain the use of apop_histogram function.
15. Write a note on bootstrapping method estimating variance & standard error.
16. Write a note on Markov chain Monte carlo.
17. Explain the testing of bimodality.
Problems:
P #2.A bomber wants to destroy a bridge. Two bombs are sufficient to destroy
it. If 4 bombs are dropped, what is the probability that it is destroyed, if
the chance of a bomb hitting the target is 0.4?
P #3.If the chance that any of the 5 telephone lines is busy at any instant is 0.1,
find the probability that all the lines are busy. Also find the probability
that not more than 3 lines are busy.
P #5.The probability that a seed from a contain lot will germinate is 0.7. If 8
seeds are sown, find the probability that at least 6 of them will germinate.
P #6.The probability that India wins a cricket test match against England is
given to be 1/3. If India and England play 3 test matches, what is the
probability that – (i) India will lose all the 3 test matches, (ii) India will
win at least one test match?
P #8. A box contains 100 C.D.’s, 20 of which are defective , 10 are selected for
inspection. Find the probability that (i) all 10 are defectives (ii) al 10 are good
(iii) at least one is defective (iv) at most 2 are defectives
P# 9. An industrial chemical that will retard the spread of fire in paint has been
developed, The local sales representative has estimated, from past
experience that 48% of the sales calls result in an order.
a) If eight sales calls are made in a day, what is the probability of receiving
exactly six orders
b) If four sales calls are made before lunch, what is the probability that one
or fewer results in an order?
P# 12. A computer repair person is “beeped “ each time there is a call for service.
The number of beeps per hour is known to occur in accordance with a
Poisson distribution with a mean of two per hour. Find the probability of three
beeps in the next hour? ( Given e- 2 = 0.1353 )
P# 13.A local electrical appliances shop has found from experience that the demand
for tube lights is distributed as Poisson with a mean of four tube per week. If
the shop keeps six tubes during particular week, what is the probability that the
demand will exceed the supply during that week ?(given e-4=0.0183)
P #17. If the heights of 1000 soldiers in a regiment are normally distributed with a
mean of 172 cm. and s.d. of 5 cm. , how many soldiers have heights greater
than 180 cm
P #18. The income distribution of a group of 10000 persons was found to be normal
with mean of Rs. 7500 per month and s.d. of Rs. 500 per month. What
percentage of this group had income (i) exceeding Rs. 6680 (ii) not more
than Rs. 7000
P #20. IQ scores are normally distributed throughout society , with mean 100 and
s.d. 15. (a) A person with an IQ of 140 or higher is called a “genius.” What
proportion of society is in genius category ?
(b) What proportion of society will miss the genius category by 5 or less
points ?
(c) Suppose that an IQ of 110 or higher is required to make it through an
accredited college or university. What proportion could be eliminated for
completing a higher education by a low IQ score ?
multinomial distribution
The following formula gives the probability of obtaining a specific set of outcomes when there are
three possible outcomes for each event:
where
p is the probability,
n is the total number of events
n1 is the number of times Outcome 1 occurs,
n2 is the number of times Outcome 2 occurs,
n3 is the number of times Outcome 3 occurs,
p1 is the probability of Outcome 1
p2 is the probability of Outcome 2, and
p3 is the probability of Outcome 3.
Hypergeometric distribution
Problems on Z test
1. A random sample of 100 bundles gives a mean of 8.5 tons and standard deviation 4 tons.Can the
sample be regarded as drawn from a population with mean 7 tons?test this at level of
significance 5%.
2. The average life of an Indian is 70 years.A random sample of 100 Indians has an average life of
71.8 years with standard deviation of 7.8 years.test the hypothesis.
Problems on t test
formula: