Question Bank Fds
Question Bank Fds
Loyola Nagar, Thovalai – Rajavoor Road, Thovalai Post – 629 302, Kanyakumari District, Tamil Nadu.
Mob : 9244775545 / 55 / 44 / 11, 94431 24568 Website : www.lites.edu.in
e-mail – [email protected] , [email protected], [email protected], [email protected]
UNIT 1
PART A( 2 Mark)
1 What is data Cleansing?
2 List the V’s of big data
3 What is the role of data science in business, medical research, healthcare, education,
social media, technology and financial institutions?
4 Mention any 4 common errors that occur in data
5 How can you handle missing values in dataset?
6 List the facets of data with example
7 Briefly explain the architecture of data warehousing
8 What is project charter?
9 Mention the significance of setting goals in data science project
10 Identify the components of data science
PART B( 14 Mark)
UNIT 2
PART A (2 Mark)
1 What is frequency distribution?
2 what is grouped frequency distribution?
3 Define percentile ranks?
4 What is frequency polygon?
5 What is interquartile range (IQR)?
6 Differentiate between bar graph and a histogram
7 What is negatively skewed distribution?
8 How will you convert a z-score to original score?
9 What is a normal curve?
10 Compare discrete and continuous variables
PART B (14 Mark)
4 Determine the values of the range and the IQR for the following sets of data.
(a) Retirement ages: 60, 63, 45, 63, 65, 70, 55, 63, 60, 65, 63
(b) Residence changes: 1, 3, 4, 1, 0, 2, 5, 8, 0, 2, 3, 4, 7, 11, 0, 2, 3, 4
PART C (16 Mark)
1 Suppose that the burning times of electric light bulbs approximate a normal curve
with a mean of 1200 hours and a standard deviation of 120 hours. What proportion of
lights burn for
(a)less than 960 hours?
(b)more than 1500 hours?
(c)within 50 hours of the mean?
(d)between 1300 and 1400 hours?
UNIT 3
PART A (2 Mark)
1 What is a correlation coefficient?
2 When does regression fallacy occur?
3 What are the key properties of correlation coefficient r ?
4 What is interpretation of r2 ?
5 What is linear relationship and non-linear relationship?
6 What is causation?
7 What is curvilinear relationship
8 What is restricted range?
9 Compare correlation and regression
10 Define Regression.
PART B (14 Mark)
1 a) Categorize different types of Relationship using Scatter plots with example
b) Compute the formula for correlation coefficient and explain it with an example
2 Explain about regression and explain Least Squares Regression Line with an example
UNIT 4
PART A (2 Mark)
1 What is NumPy in python used for?
2 Write the output for the following numpy code?
(i) np.array([3,14,4,2,3])
(ii) np.array([1,2,3,4],dtype=’float32’)
(iv) np.zeros(10,dtype=int)
(vi) np.full((3,5),3.14)
(vii) np.arrange(0,20,20)
(viii) np.linespace(0,1,50
3 How a pandas data frame can be constructed?
4 List is mutable. Justify this with suitable example
5 Write the list of aggregate functions of numpy
6 What is universal function?
7 Identify the details maintained by python to store an integer
8 Write python code to create 1D,2D and 3D numpy arrays.
9 What is pivot table?
10 How missing data can be handled in python?
UNIT 5