0% found this document useful (0 votes)
5 views4 pages

Da QB

The document is a question bank for B.Tech. II Year - I Semester Examinations at Malla Reddy University, focusing on Data Analytics. It includes a comprehensive list of questions covering various topics such as data preprocessing, exploratory data analysis, machine learning concepts, and regression techniques. Each question is assigned a mark value and categorized by unit, providing a structured approach to the subject matter.

Uploaded by

habibitayhabibti
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views4 pages

Da QB

The document is a question bank for B.Tech. II Year - I Semester Examinations at Malla Reddy University, focusing on Data Analytics. It includes a comprehensive list of questions covering various topics such as data preprocessing, exploratory data analysis, machine learning concepts, and regression techniques. Each question is assigned a mark value and categorized by unit, providing a structured approach to the subject matter.

Uploaded by

habibitayhabibti
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

MALLA REDDY UNIVERSITY

School of Engineering - AI&ML


B.Tech. II Year - I Semester Examinations
Question Bank (R22)

Subject Code: MR22-1CS0222 Subject Name: Data Analytics

Qno Question Marks UNIT


1 What is data analytics? What are different types of data analytics? 8 UNIT-I
2 Explain Data Preprocessing Techniques? 8 UNIT-I
3 Write notes on classification of Data and differentiate between 8 UNIT-I

them?
4 Describe the classification of Data analytics with example in detail. 8 UNIT-I

5 Explain Data acquisition and data collection methods? 8 UNIT-I


6 Apply the inter quartile range technique on a given data below and 8 UNIT-I
identify the outliers with source code :
5,7,10,15,19,21,21,22,22,23,23,23,23,23,24,24,24,24,25
7 Discuss the various steps involved in data preprocessing and their 8 UNIT-I
significance in ensuring the quality and reliability of the data used
for analysis.
8 Discuss the various techniques used for transforming data, 8 UNIT-I
including normalization, standardization, and log transformations.

9 Explain the necessity of data cleaning in the preprocessing phase. 8 UNIT-I


Discuss common issues encountered in raw datasets and elaborate
on techniques such as handling missing values, outlier detection,
and noise reduction.

10 Discuss the significance of feature engineering in data 8 UNIT-I


preprocessing. Explain how creating new features or modifying
existing ones can improve model performance.

11 Explain the various steps involved in exploratory data analysis with 8 UNIT-II
suitable examples.

12 Explain how to create basic plots (line plots, bar plots, scatter plots) 8 UNIT-II
using Matplotlib

13 What are the measures of central tendency and variability used in 8 UNIT-II
statistics? Explain how each measure can be used to describe a data
set?

14 In a box containing hundred screws, 10 are too short, 5 are too slim 8 UNIT-II
and 2 of these 15, are both slim and short. What are the conditional
probabilities of picking a short screw that is also slim as well as
picking slim screw that is also short.
15 Consider a discrete random variable X = Sum of two numbers 8 UNIT-II
thrown up by tossing two fair dies. Plot the PDF and CDF for this
random variable

16 Explain Label encoding and one hot encoding in Data 8 UNIT-II


Preprocessing with example?

17 Explain the concept of hypothesis testing. Discuss the key 8 UNIT-II


components, including null and alternative hypotheses, significance
level, and p-value. Provide a step-by-step guide on how hypothesis
testing is conducted.

18 Discuss the difference between Type I and Type II errors in 8 UNIT-II


hypothesis testing. Provide examples to illustrate each type of error
and explain their implications in decision-making.

19 Explore the role of data scaling and normalization in data 8 UNIT-II


preprocessing?

20 Discuss the key measures used in descriptive statistics and their 8 UNIT-II
roles in summarizing and presenting data.

21 Explain predictive analytics? 8 UNIT-III

22 Define Following 8 UNIT-III


a. Regression b. Classification c. Clustering d. Training Data
& Test Data e. Underfitting & Overfitting

23 Expalain supervised machine learning ? 8 UNIT-III

24 Expalain unsupervised machine learning ? 8 UNIT-III

25 What is prediction error? State and explain the prediction error in 8 UNIT-III
regression and classification with suitable example?

26 Compare regression, classification and clustering in machine 8 UNIT-III


learning along with suitable examples?

27 Distinguish between supervised and unsupervised learning with an 8 UNIT-III


example.

28 List the methods for Model evaluation metrics for classification? 8 UNIT-III
Explain each. How we can improve the performance of model.

29 Explain Supervised & Supervised machine learning with examples? 8 UNIT-III

30 Discuss the importance of metrics such as Mean Squared Error 8 UNIT-III


(MSE), Mean Absolute Error (MAE), and R-squared in assessing
the predictive accuracy of regression models.

31 Given data = {2, 3, 4, 5, 6, 7; 1, 5, 3, 6, 7, 8}. Compute the 8 UNIT-IV


principal component using PCA Algorithm
32 Discuss the concept of dimensionality reduction in PCA. 8 UNIT-IV

33 Explain the Confusion Matrix with respect to Machine Learning 8 UNIT-IV


Algorithms.

34 Define decision trees and explain their role in machine learning. 8 UNIT-IV
How do decision trees represent and model decision-making
processes?

35 Explain the concept of random forests and how they extend the 8 UNIT-IV
capabilities of decision trees?

36 Describe methods for evaluating the performance of decision trees 8 UNIT-IV


and random forests?

37 Explore the applications of decision trees and random forests in 8 UNIT-IV


machine learning.

38 Consider the following dataset with two-dimensional points: 8 UNIT-IV


{(2,3),(5,3),(8,3),(3,6),(6,6),(9,6),(4,9),(7,9),(10,9)}{(2,3),(5,3),
(8,3),(3,6),(6,6),(9,6),(4,9),(7,9),(10,9)}

Perform k-means clustering with k=3. Show the steps of the


algorithm, including the initial selection of centroids, assignment of
points to clusters, and updating centroids.

39 Discuss overfitting and underfitting situation in decision tree 8 UNIT-IV


learning.

40 Describe the step-by-step process of the k-means clustering 8 UNIT-IV


algorithm.

41 Discuss the assumptions underlying simple linear regression and 8 UNIT-V


their importance in the interpretation of results?

42 Explain the process of hypothesis testing in the context of simple 8 UNIT-V


linear regression. What are the null and alternative hypotheses?

43 Compare and contrast simple linear regression with multiple linear 8 UNIT-V
regression. When would one be preferred over the other?

44 What is the difference between linear and logistic regression with 8 UNIT-V
an example?

45 Define multiple regression and explain how it extends the concept 8 UNIT-V
of simple linear regression. What are the key differences between
simple linear regression and multiple regression? Provide a
conceptual overview.
46 Define Maximum Likelihood Estimation (MLE) and explain the 8 UNIT-V
basic concept behind it ?

47 Explain methods for evaluating the performance of linear regression 8 UNIT-V


models ?

48 Explain methods for evaluating the performance of logistic 8 UNIT-V


regression models ?

49 Explain Simple linear Regression & multiple Regression? 8 UNIT-V

50 a.In the context of regression, what is the role of dependent and 8 UNIT-V
independent variables?
b.Explain the interpretation of coefficients in a regression model?

You might also like