0% found this document useful (0 votes)
68 views

Data Analytics

The document provides an important question bank for the subject of Data Analytics for the class of T.Y.B.Sc. CS Sem-VI. It contains long answer questions covering four units - Introduction to Data Analytics, Machine Learning Overview, Mining Frequent Patterns, Associations and Correlations, and Social Media and Text Analytics. The questions require explanations of key concepts in data analytics with examples.

Uploaded by

alooreprasad45
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
68 views

Data Analytics

The document provides an important question bank for the subject of Data Analytics for the class of T.Y.B.Sc. CS Sem-VI. It contains long answer questions covering four units - Introduction to Data Analytics, Machine Learning Overview, Mining Frequent Patterns, Associations and Correlations, and Social Media and Text Analytics. The questions require explanations of key concepts in data analytics with examples.

Uploaded by

alooreprasad45
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

Sarhad College of Arts, Commerce and

Science, Katraj, Pune


Academic Year- 2023-24
Department of Computer Science
Important Question Bank

Subject- Data Analytics Class- T.Y.B.Sc.CS Sem-VI

E
EG
Unit 1 Introduction to Data Analytics
Long Answer Questions
1. Define data science. What is its purpose? Explain in detail.
2. What is data analytics? Enlist its different roles. Also state its advantages and

LL
disadvantages.
3. With the help of a diagram describing the lifecycle of data analytics.
O
4. Explain four layers in the data analytics framework diagrammatically.
5. Differentiate between data analysis and data analytics.
C
6. What are the types of data analytics? Describe two of them in detail.
7. What is prescriptive analytics? Explain in detail.
AD

8. What exploratory analytics? What is its purpose? Explain with example.


9. Write a short note on: Mechanistic analytics.
10. What is a mathematical model? List its types. Explain two of them in detail.
H

11. What is a linear and nonlinear model? Compare them.


12. What is the baseline model? Enlist two of them in detail.
R

13. How to evaluate a model? Describe in detail.


SA

14. Write a short note on: Metrics for evaluating classifiers.


15. What is a confusion matrix? How to use it in data analytics? Explain
diagrammatically.
16. Define accuracy, precision, recall and f-score.
17. What is the ROC curve? How to implement it? Explain with example.
18. What is class imbalance? Describe in detail.
19. Write a short note on: Evaluating value prediction models.
Unit 2 Machine Learning Overview
Long Answer Questions
1. What is machine learning? States its advantages and disadvantages. Also
list its various applications.
2. What is deep learning? How does it work? Explain diagrammatically.
3. What is AI? What is its purpose? State its advantages and disadvantages.
4. With the help of a diagram describing the relationship between AI, ML and
DL.

E
5. List application of machine learning in data science.
6. With the help of a suitable diagram describe machine learning model.

EG
7. How to train and validate a model? Describe in detail.
8. What are the types of machine learning? Compare them.
9. What is supervised learning? How does it works? State its advantages and
disadvantages.

LL
10.What is k-NN? How does it works? Explain diagrammatically. Also state its
advantages and disadvantages.
11.What is a decision tree? How does it works? State its advantages and
disadvantages.
O
12.Explain Support Vector Machine (SVM) with the help of diagram.
C
13.Write a short note on: Naïve Bayes.
14. Describe unsupervised learning with diagram and advantages and
disadvantages, With the help of an example explain k-means clustering
AD

algorithm.
15.What is association rule mining? Describe with an example.
16. Explain polynomial regression diagrammatically.
17.What is semi-supervised machine learning? With the help of diagram
H

explain its basic idea. Also state its advantages and disadvantages.
18. What is a regression model? Explain linear regression with a diagram.
R

19.Define logistic regression with assumptions.


20.Write a short note on: Ensemble techniques.
SA

21.With the help of example explain concept of classification. Also list various
classification techniques.
22.What is a random forest? Describe diagrammatically.
23.What is clustering? How does it works? Explain with example.
24.Describe various clustering techniques. Describe two of them in short.
25.What is reinforcement learning? Explain diagrammatically. Also state its
advantages and disadvantages.
26.Differentiate between supervised, unsupervised, semi-supervised and
reinforcement machine learning.
Unit 3
Mining Frequent Patterns, Associations and Correlations
Long Answer Questions:
1. What is data mining? Explain with a diagram? Also state its advantages and
disadvantages.
2. Explain usage of Market Basket Analysis with example?

E
3. Explain Apriori algorithm in detail.

EG
4. What are frequent itemsets, closed itemsets, and association rules? Describe in
detail.
5. What is outlier analysis? Describe in detail.
6. What kind of patterns can be mined? Explain IN detail.

LL
7. How to mine following:
(i) Frequent Patterns. O
(ii) Associations.
(iii) Correlations.
C
8. What are different types of data? Explain in detail with appropriate examples.
9. What are different sources of data in data science? Describe in detail.
AD

10. Explain different data formats in brief.


11. What is meant by followings and Explain in reference of mining frequent
patterns:
H

(i) Predictive analysis.


(ii) Cluster analysis.
R

(iii) Outlier analysis.


SA

12. What is structured and unstructured data? Distinguish between them.


13. How to generate association rules from frequent itemsets? Explain in detail. 14.
With the help of an example describe FP growth algorithm. Also state its
advantages
and disadvantages.
15. Which techniques are used for improving efficiency of Apriori algorithm?
Describe two of them in detail.
16. Write a short note on: Class/concept description.
17. A database has five transactions. Let min sup =60%and min conf=80%.

TID Items Bought


T100 {M,O,N,K,E,Y}
T200 {D,O,N,K,,E,Y}
T300 {C,A,K,E}

E
T400 {D,U,C,K,Y}

EG
T500 {C,O,O,K,I,E}

Find all frequent itemsets using Apriori and FP-growth, respectively. Compare the

LL
efficiency of the two mining processes.
18. A database has six transactions. Let min-sup = 50% and min-conf = 75%. Find
O
all frequent itemsets using Apriori algorithm. List all the strong association rules.
C
TID List of Items
001 Pencil, Sharpener, Eraser, Chart papers, Sketch pen
AD

002 Chart papers, Charts, Glue sticks

003 Pencil, Glue stick, Eraser, Pen, Sketch pen


H

004 Oil pastels, Poster colors, Correction tape


R

005 Whitener, Pen, Pencil, Charts, Glue stick


SA

006 Color pencils, Crayons, Eraser, Pen

19. Consider the following transaction database:


transaction1 = (product1, product2, product7)
transaction2 = (product4, product5, product7)
transaction3 = (product6, product7, product8, product9)
transaction4 = (product1, product3, product4, product6, product7
Find the frequent product set using FP Algorithm with minimum support 50%.

Unit 4 Social Media and Text Analytics


Long Answer Questions.
1. What is social media? State its advantages and disadvantages.
2. What is text analytics? What is its purpose? Also states its tasks.

E
3. What is social media analytics? What is its purpose? List its benefits.

EG
4. Explain process of social media analytics diagrammatically.
5. Describe layers of social media analytics with the help of diagram.
6. What is a social network? List any four examples of it. Explain two of them in
short.

LL
7. What is social media data? List its types. Also state how to accessing social
media O
data in detail.
8. What is social network analysis? Define it? Describe in detail.
C
9. With the help of suitable diagram describe life cycle of social media analytics.
10. What is link prediction? Explain with example.
AD

11. What is community detection? What are its different methods? Explain four of
them in short.
12. What is influence maximization? Explain its framework diagrammatically.
H

13. What is expert finding? How to find an expert? Describe with example.
14. Write a short note on: Prediction of trust.
R

15. Explain the term distrust among individuals in detail.


SA

16. What is NLP? What is its purpose? Describe its phases with the help of
diagram.
17. What is text analytics? Explain in detail.
18. What is tokenization: How is it used in text analytics?
19. What is a bag of words? How to use NLP? Explain in detail.
20. What is Word Weighting (TF-IDF)? Describe in detail.
21. Explain n-gram with example.
22. What is stemming and lemmatization? How do they differ from each other?
23. Describe the term synonyms with respect to NLP.
24. Write a short note on: Parts of speech tagging.
25. What is sentiment analysis? Explain its classification?
26. What is text analytics? Explain its steps diagrammatically. Also states its
advantages, disadvantages and applications.
27. What is text categorization? Describe diagrammatically. Also list approaches.

E
28. What is text summarization? Explain its two types in detail.

EG
29. What is trend analytics? Describe its methods in detail.
30. Write a short note on: Challenges to social media analytics.

LL
O
C
AD
H
R
SA

You might also like