Data Analytics
Data Analytics
E
EG
Unit 1 Introduction to Data Analytics
Long Answer Questions
1. Define data science. What is its purpose? Explain in detail.
2. What is data analytics? Enlist its different roles. Also state its advantages and
LL
disadvantages.
3. With the help of a diagram describing the lifecycle of data analytics.
O
4. Explain four layers in the data analytics framework diagrammatically.
5. Differentiate between data analysis and data analytics.
C
6. What are the types of data analytics? Describe two of them in detail.
7. What is prescriptive analytics? Explain in detail.
AD
E
5. List application of machine learning in data science.
6. With the help of a suitable diagram describe machine learning model.
EG
7. How to train and validate a model? Describe in detail.
8. What are the types of machine learning? Compare them.
9. What is supervised learning? How does it works? State its advantages and
disadvantages.
LL
10.What is k-NN? How does it works? Explain diagrammatically. Also state its
advantages and disadvantages.
11.What is a decision tree? How does it works? State its advantages and
disadvantages.
O
12.Explain Support Vector Machine (SVM) with the help of diagram.
C
13.Write a short note on: Naïve Bayes.
14. Describe unsupervised learning with diagram and advantages and
disadvantages, With the help of an example explain k-means clustering
AD
algorithm.
15.What is association rule mining? Describe with an example.
16. Explain polynomial regression diagrammatically.
17.What is semi-supervised machine learning? With the help of diagram
H
explain its basic idea. Also state its advantages and disadvantages.
18. What is a regression model? Explain linear regression with a diagram.
R
21.With the help of example explain concept of classification. Also list various
classification techniques.
22.What is a random forest? Describe diagrammatically.
23.What is clustering? How does it works? Explain with example.
24.Describe various clustering techniques. Describe two of them in short.
25.What is reinforcement learning? Explain diagrammatically. Also state its
advantages and disadvantages.
26.Differentiate between supervised, unsupervised, semi-supervised and
reinforcement machine learning.
Unit 3
Mining Frequent Patterns, Associations and Correlations
Long Answer Questions:
1. What is data mining? Explain with a diagram? Also state its advantages and
disadvantages.
2. Explain usage of Market Basket Analysis with example?
E
3. Explain Apriori algorithm in detail.
EG
4. What are frequent itemsets, closed itemsets, and association rules? Describe in
detail.
5. What is outlier analysis? Describe in detail.
6. What kind of patterns can be mined? Explain IN detail.
LL
7. How to mine following:
(i) Frequent Patterns. O
(ii) Associations.
(iii) Correlations.
C
8. What are different types of data? Explain in detail with appropriate examples.
9. What are different sources of data in data science? Describe in detail.
AD
E
T400 {D,U,C,K,Y}
EG
T500 {C,O,O,K,I,E}
Find all frequent itemsets using Apriori and FP-growth, respectively. Compare the
LL
efficiency of the two mining processes.
18. A database has six transactions. Let min-sup = 50% and min-conf = 75%. Find
O
all frequent itemsets using Apriori algorithm. List all the strong association rules.
C
TID List of Items
001 Pencil, Sharpener, Eraser, Chart papers, Sketch pen
AD
E
3. What is social media analytics? What is its purpose? List its benefits.
EG
4. Explain process of social media analytics diagrammatically.
5. Describe layers of social media analytics with the help of diagram.
6. What is a social network? List any four examples of it. Explain two of them in
short.
LL
7. What is social media data? List its types. Also state how to accessing social
media O
data in detail.
8. What is social network analysis? Define it? Describe in detail.
C
9. With the help of suitable diagram describe life cycle of social media analytics.
10. What is link prediction? Explain with example.
AD
11. What is community detection? What are its different methods? Explain four of
them in short.
12. What is influence maximization? Explain its framework diagrammatically.
H
13. What is expert finding? How to find an expert? Describe with example.
14. Write a short note on: Prediction of trust.
R
16. What is NLP? What is its purpose? Describe its phases with the help of
diagram.
17. What is text analytics? Explain in detail.
18. What is tokenization: How is it used in text analytics?
19. What is a bag of words? How to use NLP? Explain in detail.
20. What is Word Weighting (TF-IDF)? Describe in detail.
21. Explain n-gram with example.
22. What is stemming and lemmatization? How do they differ from each other?
23. Describe the term synonyms with respect to NLP.
24. Write a short note on: Parts of speech tagging.
25. What is sentiment analysis? Explain its classification?
26. What is text analytics? Explain its steps diagrammatically. Also states its
advantages, disadvantages and applications.
27. What is text categorization? Describe diagrammatically. Also list approaches.
E
28. What is text summarization? Explain its two types in detail.
EG
29. What is trend analytics? Describe its methods in detail.
30. Write a short note on: Challenges to social media analytics.
LL
O
C
AD
H
R
SA