0% found this document useful (0 votes)
34 views

Experiment List For Data Mining and Big Data

The document outlines an experiment list for data mining and big data projects. It includes tasks like implementing an association rule generation algorithm, performing clustering algorithms on image pixel data, discussing how graph and web mining can identify influential online users, explaining document ranking methods like PageRank, writing about Hadoop and its use at Facebook, and exploring applications of big data in areas like business intelligence, civil engineering, and customer relationship optimization.

Uploaded by

Nitin Thokare
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
34 views

Experiment List For Data Mining and Big Data

The document outlines an experiment list for data mining and big data projects. It includes tasks like implementing an association rule generation algorithm, performing clustering algorithms on image pixel data, discussing how graph and web mining can identify influential online users, explaining document ranking methods like PageRank, writing about Hadoop and its use at Facebook, and exploring applications of big data in areas like business intelligence, civil engineering, and customer relationship optimization.

Uploaded by

Nitin Thokare
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

Experiment list for Data Mining and Big Data:

1. Implement Association rule generation algorithm with following details:


a. Input: Transactions with items, min_supp and min_conf
b. Output: All association rules satisfying min_supp and min_conf
Submit implementation code with execution on sample test data.
2. Create an image as follows with your own initials instead of NDT and 2014 instead
of 2011.

Do sampling on this image pixels and take it as sample set (exclude background white
pixels). Apply following clustering algorithms on this sample set with number of clusters
as 2, 3, 4 and 7:
1) K-means clustering
2) DBSCAN
Submit all clustering results.
3. How graph and web mining can be used to find out influential people in various online
social networks?
4. What are various methods to rank documents (retrieved results) in information
retrieval? Explain in detail the Googles PageRank algorithm, include following topics:
a. Basic model (equation) used to find PageRank along with its proof.
b. Difficulties in using basic model.
c. Modifications required to overcome difficulties in basic model.
5. Write a brief note on Hadoop. Discuss in detail the usage of Hadoop at Facebook.
6. Explain in detail the application of Big-data in following areas:
a. Enterprise Business Intelligence (BI)
b. Civil Engineering
c. Customer Relationship Optimization

You might also like