The document outlines an experiment list for data mining and big data projects. It includes tasks like implementing an association rule generation algorithm, performing clustering algorithms on image pixel data, discussing how graph and web mining can identify influential online users, explaining document ranking methods like PageRank, writing about Hadoop and its use at Facebook, and exploring applications of big data in areas like business intelligence, civil engineering, and customer relationship optimization.
The document outlines an experiment list for data mining and big data projects. It includes tasks like implementing an association rule generation algorithm, performing clustering algorithms on image pixel data, discussing how graph and web mining can identify influential online users, explaining document ranking methods like PageRank, writing about Hadoop and its use at Facebook, and exploring applications of big data in areas like business intelligence, civil engineering, and customer relationship optimization.
1. Implement Association rule generation algorithm with following details:
a. Input: Transactions with items, min_supp and min_conf b. Output: All association rules satisfying min_supp and min_conf Submit implementation code with execution on sample test data. 2. Create an image as follows with your own initials instead of NDT and 2014 instead of 2011.
Do sampling on this image pixels and take it as sample set (exclude background white pixels). Apply following clustering algorithms on this sample set with number of clusters as 2, 3, 4 and 7: 1) K-means clustering 2) DBSCAN Submit all clustering results. 3. How graph and web mining can be used to find out influential people in various online social networks? 4. What are various methods to rank documents (retrieved results) in information retrieval? Explain in detail the Googles PageRank algorithm, include following topics: a. Basic model (equation) used to find PageRank along with its proof. b. Difficulties in using basic model. c. Modifications required to overcome difficulties in basic model. 5. Write a brief note on Hadoop. Discuss in detail the usage of Hadoop at Facebook. 6. Explain in detail the application of Big-data in following areas: a. Enterprise Business Intelligence (BI) b. Civil Engineering c. Customer Relationship Optimization
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB