Web Data Mining Unit Wise Important Questions

Uploaded by

workiimeee.02

GTU Web Data Mining (116AT02) - Unit-wise Important Questions

Unit 1: Introduction & Association Rules

- What is Web Data Mining? Explain with an example.

- What is Market Basket Analysis?

- Differentiate between frequent items and rare items.

- What are the advantages and disadvantages of Association Rule Mining?

- Explain the Apriori algorithm for generating frequent itemsets.

- Apply Apriori to a transaction dataset (with given min support/confidence).

- Explain Association Rule Generation from frequent itemsets.

Unit 2: Sequential Pattern Mining

- What is a sequential pattern? How is it different from association rules?

- Explain the GSP (Generalized Sequential Pattern) algorithm with an example.

- Explain the PrefixSpan algorithm. How does it avoid candidate generation?

- Discuss the need for mining sequential patterns.

- Write a short note on mining with multiple minimum supports.

Unit 3: Information Retrieval & Preprocessing

- What are Phrase Queries and Proximity Queries?

- Explain Stemming and Lemmatization with examples.

- What is the Rocchio Method? Explain its use in relevance feedback.

- Explain the Statistical Language Model for Information Retrieval.

- What is Meta-search? Give an example.

- Explain Web Page Preprocessing steps: tokenization, stop-word removal, stemming.

- What is TF-IDF? How is it used in text mining?

Unit 4: Link Analysis & Web Crawling

- Explain the PageRank Algorithm with an example.

- Write the strengths and weaknesses of PageRank.

- What is the HITS Algorithm? How does it compute authority/hub values?

- Define and differentiate: Degree Prestige, Proximity Prestige, Rank Prestige.

- What are Co-citation and Bibliographic Coupling?


- Explain the Basic Crawler Algorithm.

- What are Preferential Crawlers?

- What are the Implementation Issues in Web Crawlers?

- Explain Crawler Ethics and Conflicts.

Unit 5: Opinion Mining & Web Usage Mining

- What is Opinion Mining? Explain with an example.

- List the applications of Opinion Mining.

- Differentiate between Sentiment Classification and Phrase-based Classification.

- What is Feature-based Opinion Mining? How is summarization performed?

- Explain Opinion Search and challenges in Opinion Spam Detection.

- What is Web Usage Mining? Explain its process.

- Explain Cluster Analysis and Visitor Segmentation.

- What is Sessionization in Web Usage Mining?

- Explain Data Fusion and Cleaning.
