0% found this document useful (0 votes)
20 views2 pages

Sample Question Bank-1

The document discusses various topics related to web mining including authoritative web pages, hyperlink mining, hub pages, web usage mining, stream data applications, synopsis data structures, reservoir sampling, random sampling, histograms, lossy counting algorithm, page rank algorithm, HITS algorithm, classifier ensemble approaches, and automatic classification of web documents.

Uploaded by

Suman Ghorai
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
20 views2 pages

Sample Question Bank-1

The document discusses various topics related to web mining including authoritative web pages, hyperlink mining, hub pages, web usage mining, stream data applications, synopsis data structures, reservoir sampling, random sampling, histograms, lossy counting algorithm, page rank algorithm, HITS algorithm, classifier ensemble approaches, and automatic classification of web documents.

Uploaded by

Suman Ghorai
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

1) What is meant by authoritative Web pages?

2) how can a search engine automatically identify authoritative Web pages for my topic?
3) how can we use hub pages to find authoritative pages?”
4) What is Web usage mining
5) What are stream data? Explain 3 applications of stream data?
6) Write the working of page rank algorithm for ranking the webpages?
9) Explain Synopsis data structure? What is the complexity of synopsis data structure?
10) Give name of synopsis data structure?
11) Explain Reservoir sampling with an example?
12) Illustrate Random sampling?
13) How is histogram is used to approximate the frequency distribution of element values in a data stream
14) Lossy counting algorithm?
15) Explain page rank? And show how page rank is given for a page?
17) Give the algorithm of HITS? What is its significance?
18) Classifier Ensemble Approach
19) Explain the design of Web Search engine?
20) Automatic classification of web documents?
21) Hub and autorotative web pages?

Major crawling strategies

Authorative web pages

Hyperlink minining

Hub
Explain Web usage mining with a frameowkr?

Write steps for Automatic classification of web documents in your own words?

Synopses

• In data streams an infrequent item may become frequent and vice versa
Lossy Counting Algorithm

Hoeffding Tree Algorithm

Classifier Ensemble Approach

Complex data type

Keyword based retrieval


Polysemy synoneme

Different types of web minin

Explain page rank


Explain HITS

Web Search engine

Major crawling strategies

Authorative web pages

Hyperlink minining

Hub
Explain Web usage mining with a frameowkr?

Write steps for Automatic classification of web documents in your own words?

You might also like