Building Fast Search Engines
• Answers
• What is a good answer?
• Index design
or other techniques
• Searching within categories or groups to narrow a
search
How Search Engines Work
• Search engines work as follows:
• They retrieve (spider or crawl) documents from the Web
repository
• The collection is indexed to allow fast ranking to find
answers
• A web interface is provided for entering queries and
presenting answers
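
The indexing and ranking steps above can be sketched in a few lines. This is a minimal illustration, not the design of any particular engine: the function names `build_index` and `search`, the whitespace tokenisation, and the simple TF-IDF weighting are all assumptions made for the example, and the three documents are made up.

```python
import math
from collections import Counter, defaultdict


def build_index(docs):
    """Build an inverted index: term -> {doc_id: term frequency}."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term, tf in Counter(text.lower().split()).items():
            index[term][doc_id] = tf
    return index


def search(index, num_docs, query, k=10):
    """Return up to k doc ids, ranked by a simple TF-IDF sum (assumed weighting)."""
    scores = defaultdict(float)
    for term in query.lower().split():
        postings = index.get(term, {})
        if not postings:
            continue  # term does not occur in the collection
        idf = math.log(1 + num_docs / len(postings))
        for doc_id, tf in postings.items():
            scores[doc_id] += (1 + math.log(tf)) * idf
    # Highest score first; ties broken by doc id for a stable order.
    return sorted(scores, key=lambda d: (-scores[d], d))[:k]


# Toy collection standing in for crawled documents.
docs = {
    1: "fast search engines index the web",
    2: "compression makes the index smaller",
    3: "search engines rank answers to queries",
}
index = build_index(docs)
print(search(index, len(docs), "search index"))
```

Only the lists for the query terms are read at query time, which is why an index allows fast ranking without scanning the whole collection.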
• Document summarisation is used to present short
trade-off in time
• Documents are stored in a compressed format for the
same reason
• Different compression schemes are used for lists (which
[Figure: bar charts comparing Compressed and Uncompressed storage. One axis shows size as % of collection size (0 to 35); the other shows average query time in seconds (0 to 1).]
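
To make the idea of compressing index lists concrete, here is a small sketch of one widely used approach: store the gaps between sorted document numbers and code each gap in a variable number of bytes. The slides do not say which scheme is used, so the choice of variable-byte coding and the helper names (`to_gaps`, `from_gaps`, `vbyte_encode`, `vbyte_decode`) are assumptions for illustration only.

```python
def to_gaps(doc_ids):
    """Turn a sorted list of document numbers into differences (gaps)."""
    prev, gaps = 0, []
    for d in doc_ids:
        gaps.append(d - prev)
        prev = d
    return gaps


def from_gaps(gaps):
    """Invert to_gaps by taking a running sum."""
    total, doc_ids = 0, []
    for g in gaps:
        total += g
        doc_ids.append(total)
    return doc_ids


def vbyte_encode(numbers):
    """Code each non-negative integer in 7-bit chunks, low-order chunk first.

    The high bit is set only on the final byte of each number.
    """
    out = bytearray()
    for n in numbers:
        while n >= 128:
            out.append(n & 0x7F)
            n >>= 7
        out.append(n | 0x80)
    return bytes(out)


def vbyte_decode(data):
    """Decode a byte string produced by vbyte_encode."""
    numbers, n, shift = [], 0, 0
    for byte in data:
        if byte & 0x80:  # final byte of this number
            numbers.append(n | ((byte & 0x7F) << shift))
            n, shift = 0, 0
        else:
            n |= byte << shift
            shift += 7
    return numbers


postings = [3, 7, 150, 100000]  # made-up document numbers
encoded = vbyte_encode(to_gaps(postings))
assert from_gaps(vbyte_decode(encoded)) == postings
print(len(encoded), "bytes for", len(postings), "postings")
```

Small gaps fit in a single byte, so long lists for common terms shrink the most. Decoding costs some CPU time per query, which is weighed against reading fewer bytes from disk.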
Fast Search Engines...