Application NLP
Application NLP
Information Retrieval
Presentation Outline
Query
IR
Retrieval syste
Document Answer list
collection m
5
Basics of IR Systems
Basics of IR Systems (contd…)
Indexing the collection of documents.
Indexing involves:
Tokenizationof string
Removing frequent words
Stemming (removing ing, ed, etc)