0% found this document useful (0 votes)
29 views2 pages

April 2019

This document is a paper on the topic of Information Retrieval. It contains 5 questions with multiple subparts. Question 1 contains multiple choice and fill in the blank questions about basic Information Retrieval concepts. Question 2 asks to explain components of IR, its history, forms of spelling correction, and to draw an inverted index for a sample document collection. Questions 3-5 ask to further explain concepts relating to IR including Hubs and Authorities, cosine similarity, personalized search, collaborative filtering, question answering, cross-lingual retrieval, web graphs, search engine architecture, XML retrieval, web size measurement, and sponsored search.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
29 views2 pages

April 2019

This document is a paper on the topic of Information Retrieval. It contains 5 questions with multiple subparts. Question 1 contains multiple choice and fill in the blank questions about basic Information Retrieval concepts. Question 2 asks to explain components of IR, its history, forms of spelling correction, and to draw an inverted index for a sample document collection. Questions 3-5 ask to further explain concepts relating to IR including Hubs and Authorities, cosine similarity, personalized search, collaborative filtering, question answering, cross-lingual retrieval, web graphs, search engine architecture, XML retrieval, web size measurement, and sponsored search.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Paper / Subject Code: 87004 / Information Retrieval

(2 ½ Hours) [Total Marks: 75]

N.B: 1) All questions are compulsory.


2) Figures to the right indicate marks.
3) Illustrations, in-depth answers and diagrams will be appreciated.
4) Mixing of sub-questions is not allowed.

Q. 1 Attempt All (Each of 5Marks) (15M)


(a) Multiple Choice Questions (5M)
i) _______are indexed units in incidence matrix.
a. Terms b. Collection c. Information d. Data

ii) The number of documents in the collection that contain a term t is called as
________
a. Document Index dit b. Document frequency dft
c. Document Inverse dint d. Document Incidence Matrix dimt

iii) The standard way of quantifying the similarity between two documents d1 and d2 is
to compute the _________of their vector representations.
a. sine similarity b. cot similarity c. cosine similarity d. None

iv) CPM stands for ____________________


a. Cost per mil b. Cost per making
c. Cost per manage d. Cost per migrating

v) _______fraction of the returned results are relevant to the information need.


a. Proximity b. Posting Merge c. Posting list d. Precision

(b) Fill in the blanks (5M)


(in-links, Static , semistructured, Document Object Model, two)

i) IR is also used to facilitate ___________ search such as finding a document where


the title contains Java and the body contains threading.

ii) __________web pages are those whose content does not vary from one request for
that page to the next.

iii) Every web page is assigned __________ scores.

iv) The standard for accessing and processing XML documents is the XML _______.

v) The hyperlinks into a page as ___________.

75839 Page 1 of 2

C6B996C8DA4C6C30A8DCA5FE78393FE1
Paper / Subject Code: 87004 / Information Retrieval

(c) Short Answers- Define the following terms: (5M)


i) Edit distance
ii) Boolean retrieval model
iii) Cloaking
iv) Spam
v) Crawler

Q. 2 Attempt the following (Any THREE)(Each of 5Marks) (15M)


(a) Brief overview of Information retrieval.
(b) What are the components of Information retrieval? Explain with diagram.
(c) Brief the history of Information retrieval.
(d) List the forms of spelling correction in Information retrieval. Explain.
(e) Explain the architecture of open source engine framework.
(f) Draw the inverted index that would be built for the following document collection.
Doc 1 one fish, two fish
Doc 2 red fish, blue fish
Doc 3 one red bird

Q. 3 Attempt the following (Any THREE) (Each of 5Marks) (15M)


(a) Discuss Hubs and Authorities.
(b) Explain the concept of cosine similarity with example.
(c) What is Personalized search? State factors affecting it.
(d) Explain the concept of Collaborative filtering.
(e) What is Question answering? Explain.
(f) Give the meaning of cross lingual retrieval. Analyse its process.

Q. 4 Attempt the following (Any THREE) (Each of 5Marks) (15M)


(a) Explain the terms: Web, Web pages, Web graph with example.
(b) Discuss categories of user needs in web queries for query analysis.
(c) What are the basic building blocks of Search Engine Architecture? Explain.
(d) Give the challenges in XML retrieval.
(e) Write a note on Web Size Measurement.
(f) Write a note on sponsored search.

Q. 5 Attempt the following (Any THREE) (Each of 5Marks) (15M)


(a) Compute the Levenshtein edit distance between “GUMBO” and “GAMBOL”.
(b) Give the concept of wild card queries in IR.
(c) Define Page rank. How to compute page rank for a webpage? Give example.
(d) What is MapReduce? Explain its paradigm.
(e) Differentiate between Text Centric v/s Data Centric XML.

********************

75839 Page 2 of 2

C6B996C8DA4C6C30A8DCA5FE78393FE1

You might also like