0% found this document useful (0 votes)
13 views9 pages

IRS UNITS-1,2,3 Objective Type Questions

The document contains a series of objective type questions and fill-in-the-blank exercises related to Information Retrieval Systems (IRS) and indexing techniques. It covers various concepts such as precision, recall, data structures, and algorithms used in information retrieval. The questions are organized into different units, each focusing on specific aspects of IRS and indexing methodologies.

Uploaded by

fastestbhau18
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views9 pages

IRS UNITS-1,2,3 Objective Type Questions

The document contains a series of objective type questions and fill-in-the-blank exercises related to Information Retrieval Systems (IRS) and indexing techniques. It covers various concepts such as precision, recall, data structures, and algorithms used in information retrieval. The questions are organized into different units, each focusing on specific aspects of IRS and indexing methodologies.

Uploaded by

fastestbhau18
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 9

OBJECTIVE TYPE QUESTIONS

UNIT-1
1. The ability to retrieve top-ranked documents that are mostly relevant
a. Recall b. Precision c. Ranking d. Zoning
Ans: b
2. ______ will follows the Structural Data format
a. IR b. DBMS c. Both d. None of the Above
Ans: b
3. ______ will follows the Unstructured Data format
a. IR b. DBMS c. Both d. None of the Above
Ans: a
4. Mapping between users’ specified need and the items in the IR systems is
done by _________ Capability
a. Search b. Browse c. Miscellaneous d. All of the Above
Ans:a
5. User can retrieve the Information as per the needs in IRS using _________
Capabilities.
a. Search b. Browse c. Miscellaneous d. All of the Above
Ans:d
6. The general objective of an IR system is
a. to minimize the overhead of a user
b. locating needed information
c. to maximize the overhead of a user
d. Both a & b
Ans: d
7. The Retrieved Results done by the user, the IRS will produce the results in
____no. of ways.
a. 2 b. 3 c. 4 d. 5
Ans: c
8. Precision Ranges from________
a. 0-1 b. 1-1 c. 1-0 d. Any Range
Ans: a
9. Recall Ranges from _______
a. 0-1 b. 1-1 c. 1-0 d. Any Range
Ans: c
10. Identification of processing tokens in IRS is in ______ level
a. Item Normalization
b. Selective Dissemination of Information
c. Archival Document Database Search
d. Index Database Search + Automatic File Build Process
Ans: a
11. Logical restructuring of items is ___________
a. Ranking b. Zoning c. Classification d. None
Ans: b
12. The capability to create private and public index files is frequently
implemented via a
a. structured Database Management System
b. Information Retrieval Systems
c. Digital Library
d. Data Warehouse
Ans: a
13. In the__________ process, the user can logically store an item in a file along
with additional index terms and descriptive text.
a. Zoning
b. Index
c. Classification
d. Ranking
Ans : b
14. _________ Algorithm Save system resources by eliminating from the set of
searchable processing tokens those have little value to the search
a. Stemming
b. Identify Processing Tokens
c. Stop Algorithm
d. Characterize Tokens
Ans:c
15. _______Provides the capability to dynamically compare newly received items
in the information system against standing statements of interest of users and
deliver the item to those users whose statement of interest matches the
contents of the items.
a. Item Normalization
b. Selective Dissemination of Information
c. Archival Document Database Search
d. Index Database Search + Automatic File Build Process
Ans: b
16. To assist the users in generating indexes, the system provides a process called
________
a. Item Normalization
b. Selective Dissemination of Information
c. Automatic File Build
d. None of the Above
Ans: c
17. ________ will restrict the distance allowed within an item between two search
terms.
a. Boolean Operators
b. Proximity
c. Contiguous Word Phrase
d. Term Masking
Ans:b
18. “United States of America” is an Example for
a. Boolean Operators
b. Proximity
c. Contiguous Word Phrase
d. Term Masking
Ans: c
19. Natural Language Queries is
a. Decrease recall/ Improve precision
b. Improve recall/ Improve precision
c. Decrease recall/Decrease precision
d. Improve recall/Decrease precision
Ans: d
20. The capability to name a query and store it to be retrieved and executed during
a later user session is called ________
a. Natural Language Queries
b. Proximity
c. Contiguous Word Phrase
d. Canned or Stored queries
Ans : d
21. Rather than typing in a complete new query, the results of the previous search
can be used as a constraining list to create a new query that is applied against
it is called _________
a. Canned or Stored queries
b. Proximity
c. Iterative Search
d. Canned or Stored queries
Ans: c
22. Term masking is useful when applied to_________
a. words and work for finding ranges of numbers of numeric dates
b. words, but does not work for finding ranges of numbers of numeric dates
c. does not work for finding words and ranges of numbers of numeric dates
d. finding ranges of numbers of numeric dates but does not work for words
Ans : b
23. Applications of information retrieval are
a. Personal b. Educational c. Career d. All of the above
Ans :d
24. ______ is used to mapping between a user’s specified need and the items in the IR
systems that will answer that need.
a. Search Capabilities b. Browse Capabilities
c. Miscellaneous Capabilities d. All of the Above
Ans: a
25.______ search is a frequently a search returns a Hit file containing many more
items than the user want to review
a. Item Search b. Data Search c. Iterative Search D. All
Ans: c
26. The capability to name a query and store it to be retrieved and executed during a
later user session is called________
a. User Query b. Ask Query c. Search Query d. Canned Query
Ans : d
27. Term masking is useful when applied to words, but does not work for finding
ranges of numbers of numeric dates.
a. Some times May be b. Always True
c. Never True d. None of the Above
Ans: b
28. _______ Restrict the distance allowed within an item between two search terms.
a. Proximity b. Term Masking c. Boolean Logic d. Fuzzy Searches
Ans: a
29. Boolean Operators in Search will improve_________
a. Recall b. Precision c. Both d. None
Ans: b
30. Fuzzy Searches will improve______
a. Recall b. Precision c. Both d. None
Ans: a
31. ________ the user quickly focuses on the potentially relevant parts of the text to
scan for item relevance.
a. Highlighting b. Zoning c. Ranking d. All
Ans: a
32. WAIS is Wide Area Information servers.

OBJECTIVE TYPE QUESTIONS


UNIT-2

1. __________Represent the concepts within an item to facilitate the user are finding
relevant information.
a. IRS b. Indexing c. DBMS d. Data Mining

Ans: b

2. The full text searchable data structures for items in the Document File provides a new
Class of indexing called _____
a. full document indexing b. total document indexing c. document indexing d.
None Ans:b

3._____define what level of detail the subject index will contain.

a. Scope of the indexing b. Scope of the Data Base


c. Scope of the Information d. Scope of the Text
Ans : a
4. _______ is the extent to which the different concepts in the item are indexed.
a. Scope b. Specificity c. Exhaustivity d. All
Ans: c
5.________ is the preciseness of the index terms used in indexing
a. Scope b. Specificity c. Exhaustivity d. All
Ans: b
6. Low exhaustivity has an adverse effect on ____________
a. Precision b. Recall c. Both d. None
Ans: c
7. ______ is an attempt is made to place a value on the index term’s representation of its
associated concept in the document
a. Weighted Automatic Indexing b. Un weighted Automatic Indexing
c. Both d. None
Ans: a
8. The process of creating term linkages at index creating time is called
a. Post Coordination b. pre-coordination c. Specificity d. Exhaustivity
Ans: b
9. ______ provides searchers with ways of finding morphological variants of search
terms.
a. pre-coordination b. STOP Algorithm c. Stemming d. All
Ans: c
10. Stemming Improves________
a. Precision b. Recall c. Both d. None
Ans: b
11. _______ determines a canonical set of concepts based on a test set of terms and uses
them as a basis for indexing all items.
a. Indexing by Term b. Indexing By Concept c. Multimedia Indexing d. All
Ans:b
12. ____ is coordinating terms at search time by ANDing index terms together, which only
finds indexes that have all of the search term.
a. Post Coordination b. pre-coordination c. Specificity d. Exhaustivity
Ans: a
13. Stemming Causes the problem for Natural Language Processing.
a. True b. False c. Not Determined d. Not Related
Ans: a
14. *<X> Stem ends with letter________
a. Any b. X c. All d. None
Ans: b
15. _______ System uses the stemming Technique K-Stem.
a. CONVECTICS b. INFORMIX c. INQUERY d. All
Ans : c
16. K-Stem Algorithm uses ____ no. of data files to control and limit the stemming process.
a. 7 b. 5 c. 4 d. 6
Ans: d
17. ______ is the most Common Data Structure used in both database and IRS.
a. Inverted File Structure b. N-Gram Data Structure c. PAT DS d. All
Ans: a
18._______ will provide the optimum performance in searching large databases.
a. Inversion List d. N-Grams c. sistring d. signature
Ans: a
19._______ data Structure are used to retrieve the information for continuous text.
a. N-Gram b. PAT c. signature d. Both a&b
Ans: d
20. The substring in a PAT DS is ______
a. sstring b.sistring c.substring d.sting-sub
Ans: b
21. Fuzzy Searches are easy to implement using PAT DS.
a. True b. False c. Not Determined d. Not Related

Ans: b
22. _______ DS eliminates the majority of items that are not related to a query.
a. Inverted File b. N-Gram c. PAT DS d. Signature
Ans: d

Fill in the Blanks


1.Indexing is the oldest technique for identifying the contents of item to assist in their
retrieval.

2.Statistical techniques Calculation of weights use statistic information such as the


frequency of Occurrence of words and their distributions in the searchable DB

3.Stemming can be used to reduce the size of index files.

4.Automatic indexing is the capability to automatically determine the index terms to be


assigned to an Item.

5. sistring is semi-infinite string

6.Over-stemming leads to the conflated retrieval of non-relevant documents

7.Under-stemming Prevent related terms from being conflated relevant documents will not
be retrieved

8.Specificity is the preciseness of the index terms used in indexing

9.Total document indexing (uncontrolled vocabulary) is Fast indexing but difficult search
process.

10.The Process of link index terms together in a single index for a particular concept is called
Pre-Coordination and Linkages

11.Inverted File Structure is the most Common Data Structure used in both database and
IRS.
OBJECTIVE TYPE BITS
UNIT-3

1. is the process of analyzing an item to extract the information to be


keptpermanently in an index.
a. Class Indexing b. Automatic Indexing c. Manual Indexing d.
AnyAns: b
2. is used mostly in commercial systems.
a. Statistical b. Natural Language c. Concept d. Hypertext
LinkagesAns: a
3. indexing stores the information that are used in calculating a probability
thata particular item satisfies a particular query.

a. Probalistic b. Bayesian c .vector space d. neural net. Ans: a

4. approaches store information used in generating a relative confedence


level ofan item relevance to a query.
a. Bayesian b. vector space c. both d.
noneAns: c
5. are dynamic learning structures that are discussed under concept indexing
wherethey are used to determine concept classes.
a. Probalistic b. Bayesian c .vector space d. neural net. Ans: d
6. indexing uses words with in an item to correlate to concepts discussed in the
item.
a. Statistical b. Natural Language c. Concept d. Hypertext
LinkagesAns: c
7. approach ia based upon direct application of the theory of probability to IRS.
a. Probabilistic b. Natural Language c. Concept d. Hypertext
LinkagesAns: a
8. produces the efficient results when data is retrieving from multiple databases.
a. Probabilistic b. Natural Language c. Concept d. Hypertext
LinkagesAns: a
9. processing is used to semantic information in addition to statistical
informationto enhance the indexing of the item.
a. Probabilistic b. Natural Language c. Concept d. Hypertext
LinkagesAns: b
10. Tagged Text Parser structure allows for identification of potential term phrases
basedupon identification.
a. verb b. noun c. adjective d.
allAns: b
11. processing will use DR-LINK System.
a. Probabilistic b. Natural Language c. Concept d. Hypertext
LinkagesAns: b
12. system attempts to introduce a higher level of abstraction indexing on top
of thestatistical processes.
a. Probabilistic b. Natural Language c. Concept d. Hypertext
LinkagesAns: b
13. indexing is a statistical technique whose goal is to determine a
canonicalrepresentation of concept.
a. Probabilistic b. Natural Language c. Concept d. Hypertext
LinkagesAns:c
14. techniques have very powerful representation.
a. Binary b. Vector c. Both c.
NoneAns : b
15. pages at each Internet site are indexed automatically.
a. Automatically generated b. manually generated c. Crawlers d.
AllAns: a
16. In users define search terms, and it goes to various sites searching for
thedesired information.
a. Automatically generated b. manually generated c. Crawlers d.
AllAns: c
17. is the example for WebCrawler’s
a. WebCrawler’s b. Open Text c. Path Finder d.
AllAns: d

Fill in the blanks

1. Term Frequency TFij is the frequency of occurrence of a term Ti in


adocument Dj.
2. Total Term Frequency TTFi is the frequency of occurrence of a term Ti in
the entire collection.
3. Document Frequency DFi is the number of unique documents in
thecollection that contain a term Ti..
4. Tagged Text Parser structure allows for identification of potential term
phrasesbased upon Noun identification.
5. Automatic Indexing is the process of analyzing an item to extract
theinformation to be kept permanently in an index.
6. Manually generated (e.g. Yahoo!) pages are indexed manually into a
linkedhierarchy(an “index”). Users browse in the hierarchy by following
links.
7. Automatically generated (e.g. Alta Vista) pages at each Internet site
areindexed automatically (creating a “searchable data structure”).
8. Automatically generated structures are used for querying, rather
thanbrowsing.
9. Crawlers (e.g. WebCrawler) No a priori indexing.
10. Crawlers (e.g. WebCrawler) Users define search terms, and the crawler
goesto various sites searching for the desired information.
11. Hypertext Linkages Provides virtual threads of concepts between
itemsversus directly defining the concepts with in an item.
12. The SMART system uses Vector Model.

You might also like