IRS Objective
IRS Objective
a. Recall
b. Precision ✓
c. Ranking
d. Zoning
a. IR
b. DBMS ✓
c. Both
a. IR ✓
b. DBMS
c. Both
4. Mapping between users’ specified need and the items in the IR systems is done by
_________ Capability
a. Search ✓
b. Browse
c. Miscellaneous
5. User can retrieve the Information as per the needs in IRS using _________ Capabilities.
a. Search
b. Browse
c. Miscellaneous
d. Both a & b ✓
7. The Retrieved Results done by the user, the IRS will produce the results in ____no. of
ways.
a. 2
b. 3
c. 4 ✓
d. 5
a. 0-1 ✓
b. 1-1
c. 1-0
d. Any Range
a. 0-1
b. 1-1
c. 1-0 ✓
d. Any Range
a. Item Normalization ✓
a. Ranking
b. Zoning ✓
c. Classification
d. None
12. The capability to create private and public index files is frequently implemented via a
c. Digital Library
d. Data Warehouse
13. In the__________ process, the user can logically store an item in a file along with
additional index terms and descriptive text.
a. Zoning
b. Index ✓
c. Classification
d. Ranking
14. _________ Algorithm Save system resources by eliminating from the set of searchable
processing tokens those have little value to the search
a. Stemming
c. Stop Algorithm ✓
d. Characterize Tokens
15. _______Provides the capability to dynamically compare newly received items in the
information system against standing statements of interest of users and deliver the item to
those users whose statement of interest matches the contents of the items.
a. Item Normalization
16. To assist the users in generating indexes, the system provides a process called ________
a. Item Normalization
17. ________ will restrict the distance allowed within an item between two search terms.
a. Boolean Operators
b. Proximity ✓
d. Term Masking
18. “United States of America” is an Example for
a. Boolean Operators
b. Proximity
d. Term Masking
20. The capability to name a query and store it to be retrieved and executed during a later
user session is called ________
b. Proximity
21. Rather than typing in a complete new query, the results of the previous search can be used
as a constraining list to create a new query that is applied against it is called _________
b. Proximity
c. Iterative Search ✓
b. words, but does not work for finding ranges of numbers of numeric dates ✓
c. does not work for finding words and ranges of numbers of numeric dates
d. finding ranges of numbers of numeric dates but does not work for words
a. Personal
b. Educational
c. Career
24. ______ is used to mapping between a user’s specified need and the items in the IR
systems that will answer that need.
a. Search Capabilities ✓
b. Browse Capabilities
c. Miscellaneous Capabilities
25. ______ search is a frequently a search returns a Hit file containing many more items than
the user want to review
a. Item Search
b. Data Search
c. Iterative Search ✓
d. All
26. The capability to name a query and store it to be retrieved and executed during a later
user session is called________
a. User Query
b. Ask Query
c. Search Query
d. Canned Query ✓
27. Term masking is useful when applied to words, but does not work for finding ranges of
numbers of numeric dates.
b. Always True ✓
c. Never True
28. _______ Restrict the distance allowed within an item between two search terms.
a. Proximity ✓
b. Term Masking
c. Boolean Logic
d. Fuzzy Searches
a. Recall
b. Precision ✓
c. Both
d. None
a. Recall ✓
b. Precision
c. Both
d. None
31. ________ the user quickly focuses on the potentially relevant parts of the text to scan for
item relevance.
a. Highlighting ✓
b. Zoning
c. Ranking
d. All
a. IRS
b. Indexing ✓
c. DBMS
d. Data Mining
34. The full text searchable data structures for items in the Document File provides a new
class of indexing called _____
c. document indexing
d. None
35. _____ define what level of detail the subject index will contain.
36. _______ is the extent to which the different concepts in the item are indexed.
a. Scope
b. Specificity
c. Exhaustivity ✓
d. All
37. ________ is the preciseness of the index terms used in indexing.
a. Scope
b. Specificity ✓
c. Exhaustivity
d. All
a. Precision
b. Recall
c. Both ✓
d. None
39. ______ is an attempt to place a value on the index term’s representation of its associated
concept in the document.
c. Both
d. None
40. The process of creating term linkages at index creation time is called ______.
a. Post Coordination
b. Pre-coordination ✓
c. Specificity
d. Exhaustivity
41. ______ provides searchers with ways of finding morphological variants of search terms.
a. Pre-coordination
b. STOP Algorithm
c. Stemming ✓
d. All
a. Precision
b. Recall ✓
c. Both
d. None
43. _______ determines a canonical set of concepts based on a test set of terms and uses them
as a basis for indexing all items.
a. Indexing by Term
b. Indexing By Concept ✓
c. Multimedia Indexing
d. All
44. ______ is coordinating terms at search time by ANDing index terms together, which only
finds indexes that have all of the search terms.
a. Post Coordination ✓
b. Pre-coordination
c. Specificity
d. Exhaustivity
a. True ✓
b. False
c. Not Determined
d. Not Related
a. Any
b. X ✓
c. All
d. None
a. CONVECTICS
b. INFORMIX
c. INQUERY ✓
d. All
48. K-Stem Algorithm uses ______ no. of data files to control and limit the stemming
process.
a. 7
b. 5
c. 4
d. 6 ✓
49. ______ is the most Common Data Structure used in both database and IRS.
c. PAT DS
d. All
50. _______ will provide the optimum performance in searching large databases.
a. Inversion List ✓
b. N-Grams
c. Sistring
d. Signature
51. _______ data Structure is used to retrieve the information for continuous text.
a. N-Gram
b. PAT
c. Signature
d. Both a & b ✓
a. Sstring
b. Sistring ✓
c. Substring
d. Sting-sub
a. True
b. False ✓
c. Not Determined
d. Not Related
54. _______ DS eliminates the majority of items that are not related to a query.
a. Inverted File
b. N-Gram
c. PAT DS
d. Signature ✓
55. Indexing is the oldest technique for identifying the contents of an item to assist in their
retrieval.
56. Statistical techniques Calculation of weights use statistic information such as the
frequency of occurrence of words and their distributions in the searchable DB.
58. Automatic indexing is the capability to automatically determine the index terms to be
assigned to an item.
61. Under-stemming prevents related terms from being conflated, and relevant documents
will not be retrieved.
63. Total document indexing (uncontrolled vocabulary) is fast indexing but a difficult
search process.
64. The process of linking index terms together in a single index for a particular concept is
called Pre-Coordination and Linkages.
65. Inverted File Structure is the most common data structure used in both database and
IRS.
UNIT – 3
66. _______ is the process of analyzing an item to extract the information to be kept
permanently in an index.
a. Class Indexing
b. Automatic Indexing ✅
c. Manual Indexing
d. Any
a. Statistical ✅
b. Natural Language
c. Concept
d. Hypertext Linkages
68. _______ indexing stores the information that is used in calculating a probability that a
particular item satisfies a particular query.
a. Probabilistic ✅
b. Bayesian
c. Vector Space
d. Neural Net
69. _______ approaches store information used in generating a relative confidence level of an
item's relevance to a query.
a. Bayesian
b. Vector Space
c. Both ✅
d. None
70. _______ are dynamic learning structures that are discussed under concept indexing,
where they are used to determine concept classes.
a. Probabilistic
b. Bayesian
c. Vector Space
d. Neural Net ✅
71. _______ indexing uses words within an item to correlate to concepts discussed in the
item.
a. Statistical
b. Natural Language
c. Concept ✅
d. Hypertext Linkages
72. _______ approach is based upon the direct application of the theory of probability to IRS.
a. Probabilistic ✅
b. Natural Language
c. Concept
d. Hypertext Linkages
73. _______ produces efficient results when data is retrieved from multiple databases.
a. Probabilistic ✅
b. Natural Language
c. Concept
d. Hypertext Linkages
b. Natural Language ✅
c. Concept
d. Hypertext Linkages
75. Tagged Text Parser structure allows for identification of potential term phrases based
upon _______ identification.
a. Verb
b. Noun ✅
c. Adjective
d. All
a. Probabilistic
b. Natural Language ✅
c. Concept
d. Hypertext Linkages
77. _______ system attempts to introduce a higher level of abstraction indexing on top of the
statistical processes.
a. Probabilistic
b. Natural Language ✅
c. Concept
d. Hypertext Linkages
a. Probabilistic
b. Natural Language
c. Concept ✅
d. Hypertext Linkages
a. Binary
b. Vector ✅
c. Both
d. None
a. Automatically generated ✅
b. Manually generated
c. Crawlers
d. All
81. In _______, users define search terms, and it goes to various sites searching for the
desired information.
a. Automatically generated
b. Manually generated
c. Crawlers ✅
d. All
a. WebCrawler’s
b. Open Text
c. Path Finder
d. All ✅
83. Term Frequency TFij is the frequency of occurrence of a term Ti in a document Dj.
84. Total Term Frequency TTFi is the frequency of occurrence of a term Ti in the entire
collection.
85. Document Frequency (DFi) is the number of unique documents in the collection that
contain a term Ti.
86. Tagged Text Parser structure allows for identification of potential term phrases based
upon Noun identification.
87. Automatic Indexing is the process of analyzing an item to extract the information to be
kept permanently in an index.
88. Manually generated (e.g., Yahoo!) pages are indexed manually into a linked hierarchy (an
“index”). Users browse in the hierarchy by following links.
89. Automatically generated (e.g., Alta Vista) pages at each Internet site are indexed
automatically (creating a “searchable data structure”).
90. Automatically generated structures are used for querying, rather than browsing.
92. Crawlers (e.g., WebCrawler) allow users to define search terms, and the crawler goes to
various sites searching for the desired information.
93. Hypertext Linkages provide virtual threads of concepts between items versus directly
defining the concepts within an item.