Information Retrieval Systems: Assignment QA
"Fast Data Finder Architecture" is an efficient framework designed to enhance the retrieval speed and
accuracy of data from large datasets or databases. Below is an outline of its key components and
functionalities:
1. Indexing Module
Function: Converts raw data into an organized index structure for faster access.
Process: Uses inverted indexing or B-trees for quick lookup.
Example: Keywords are linked to document IDs for fast retrieval.
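A minimal sketch of the inverted-index idea described above; the documents and keywords are hypothetical, not from the notes:
```python
from collections import defaultdict

def build_inverted_index(docs):
    """Map each keyword to the set of document IDs that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index

# Hypothetical documents for illustration.
docs = {1: "fast data finder", 2: "data retrieval speed", 3: "fast retrieval"}
index = build_inverted_index(docs)
print(index["data"])  # {1, 2} -- keywords linked to document IDs
print(index["fast"])  # {1, 3}
```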
2. Query Processing Engine
Function: Interprets and processes user queries to retrieve relevant information.
Process: Breaks down queries into tokens, normalizes them, and matches them against the index.
Optimization: Implements techniques like query expansion, stop-word removal, and stemming.
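A small sketch of the query-processing steps just named; the stop-word list and the suffix-stripping stemmer are simplified stand-ins for real components such as a Porter stemmer:
```python
STOP_WORDS = {"the", "a", "an", "of", "in", "and", "or"}  # tiny illustrative list

def naive_stem(term):
    """Stand-in for a real stemmer (e.g., Porter): strips one common suffix."""
    for suffix in ("ing", "ed", "es", "s"):
        if term.endswith(suffix) and len(term) > len(suffix) + 2:
            return term[: -len(suffix)]
    return term

def process_query(query):
    """Tokenize, normalize to lowercase, remove stop words, and stem."""
    tokens = query.lower().split()
    return [naive_stem(t) for t in tokens if t not in STOP_WORDS]

print(process_query("The retrieval of indexed documents"))
# ['retrieval', 'index', 'document']
```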
3. Ranking and Scoring Module
Function: Orders the retrieved results based on relevance to the query.
Methods:
Uses TF-IDF (Term Frequency-Inverse Document Frequency) or BM25 scoring.
Includes user-centric relevance feedback for adaptive ranking.
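A basic TF-IDF scorer illustrating the ranking idea; the documents are hypothetical, and in practice BM25 or relevance feedback would refine this simple sum:
```python
import math
from collections import Counter

def tf_idf_scores(query_terms, docs):
    """Score each document against the query with a simple TF * IDF sum."""
    n = len(docs)
    tokenized = {doc_id: text.lower().split() for doc_id, text in docs.items()}
    # Document frequency: how many documents contain each query term.
    df = {t: sum(1 for toks in tokenized.values() if t in toks) for t in query_terms}
    scores = {}
    for doc_id, tokens in tokenized.items():
        tf = Counter(tokens)
        scores[doc_id] = sum(tf[t] * math.log(n / df[t]) for t in query_terms if df[t])
    return scores

# Hypothetical documents for illustration.
docs = {1: "data retrieval systems", 2: "fast data finder", 3: "ranking methods"}
print(tf_idf_scores(["data", "retrieval"], docs))
# {1: ~1.50, 2: ~0.41, 3: 0.0}
```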
4. Data Storage Optimization
Function: Structures data in storage to ensure quick access and retrieval.
Techniques:
Employs partitioning and sharding for distributed data management.
Caches frequently accessed data for low-latency retrieval.
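A sketch of caching frequently accessed results, here using Python's built-in functools.lru_cache; run_query is a hypothetical stand-in for a real index lookup:
```python
from functools import lru_cache

@lru_cache(maxsize=128)  # keep the 128 most recently used query results
def run_query(query):
    """Pretend this hits the index; repeated queries are served from cache."""
    print(f"computing result for {query!r}")
    return f"results for {query}"

run_query("fast data")  # computed once
run_query("fast data")  # served from cache, no recomputation
```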
5. Scalability and Parallel Processing
Function: Handles large-scale data retrieval requests efficiently.
Approach:
Uses distributed computing frameworks like Hadoop or Spark for scalability.
Integrates with NoSQL databases for handling unstructured or semi-structured data.
6. User Interface and Interaction Layer
Function: Provides intuitive search and navigation capabilities for users.
Features:
Auto-completion, query suggestions, and spell-check.
Visual representation of results like clusters or tag clouds.
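A minimal sketch of prefix-based auto-completion over a sorted vocabulary; the term list is illustrative:
```python
import bisect

def autocomplete(prefix, sorted_vocab):
    """Return indexed terms that start with the typed prefix."""
    i = bisect.bisect_left(sorted_vocab, prefix)
    matches = []
    while i < len(sorted_vocab) and sorted_vocab[i].startswith(prefix):
        matches.append(sorted_vocab[i])
        i += 1
    return matches

vocab = sorted(["retrieval", "ranking", "recall", "relevance", "precision"])
print(autocomplete("re", vocab))  # ['recall', 'relevance', 'retrieval']
```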
Key Benefits
Speed: Reduces query processing time through pre-computed indices and caching mechanisms.
Accuracy: Improves result relevance with sophisticated ranking and scoring algorithms.
Scalability: Manages exponential data growth using distributed and parallel processing.
User-Friendliness: Enhances user experience with intelligent query interpretation and result
presentation.
This architecture emphasizes the retrieval of relevant data rather than employing machine learning
methods, focusing on established indexing, query processing, and ranking techniques for optimal
performance.
3.a) Explain the advantages and disadvantages of spoken language audio retrieval.
User Interaction
Advantage: Natural and intuitive, allowing voice-based queries instead of typing.
Disadvantage: May struggle with accents, unclear pronunciation, or homonyms, leading to recognition errors.
Accessibility
Advantage: Benefits users with disabilities, such as visual impairments.
Disadvantage: Limited effectiveness for users with speech impairments or strong dialects.
Multimedia Support
Advantage: Enables indexing and searching within podcasts, lectures, and other audio content.
Disadvantage: High computational and storage demands for processing and indexing spoken data.
Real-Time Applications
Advantage: Allows real-time query processing for live streams or broadcasts.
Disadvantage: Real-time processing may require significant computational power, leading to delays.
Storage
Advantage: Efficiently organizes audio data for retrieval.
Disadvantage: Requires significant storage for audio files, metadata, and transcriptions.
Audio Quality Dependency
Advantage: High-quality results are achievable with clear audio.
Disadvantage: Poor audio quality (background noise, low bitrate) reduces retrieval accuracy.
b) Explain multimedia information retrieval system.
1. Definition
MIR involves searching, indexing, and retrieving information from multimedia content such as
text, images, audio, video, and graphics.
2. Key Components
Feature Extraction: Converts raw data into structured representations (e.g., visual features in
images, spectral features in audio).
Indexing: Organizes multimedia data for efficient storage and retrieval.
Query Processing: Interprets user queries across different modalities (text, image, audio, etc.).
Similarity Matching: Measures relevance using metrics like cosine similarity or cross-modal matching (see the sketch after this list).
Ranking and Scoring: Orders results by relevance using methods like TF-IDF or deep learning
models.
Metadata Integration: Enhances retrieval accuracy by leveraging metadata (tags, timestamps, etc.).
Relevance Feedback: Incorporates user feedback to improve retrieval accuracy.
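As referenced in the Similarity Matching component, a minimal cosine-similarity sketch over feature vectors; the vectors are hypothetical:
```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two feature vectors (any modality)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

# Hypothetical feature vectors, e.g., extracted from a query and an image.
query_vec = [0.2, 0.7, 0.1]
doc_vec = [0.3, 0.6, 0.0]
print(round(cosine_similarity(query_vec, doc_vec), 3))
```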
3. Techniques
Text: NLP, TF-IDF, semantic analysis.
Images: Feature descriptors (SIFT, SURF), CNNs for recognition/classification.
Audio: Spectral analysis, MFCCs, automatic speech recognition (ASR).
Video: Keyframe extraction, motion detection, scene analysis, video summarization.
4. Applications
Healthcare: Retrieval of medical images or videos based on symptoms or features.
Entertainment: Content-based music or video recommendations.
Education: Indexing and retrieving recorded lectures or e-learning materials.
E-commerce: Visual search for products using images or videos.
Surveillance: Retrieving relevant video feeds or images from security systems.
5. Advantages
Multimodal search across diverse content.
Intuitive search capabilities like visual or voice-based queries.
Efficient handling of large-scale multimedia repositories.
6. Challenges
Feature Representation: Extracting meaningful features from varied multimedia data.
Semantic Gap: Bridging low-level features (e.g., pixels) and high-level concepts (e.g., objects).
Storage and Scalability: Managing the large volume of multimedia data efficiently.
Cross-Modal Retrieval: Effective retrieval across different modalities (e.g., image-to-text).
User Intent: Interpreting ambiguous or vague multimedia queries.
With similarity measures, the goal is to retrieve the documents or items that are most relevant to a user's query.
Jaccard and Dice similarity measures are commonly applied to quantify the similarity between a query
and documents in the database.
1. Formula Structure
Jaccard Similarity: J(A, B) = |A ∩ B| / |A ∪ B|
Dice Similarity: D(A, B) = 2|A ∩ B| / (|A| + |B|)
The Dice similarity introduces a factor of 2 in the numerator and uses the sum of the cardinalities of the two sets in the denominator, whereas Jaccard normalizes by the size of the union.
2. Normalization and Sensitivity
Jaccard:
The normalization in Jaccard is heavily influenced by the union of the sets, making it sensitive to differences in overall set size: when the union is large relative to the intersection, the similarity value drops quickly.
Dice:
The Dice measure simplifies the denominator and normalizes based on the average size of the two
sets (as the sum of their sizes is divided by 2). It is less sensitive to variations in set size compared to
Jaccard and emphasizes the overlap more strongly.
3. Range of Values
Jaccard:
The similarity value is always between 0 and 1, where 1 indicates perfect similarity, and 0 indicates no
similarity. There is no possibility of negative values.
Dice:
Like Jaccard, the Dice similarity also ranges between 0 and 1, with similar interpretations of these
limits.
4. Behavior with Sparse Data
Jaccard:
When the sets are large but share only a few elements, the similarity value can become very small
because the union dominates the denominator.
Dice:
The Dice measure tends to give higher similarity scores in cases of sparse data due to its emphasis on
the intersection size relative to the total size of the sets.
5. Use Cases
Jaccard:
Jaccard is often used in cases where the size of the union is significant, such as comparing document
sets or clustering algorithms that require strict discrimination based on overlap and total coverage.
Dice:
The Dice coefficient is commonly used in applications where emphasizing commonalities is more
critical, such as in medical image analysis or text similarity where the overlap matters more than the
total set size.
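A small sketch computing both measures on the same pair of sets, showing that Dice scores the same overlap higher than Jaccard:
```python
def jaccard(a, b):
    """|A ∩ B| / |A ∪ B|"""
    return len(a & b) / len(a | b) if a | b else 0.0

def dice(a, b):
    """2|A ∩ B| / (|A| + |B|)"""
    return 2 * len(a & b) / (len(a) + len(b)) if a or b else 0.0

A = {"data", "retrieval", "index"}
B = {"data", "retrieval", "ranking", "scoring"}
print(jaccard(A, B))  # 2 / 5 = 0.4
print(dice(A, B))     # 4 / 7 ≈ 0.571 -- the same overlap scores higher under Dice
```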
The Knuth-Morris-Pratt (KMP) algorithm is an efficient method for finding a substring (also known as a
pattern) within a string (also known as the text).
It improves on the brute-force search method by precomputing a prefix (failure) table for the pattern, which lets it skip re-comparing characters that have already been matched, avoiding unnecessary comparisons.
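A sketch of KMP in Python, with the prefix (failure) table built first and then reused during the scan:
```python
def kmp_search(text, pattern):
    """Return the index of the first occurrence of pattern in text, or -1."""
    if not pattern:
        return 0
    # Build the failure table: lps[i] is the length of the longest proper
    # prefix of pattern[:i+1] that is also a suffix of it.
    lps = [0] * len(pattern)
    k = 0
    for i in range(1, len(pattern)):
        while k and pattern[i] != pattern[k]:
            k = lps[k - 1]
        if pattern[i] == pattern[k]:
            k += 1
        lps[i] = k
    # Scan the text, reusing the table to avoid re-comparing matched prefixes.
    k = 0
    for i, ch in enumerate(text):
        while k and ch != pattern[k]:
            k = lps[k - 1]
        if ch == pattern[k]:
            k += 1
        if k == len(pattern):
            return i - k + 1
    return -1

print(kmp_search("ababcabcabababd", "ababd"))  # 10
```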
1. Boolean queries use strict operators (AND, OR, NOT), while weighted systems assign importance to
terms.
2. Strict Boolean operators with weighted systems can lead to suboptimal results.
3. Fox and Sharat proposed a fuzzy set approach, adding degrees of membership for terms.
4. The MMM model combines minimum and maximum term weights to refine retrieval (see the sketch after this list).
5. Paice expanded MMM, considering all term weights for AND and OR queries.
6. The P-norm model treats query terms as coordinates, adjusting operator strictness with a parameter.
7. Salton suggested refining Boolean results with term weights ranging from 0.0 to 1.0.
8. As term weights change, results gradually include more or fewer matching items.
9. The algorithm involves initial Boolean operations, adjusting results based on similarity.
10. Venn diagrams visualize changes in result sets as term weights adjust.
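A hedged sketch of the MMM (Mixed Min and Max) idea from point 4: the similarity of a document to an OR (or AND) query is a weighted mix of the maximum and minimum term weights. The 0.7/0.3 coefficients are illustrative tuning parameters, not values from the notes:
```python
def mmm_or(weights, c1=0.7, c2=0.3):
    """OR leans on the best-matching term but still credits the worst."""
    return c1 * max(weights) + c2 * min(weights)

def mmm_and(weights, c1=0.7, c2=0.3):
    """AND leans on the worst-matching term but still credits the best."""
    return c1 * min(weights) + c2 * max(weights)

# Document weights for query terms A and B in one document.
w = [0.9, 0.2]
print(mmm_or(w))   # 0.69 -- softer than strict Boolean OR
print(mmm_and(w))  # 0.41 -- softer than strict Boolean AND
```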
The normal Boolean operations produce the following results:
“A AND B” retrieves those items that contain both term A and term B.
“A OR B” retrieves those items that contain the term A or the term B or both.
“A NOT B” retrieves those items that contain term A and do not contain term B.
If weights are then assigned to the terms between the values 0.0 to 1.0, they may be interpreted as the
significance that users are placing on each term.
The value 0.0 is interpreted to mean that the user places no value on the term.
Under these assumptions, a term assigned a value of 0.0 should have no effect on the retrieved set.
Thus “A AND B,” with B weighted 0.0, should return the set of items that contain term A, and “A OR B” with B weighted 0.0 should also return set A.
The increasing volume of imagery has made effective access and retrieval critical, emphasizing both
metadata and visual content-based indexing.
1. Early work focused on automatically indexing visual features such as color, texture, and shape for
retrieving similar images.
2. The ultimate goal is to enable semantic-based access to imagery, going beyond manual indexing.
3. QBIC (Query By Image Content) allows search based on visual attributes like color, shape, texture,
and sketches, replacing traditional keyword searches.
4. QBIC can retrieve images based on specific attributes, such as searching for "red stamps" or stamps
related to a president.
5. Users can also combine queries like "red round object with green square" to refine results.
6. Automatic and semi-automatic tools were developed to assist with object identification and database
population.
7. Content-based video retrieval techniques have been applied, such as shot detection and representative
frame extraction for video searches.
8. Face processing research distinguishes between face detection, recognition, and retrieval, allowing for
precise identification in various contexts.
9. Real-world applications like the US Immigration Service use face recognition for verifying fast lane
drivers at the border.
10. Face recognition systems track human movement and expressions, contributing to emotional
recognition for human-computer interaction.
11. Video retrieval systems, like Informedia, use face recognition to allow users to search for or identify
faces in video content.
Content-based video retrieval focuses on searching and accessing video data based on the content within
the video itself, rather than relying on manual annotations or keywords. Here are the key points related to
this approach:
1. Video Mail, Surveillance, and TV Access: Content-based retrieval can be applied to various domains,
such as video mail, surveillance systems, and broadcast television.
2. Broadcast News Navigator (BNN): BNN is a system that automates the process of capturing,
annotating, segmenting, summarizing, and visualizing broadcast news video to facilitate content-
based search and retrieval.
3. Multistream Analysis: BNN integrates text, speech, and image processing to analyze video content,
enabling search based on a variety of media streams.
4. Search Capabilities: Users can search by text keywords, speech transcriptions, or named entities (e.g.,
people, places) within video content.
5. Query Refinement: Users can refine their search using filters like date ranges or specific named entities
(e.g., searching for news related to "George Bush" and "New York").
6. Story Skims: BNN generates a “story skim,” which presents a keyframe along with the most frequent
named entities in a news story, making it easier for users to locate relevant video content.
7. Time-Interval Browsing: Users can browse news stories within specific time intervals or from
particular sources, further enhancing content navigation.
8. Named Entity Analysis: BNN allows users to mine correlations between named entities, improving
the precision of content retrieval.
9. Improved Performance: BNN’s automated video segmentation (based on visual, speaker, or topic
changes) allows users to find video content much faster than with traditional search methods.
10. Topic Detection and Tracking: Systems like TDT (Topic Detection and Tracking) aim to identify
topics, segment stories, and track the occurrence of topics over time in video content.
11. GeoNODE: GeoNODE is another advanced system for analyzing broadcast video and news in a
geospatial and temporal context, helping users access relevant information by location or time.
12. Geospatial Visualization: GeoNODE can map the frequency of mentions across different
geographical locations, visualizing news coverage based on location.
13. High Accuracy: GeoNODE has been shown to accurately identify topics and detect stories, achieving
results comparable to other advanced retrieval initiatives.
14. Future Potential: Content-based video retrieval systems will continue to evolve, relying on machine
learning, multimedia corpora, and evaluation strategies to improve performance and extraction
methods.
Thesauri and semantic networks are generally useful for expanding a user's search statement to include potentially related search terms.
But this expansion still may not match the vocabulary actually used by the authors of the items in a particular database.
There is also a significant risk that the thesaurus does not include the latest jargon being used,
acronyms or proper nouns.
In an interactive system, users can manually modify an inefficient query or have the system
automatically expand the query via a thesaurus.
The user can also use relevant items that have been found by the system (irrespective of their
ranking) to improve future searches, which is the basis behind relevance feedback.
Relevant items (or portions of relevant items) are used to reweight the existing query terms and
possibly expand the user’s search statement with new terms.
The relevance feedback concept was that the new query should be based on the old query modified to
increase the weight of terms in relevant items and decrease the weight of terms that are in non-
relevant items.
This technique not only modified the terms in the original query but also allowed expansion of new
terms from the relevant items.
The formula used is the standard Rocchio form:
Q_new = α·Q_old + β·(1/|R|)·Σ_{D ∈ R} D − γ·(1/|NR|)·Σ_{D ∈ NR} D
where R and NR are the sets of vectors for the judged relevant and non-relevant items, and α, β, γ are tuning constants with β > γ.
Positive feedback is weighted significantly more heavily than negative feedback. Many times only positive
feedback is used in a relevance feedback environment.
Positive feedback is more likely to move a query closer to a user’s information needs. Negative feedback may
help, but in some cases it actually reduces the effectiveness of a query.
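A minimal Rocchio-style sketch of this reweighting, with β > γ so that positive feedback outweighs negative feedback as described; the vectors and constants are illustrative:
```python
def rocchio(query, relevant, non_relevant, alpha=1.0, beta=0.75, gamma=0.15):
    """Move the query vector toward relevant items and, more weakly,
    away from non-relevant ones."""
    new_q = [alpha * q for q in query]
    for d in range(len(query)):
        if relevant:
            new_q[d] += beta * sum(doc[d] for doc in relevant) / len(relevant)
        if non_relevant:
            new_q[d] -= gamma * sum(doc[d] for doc in non_relevant) / len(non_relevant)
    # Negative weights are usually clipped to zero.
    return [max(0.0, w) for w in new_q]

q = [1.0, 0.0, 0.5]       # original query weights
rel = [[0.8, 0.6, 0.0]]   # one item judged relevant
nonrel = [[0.0, 0.0, 0.9]]  # one item judged non-relevant
print(rocchio(q, rel, nonrel))  # [1.6, 0.45, 0.365]
```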
Figure 7.6 gives an example of the impacts of positive and negative feedback. The filled circles
represent non-relevant items; the other circles represent relevant items.
The oval represents the items that are returned from the query. The solid box is logically where the query
is initially.
The hollow box is the query modified by relevance feedback (positive only or negative only in the
Figure).