ISR Chap..7

Chapter seven discusses various query languages used in information retrieval, including keyword-based, single-word, phrase, multiple-word, Boolean, and weighted queries. Each type of query has its own characteristics, advantages, and disadvantages, impacting how documents are retrieved and ranked based on user input. The chapter also includes an assignment to explore probabilistic models in information retrieval, emphasizing their significance and application.

Uploaded by

biruktilahundinki

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views9 pages

ISR Chap..7

Uploaded by

biruktilahundinki

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Chapter seven

Query Languages
Keyword-based querying
• Queries are combinations of words.
• The document collection is searched for documents
that contain these words.
• Word queries are
• intuitive,
• easy to express and
• provide fast ranking.
• The concept of word must be defined.
• A word is a sequence of letters terminated by a
separator (period, comma, space, etc).
• Definition of letter and separator is flexible; e.g.,
hyphen could be defined as a letter or as a
separator.
• Usually, common words (such as “a”, “the”, “of”, …)
are ignored.
Single-word queries
• A query is a single word – Usually used for
searching in document images
• Simplest form of query.
• All documents that include this word are
retrieved.
• Documents may be ranked by the frequency of
this word in the document.
• Disadvantages:
• Ambiguity:
• Lack of Specificity
Phrase queries
• A query is a sequence of words treated as a single unit.
• Phrase is usually surrounded by quotation marks.
• All documents that include this phrase are retrieved.
• Usually, separators (commas, colons, etc.) and common
words (e.g., “a”, “the”, “of”, “for”…) in the phrase are
ignored.
•In effect, this query is for a set of words that must
appear in sequence.
• Allows users to specify a context and thus gain precision.
• Example: “Information Processing for Document
Retrieval”.
Multiple-word queries
• A query is a set of words (or phrases).
• Two options: A document is retrieved if it includes
• any of the query words, or
• each of the query words.
•Documents are ranked by the number of query words
they contain:
• A document containing n query words is ranked higher
than a document containing m < n query words.
• Documents are ranked in decreasing order:
• those containing all the query words are ranked at the
top, only one query word at bottom.
• –Frequency counts may be used to break tie among
documents that contain the same query words.
Boolean queries
•Based on concepts from logic: AND, OR, NOT
• It describes the information needed by relating multiple
words with Boolean operators.
•Semantics: For each query word w a corresponding set
Dw is constructed that includes the documents that
contain w.
• AND: Finds only documents containing all of the
specified words or phrases.
• OR: Finds documents containing at least one of the
specified words or phrases.
• NOT: Excludes documents containing the specified word
or phrase.
Examples: Boolean queries
1.computer OR server
• Finds documents containing either computer, server or
both
2. (computer OR server) NOT mainframe
• Select all documents that discuss computers or servers,
do not select any documents that discuss mainframes.
3. computer NOT (server OR mainframe)
• Select all documents that discuss computers, and do
not discuss either servers or mainframes.
4. computer OR server NOT mainframe
• Select all documents that discuss computers, or
documents that discuss servers but do not discuss
mainframes.
Weighted queries
•Each of the words is assigned a different weight, expressing
the relative importance of the word within the query.
•A query is then a set of word-weight pairs:
(k1, w1), …, (kn, wn).
•The ranking of a document is the sum of the weights for the
query words that it satisfies.
• Example: given Query: (A,0.8,), (B,0.9), (C,0.3); and
• Document 1: (A, B, D) and Document 2: (A, C, D) which document
ranked first ?
• Rank of Document 1: 0.8+0.9 = 1.7
• Rank of Document 2: 0.8+0.3 = 1.1
• Each document includes two words from the query, but
Document1 is ranked higher because it includes more
important words.
Assignment
Explore and summarize the following concepts related to
probabilistic models in information retrieval:
a. Probabilistic Indexing
b. Information Retrieval as Probabilistic Inference
c. Binary Independence Model (BIM)
d. Bayesian Networks for Text Retrieval
e. Language Model Approach to Information Retrieval
For each concept, provide a brief overview highlighting
its significance and application in the field of
information retrieval.

The Dialects of Modern German A Linguistic Survey Russ PDF Download
100% (4)
The Dialects of Modern German A Linguistic Survey Russ PDF Download
73 pages
đề cương b1 23.4 đây đủ
No ratings yet
đề cương b1 23.4 đây đủ
35 pages
Actual Test 3 - Toeic Writing
No ratings yet
Actual Test 3 - Toeic Writing
10 pages
Sentence Based Error Questions Shortcut Trick With 100 Solved Example
No ratings yet
Sentence Based Error Questions Shortcut Trick With 100 Solved Example
37 pages
Room 6 Term 1 Week 9 2025 Timetable
No ratings yet
Room 6 Term 1 Week 9 2025 Timetable
3 pages
English3 - Q2 - Mod1 - Using Be-Verbs Correctly - V2
100% (6)
English3 - Q2 - Mod1 - Using Be-Verbs Correctly - V2
21 pages
4PS Sequence-Plan-01
No ratings yet
4PS Sequence-Plan-01
2 pages
Query Languages
No ratings yet
Query Languages
54 pages
Module 7
No ratings yet
Module 7
53 pages
Case in English Grammar - Nominative, Possessive, Accusative and Dative Case
No ratings yet
Case in English Grammar - Nominative, Possessive, Accusative and Dative Case
9 pages
EIG Chapter1
No ratings yet
EIG Chapter1
27 pages
Chapter One
No ratings yet
Chapter One
23 pages
Chapte 3
No ratings yet
Chapte 3
16 pages
Unit 5 - Query Operations and Languages
No ratings yet
Unit 5 - Query Operations and Languages
11 pages
Acconting Note
No ratings yet
Acconting Note
9 pages
Be Verb-2
No ratings yet
Be Verb-2
8 pages
ISE Information Retrieval Mod-V (Uploaded by Snaptricks - In)
No ratings yet
ISE Information Retrieval Mod-V (Uploaded by Snaptricks - In)
48 pages
Bulu
No ratings yet
Bulu
47 pages
CS726 Information Retrieval Techniques Complete Handouts (Downloded From Cluesbook - Com)
No ratings yet
CS726 Information Retrieval Techniques Complete Handouts (Downloded From Cluesbook - Com)
237 pages
IR Merged Merged
No ratings yet
IR Merged Merged
132 pages
Lecture1 Intro
No ratings yet
Lecture1 Intro
57 pages
MIR Mod - 03 (Chapter04-Query Languages)
No ratings yet
MIR Mod - 03 (Chapter04-Query Languages)
31 pages
Module 1-1
No ratings yet
Module 1-1
12 pages
Query Languages
No ratings yet
Query Languages
5 pages
Unit3 QueryLanguages Berlin
No ratings yet
Unit3 QueryLanguages Berlin
29 pages
Chapter 1
No ratings yet
Chapter 1
5 pages
Sample 4as Lesson Plan English1
No ratings yet
Sample 4as Lesson Plan English1
4 pages
New Password B1+ UT 2A
No ratings yet
New Password B1+ UT 2A
4 pages
Elective 2 Sir Jayson
No ratings yet
Elective 2 Sir Jayson
4 pages
Query Languages
No ratings yet
Query Languages
34 pages
IR Unit-3
No ratings yet
IR Unit-3
75 pages
Query Languages-WPS Office
No ratings yet
Query Languages-WPS Office
8 pages
LAS-Q'1 W1 Language - Corrected
No ratings yet
LAS-Q'1 W1 Language - Corrected
11 pages
7 Query Languages Operations
No ratings yet
7 Query Languages Operations
12 pages
Emutye
No ratings yet
Emutye
20 pages
HTML
No ratings yet
HTML
1 page
Chapter 1
No ratings yet
Chapter 1
52 pages
IR Chapter 4
No ratings yet
IR Chapter 4
15 pages
Editing and Revising - Fall 2023
No ratings yet
Editing and Revising - Fall 2023
25 pages
Examen Anglais 2020 Session Normale Corrige 10
No ratings yet
Examen Anglais 2020 Session Normale Corrige 10
1 page
Week 2 - Information Retrieval Basics
No ratings yet
Week 2 - Information Retrieval Basics
74 pages
PEG - To BE - Class Activity
No ratings yet
PEG - To BE - Class Activity
6 pages
Information Retrieval - Lecture 4 5
No ratings yet
Information Retrieval - Lecture 4 5
15 pages
IR-Lec1 - Ch1-2023
No ratings yet
IR-Lec1 - Ch1-2023
41 pages
6-Query Languages
No ratings yet
6-Query Languages
19 pages
Verb With Both - Ing and To + Infinitive: Presentation
No ratings yet
Verb With Both - Ing and To + Infinitive: Presentation
2 pages
7 B - Query Languages
No ratings yet
7 B - Query Languages
33 pages
Unit 2
No ratings yet
Unit 2
58 pages
Information Retrieval Models
No ratings yet
Information Retrieval Models
15 pages
Department of Education: Republic of The Philippines
100% (1)
Department of Education: Republic of The Philippines
2 pages
Unit 2 Irt
No ratings yet
Unit 2 Irt
33 pages
Information Retrieval - 1
No ratings yet
Information Retrieval - 1
47 pages
(Alan Jenkins (Auth.) ) The Social Theory of Claude
No ratings yet
(Alan Jenkins (Auth.) ) The Social Theory of Claude
198 pages
Lecture1-Intro - Realted To Ch1
No ratings yet
Lecture1-Intro - Realted To Ch1
60 pages
Pointers To Review - Second
No ratings yet
Pointers To Review - Second
4 pages
85 13 Job-Performance US Student
No ratings yet
85 13 Job-Performance US Student
5 pages
Purcom Academic Writing Questions
No ratings yet
Purcom Academic Writing Questions
3 pages
Guide 2 - Present Perfect Vs Present Perfect Continuous
No ratings yet
Guide 2 - Present Perfect Vs Present Perfect Continuous
7 pages
Unit II
No ratings yet
Unit II
73 pages
All Units Notes TYBSC-CS-Information-Retrieval
No ratings yet
All Units Notes TYBSC-CS-Information-Retrieval
89 pages
Class 11 Infinitive
No ratings yet
Class 11 Infinitive
14 pages
Informaiton Retrieval and Web Search
No ratings yet
Informaiton Retrieval and Web Search
44 pages
Adjectives Ending in - Ing or - Ed - Teacher
No ratings yet
Adjectives Ending in - Ing or - Ed - Teacher
4 pages
FCE Tips: How To Write An Article
No ratings yet
FCE Tips: How To Write An Article
4 pages
Adjectives That Describe Places PDF
No ratings yet
Adjectives That Describe Places PDF
2 pages
Tamrakar 2015
No ratings yet
Tamrakar 2015
6 pages
Great Expectations - Second-Language Acquisition Research and Classroom Teaching - Lightbown
No ratings yet
Great Expectations - Second-Language Acquisition Research and Classroom Teaching - Lightbown
17 pages
Cs8080 Ir Unit2 I Modeling and Retrieval Evaluation
No ratings yet
Cs8080 Ir Unit2 I Modeling and Retrieval Evaluation
42 pages
11 Multimedia Media IR
No ratings yet
11 Multimedia Media IR
19 pages
CS583 Info Retrieval
No ratings yet
CS583 Info Retrieval
33 pages
Information Retrieval Detailed Lecture Nov 2023
No ratings yet
Information Retrieval Detailed Lecture Nov 2023
39 pages
Irs 3
No ratings yet
Irs 3
14 pages
Chapter Five (ISR)
No ratings yet
Chapter Five (ISR)
17 pages
Information Retrieval: Adt-V Unit
No ratings yet
Information Retrieval: Adt-V Unit
106 pages
Standard Akademik Sekolah-Sekolah Negeri Johor Piawaian Latihan Akademik Negeri Johor Plan J Mata Pelajaran:Bahasa Inggeris . Tingkatan: Satu
No ratings yet
Standard Akademik Sekolah-Sekolah Negeri Johor Piawaian Latihan Akademik Negeri Johor Plan J Mata Pelajaran:Bahasa Inggeris . Tingkatan: Satu
8 pages
NLP - Module 5
No ratings yet
NLP - Module 5
58 pages
IR Chap7
No ratings yet
IR Chap7
30 pages
Query Languages: Chapter Seven
No ratings yet
Query Languages: Chapter Seven
36 pages
Web Information Retrieval
No ratings yet
Web Information Retrieval
10 pages
Chapter 4: Query Languages: Baeza-Yates, 1999 Modern Information Retrieval
No ratings yet
Chapter 4: Query Languages: Baeza-Yates, 1999 Modern Information Retrieval
29 pages
Introduction of IR Models
No ratings yet
Introduction of IR Models
67 pages
Introduction To Information Retrieval
No ratings yet
Introduction To Information Retrieval
50 pages
Unit Ii Modeling
No ratings yet
Unit Ii Modeling
15 pages
Unit 1
No ratings yet
Unit 1
181 pages
IR Models: - Why IR Models? - Boolean IR Model - Vector Space IR Model - Probabilistic IR Model
No ratings yet
IR Models: - Why IR Models? - Boolean IR Model - Vector Space IR Model - Probabilistic IR Model
46 pages
Unit 1: Introduction and Data Pre-Processing
No ratings yet
Unit 1: Introduction and Data Pre-Processing
71 pages
Query Languages and Query Operation: Chapter Seven
No ratings yet
Query Languages and Query Operation: Chapter Seven
20 pages
Modern Information Retrieval: Queries: Languages & Properties
No ratings yet
Modern Information Retrieval: Queries: Languages & Properties
67 pages
CompletedUNIT 1 PPT 10.7.17
100% (6)
CompletedUNIT 1 PPT 10.7.17
87 pages
Information Retrieval
No ratings yet
Information Retrieval
5 pages
Made By:-Bhawana Agarwal Cs Iiiyr
No ratings yet
Made By:-Bhawana Agarwal Cs Iiiyr
29 pages
Completed Unit II 17.7.17
No ratings yet
Completed Unit II 17.7.17
113 pages
cs419-519 Slides Part 2
No ratings yet
cs419-519 Slides Part 2
6 pages
Lecture 3-Skip Pointers and Phrase Queries
No ratings yet
Lecture 3-Skip Pointers and Phrase Queries
12 pages

ISR Chap..7

Uploaded by

ISR Chap..7

Uploaded by

Chapter seven

You might also like