
Chapter 5: Retrieval Effectiveness
Adama Science and Technology University
School of Electrical Engineering and Computing
Department of CSE
Dr. Mesfin Abebe Haile (2024)
Retrieval Effectiveness

 Evaluation of IR systems,
 Relevance judgement,
 Performance measures:
 Recall,
 Precision,
 Single-valued measures
 etc.
Why System Evaluation?

 Any system needs validation and verification:

 Check whether the system is built right (verification),
 Check whether it is the right system (validation).

 It provides the ability to measure the difference between IR systems:
 How well do our search engines work?
 Is system A better than B?
 Under what conditions?

Why System Evaluation?

 Evaluation drives what to study:


 Identify techniques that work well and those that do not,
 There are many retrieval models/algorithms,
Which one is the best?
 What is the best component for:
Similarity measures (dot-product, cosine, …)
Index term selection (tokenization, stop-word removal,
stemming…)
Term weighting (TF, TF-IDF,…)

Evaluation Criteria

 What are the main evaluation measures to check the performance of an IR system?

 Efficiency:
 Time and space complexity:
 Speed in terms of retrieval time and indexing time,
 Speed of query processing,
 The space taken by corpus vs. index file,
 Index size: determine Index/corpus size ratio
 Is there a need for compression?
Evaluation Criteria

 Effectiveness:
 How well is the system capable of retrieving relevant documents from the collection?
 Is system X better than other systems?
 User satisfaction: How “good” are the documents that are returned
as a response to user query?
 Relevance of results to meet information need of users.
Types of Evaluation Strategies

 System-centered evaluation:
 Given documents, queries, and relevance judgments,
 Try several variations of the system,
 Measure which system returns the “best” hit list.

 User-centered evaluation:
 Given several users, and at least two retrieval systems:
 Have each user try the same task on both systems,
Measure which system works the “best” for the user's information need,
 How to measure user satisfaction?
The Notion of Relevance Judgment

 Relevance is the measure of the correspondence between a document and a query.
 The relevance of a document to a query may be determined by:
 (i) The user who posed the retrieval problem;
 (ii) An external judge;
 (iii) An information specialist.

 Is the relevance judgment made by the user and an external judge the same?
The Notion of Relevance Judgment

 Relevance judgment is usually:


 Subjective: Depends upon a specific user’s judgment.
 Situational: Relates to user’s current needs.
 Cognitive: Depends on human perception and behavior.
 Dynamic: Changes over time.
Measuring Retrieval Effectiveness
• Metrics often used to evaluate the effectiveness of the system:

                    Relevant                      Irrelevant
  Retrieved            A                    B  (“Type one error”)
  Not retrieved        C  (“Type two error”)        D
Retrieval of documents may result in:
 False positive (errors of commission): some irrelevant documents may be retrieved by the system as relevant.
 False negative (false drop, or errors of omission): some relevant documents may not be retrieved by the system because they are judged irrelevant.
 For many applications a good index should not permit any false drops, but may permit a few false positives.
Measuring Retrieval Effectiveness

                    Relevant      Not relevant
  Retrieved            A               B
  Not retrieved        C               D

  Collection size = A + B + C + D;   Relevant = A + C;   Retrieved = A + B

  Recall    = |{Relevant} ∩ {Retrieved}| / |{Relevant}|  = A / (A + C)

  Precision = |{Relevant} ∩ {Retrieved}| / |{Retrieved}| = A / (A + B)

 When is precision important? When is recall important?
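
For concreteness, here is a minimal Python sketch showing how the contingency-table cells A, B and C and the two measures can be computed from sets of document identifiers; the document IDs and sets below are invented for illustration.

```python
# Minimal sketch: precision and recall from sets of document IDs.
# The document IDs are made up for illustration.

relevant  = {1, 2, 3, 5, 8}        # judged relevant for the query
retrieved = {2, 3, 4, 8, 9, 10}    # returned by the system

A = relevant & retrieved           # relevant and retrieved
B = retrieved - relevant           # irrelevant but retrieved  ("type one" errors)
C = relevant - retrieved           # relevant but not retrieved ("type two" errors)

recall    = len(A) / len(relevant)     # |Relevant ∩ Retrieved| / |Relevant|
precision = len(A) / len(retrieved)    # |Relevant ∩ Retrieved| / |Retrieved|

print(f"recall = {recall:.2f}, precision = {precision:.2f}")  # recall = 0.60, precision = 0.50
```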


Example

 Assume that there are a total of 10 relevant documents.


Ranking Relevance Recall Precision
1. Doc. 50 R 0.10 = (1/10) 1.00 = (1/1)
2. Doc. 34 NR 0.10 = (1/10) 0.50 = (1/2)
3. Doc. 45 R 0.20 = (2/10) 0.67 = (2/3)
4. Doc. 8 NR 0.20 = (2/10) 0.50 = (2/4)
5. Doc. 23 NR 0.20 = (2/10) 0.40 = (2/5)
6. Doc. 16 NR 0.20 = (2/10) 0.33 = (2/6)
7. Doc. 63 R 0.30 = (3/10) 0.43 = (3/7)
8. Doc 119 R 0.40 = (4/10) 0.50 = (4/8)
9. Doc 21 NR 0.40 = (4/10) 0.44 = (4/9)
10. Doc 80 R 0.50 = (5/10) 0.50 = (5/10)
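
A small sketch that recomputes recall and precision after each rank, assuming the R/NR flags of the table above:

```python
# Sketch: recall and precision after each rank in the example above
# (10 relevant documents in total; R/NR flags taken from the table).

ranking = ["R", "NR", "R", "NR", "NR", "NR", "R", "R", "NR", "R"]
total_relevant = 10

hits = 0
for k, flag in enumerate(ranking, start=1):
    if flag == "R":
        hits += 1
    recall = hits / total_relevant
    precision = hits / k
    print(f"rank {k:2d}: recall = {recall:.2f}, precision = {precision:.2f}")
# Rank 1 gives recall 0.10 / precision 1.00; rank 10 gives 0.50 / 0.50, matching the table.
```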
Graphing Precision and Recall

 Plot each (recall, precision) point on a graph.


 Recall is a non-decreasing function of the number of documents
retrieved,
 Precision usually decreases (in a good system)
 Precision/Recall tradeoff:
 Can increase recall by retrieving many documents (down to a low
level of relevance ranking),
 But many irrelevant documents would be fetched, reducing
precision.
 Can get high recall (but low precision) by retrieving all documents
for all queries.
Graphing Precision and Recall

 Plot each (recall, precision) point on a graph.

  [Figure: precision (y-axis, 0 to 1) vs. recall (x-axis, 0 to 1). The ideal system lies at the top-right corner. A high-precision, low-recall system returns relevant documents but misses many useful ones; a high-recall, low-precision system returns most relevant documents but includes lots of junk.]
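
A minimal plotting sketch, assuming matplotlib is installed; the recall/precision values are the ones computed for the ten-document example above:

```python
# Sketch: plotting the (recall, precision) points from the ranked example.
import matplotlib.pyplot as plt

recall    = [0.10, 0.10, 0.20, 0.20, 0.20, 0.20, 0.30, 0.40, 0.40, 0.50]
precision = [1.00, 0.50, 0.67, 0.50, 0.40, 0.33, 0.43, 0.50, 0.44, 0.50]

plt.plot(recall, precision, marker="o")
plt.xlabel("Recall")
plt.ylabel("Precision")
plt.xlim(0, 1)
plt.ylim(0, 1)
plt.title("Precision/Recall trade-off")
plt.show()
```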
Exercise

 Let the total number of relevant documents = 6; compute recall and precision at each cut-off point n:
  n    doc #   relevant   Recall   Precision
  1    588     x          0.167    1
  2    589     x          0.333    1
  3    576
  4    590     x          0.5      0.75
  5    986
  6    592     x          0.667    0.667
  7    984
  8    988
  9    578
  10   985
  11   103
  12   591
  13   772     x          0.833    0.38
  14   990
 One relevant document is never retrieved, so recall never reaches 100%.
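
A brief sketch that reproduces the cut-off values above and shows why recall tops out at 5/6; the ranks of the relevant documents are taken from the table:

```python
# Sketch: recall/precision at the relevant-document cut-off points of the exercise.
# Relevant documents appear at ranks 1, 2, 4, 6 and 13; one of the 6 relevant
# documents is never retrieved, so recall cannot reach 1.0.

relevant_ranks = [1, 2, 4, 6, 13]
total_relevant = 6

for hits, rank in enumerate(relevant_ranks, start=1):
    print(f"n={rank:2d}: recall = {hits/total_relevant:.3f}, precision = {hits/rank:.2f}")
# Maximum recall over the whole ranking is 5/6 ≈ 0.833.
```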
Single-valued measures

 Single-valued measures: we may want a single value per query to evaluate performance:
 Mean average precision at seen relevant documents,
 Typically averaged over a large set of queries.
 R-Precision:
 Precision at rank R, where R is the total number of relevant documents.
 F-Measure: F = 2PR / (P + R)
 E-Measure: E = 1 − F
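
An illustrative sketch of these single-valued measures for one query, using the ten-document ranking from the earlier example; note that definitions of average precision vary (some divide by the number of relevant documents retrieved, others by the total number of relevant documents).

```python
# Illustrative sketch of the single-valued measures for one query.
# Ranking flags follow the earlier example; total_relevant = 10.
ranking = [True, False, True, False, False, False, True, True, False, True]
total_relevant = 10

# Average precision over the seen (retrieved) relevant documents.
precisions_at_relevant = []
hits = 0
for k, rel in enumerate(ranking, start=1):
    if rel:
        hits += 1
        precisions_at_relevant.append(hits / k)
avg_precision = sum(precisions_at_relevant) / len(precisions_at_relevant)

# R-precision: precision at rank R, where R = total number of relevant documents.
R = total_relevant
r_precision = sum(ranking[:R]) / R

# F-measure and E-measure at the last rank of this list.
precision = hits / len(ranking)          # 5/10
recall = hits / total_relevant           # 5/10
f_measure = 2 * precision * recall / (precision + recall)
e_measure = 1 - f_measure

print(f"AP = {avg_precision:.2f}, R-prec = {r_precision:.2f}, "
      f"F = {f_measure:.2f}, E = {e_measure:.2f}")
```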
Problems with both precision and recall

 The number of irrelevant documents in the collection is not taken into account.
 Recall is undefined when there is no relevant document in the collection; precision is undefined when no document is retrieved.
Other measures
 Noise = retrieved irrelevant docs / retrieved docs.
 Silence/Miss = non-retrieved relevant docs / relevant docs.
 Noise = 1 – Precision; Silence = 1 – Recall
  Miss    = |{Relevant} ∩ {NotRetrieved}| / |{Relevant}|

  Fallout = |{Retrieved} ∩ {NotRelevant}| / |{NotRelevant}|
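
A small sketch of these complementary measures; the collection and sets are invented for illustration.

```python
# Sketch of the complementary measures: noise, silence (miss) and fallout.
collection = set(range(1, 101))          # 100 documents
relevant   = {1, 2, 3, 5, 8}
retrieved  = {2, 3, 4, 8, 9, 10}

noise   = len(retrieved - relevant) / len(retrieved)             # = 1 - precision
silence = len(relevant - retrieved) / len(relevant)              # = 1 - recall (miss)
fallout = len(retrieved - relevant) / len(collection - relevant)

print(f"noise = {noise:.2f}, silence = {silence:.2f}, fallout = {fallout:.3f}")
```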
Programming Assignment

 Select the language that interests you and design an IR system for a document collection written in that language.
 Form a group of not more than three members.
1. Construct an indexing structure:
 Given a text document collection, generate index terms and organize them using inverted file indexing; include TF, DF and CF for each index term, and the position/location of terms in each document.

2. Develop a vector space retrieval model:
 Using the vector space model, formulate any two queries (with two or more words each) and retrieve the relevant documents in ranked order.
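
A minimal starting-point sketch (not a full solution) of the two components, using a toy collection and naive whitespace tokenization; stop-word removal, stemming and the actual document collection are left to the group.

```python
# Minimal sketch for the assignment: an inverted index with TF, DF, CF and
# term positions, plus cosine-based vector-space retrieval over a toy collection.
import math
from collections import defaultdict

docs = {                                  # placeholder collection
    "d1": "information retrieval evaluates retrieval effectiveness",
    "d2": "precision and recall measure retrieval effectiveness",
    "d3": "vector space model ranks documents by cosine similarity",
}

# index[term][doc_id] = list of positions; TF, DF and CF are derived from it.
index = defaultdict(dict)
for doc_id, text in docs.items():
    for pos, term in enumerate(text.lower().split()):
        index[term].setdefault(doc_id, []).append(pos)

N = len(docs)

def tf(term, doc_id): return len(index[term].get(doc_id, []))        # term frequency
def df(term): return len(index[term])                                # document frequency
def cf(term): return sum(len(p) for p in index[term].values())       # collection frequency

def tfidf(term, doc_id):
    return tf(term, doc_id) * math.log10(N / df(term)) if df(term) else 0.0

def cosine_rank(query):
    """Rank documents by TF-IDF dot product with 0/1 query weights, length-normalised."""
    scores = defaultdict(float)
    for term in query.lower().split():
        for doc_id in index.get(term, {}):
            scores[doc_id] += tfidf(term, doc_id)
    for doc_id in scores:
        length = math.sqrt(sum(tfidf(t, doc_id) ** 2 for t in index))
        scores[doc_id] /= length or 1.0
    return sorted(scores.items(), key=lambda x: x[1], reverse=True)

print(cosine_rank("retrieval effectiveness"))
```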
Question & Answer

Thank You !!!
