Information Retrieval: Solutions To Practice Exercises

This document provides solutions to practice exercises for an information retrieval chapter. It includes the following: 1) A table showing the results of computing relevance scores for questions against the keywords "SQL" and "relation" using term frequency and equations from the chapter. 2) An algorithm to find all documents containing at least k keywords from a set S. It works by maintaining a list of document identifiers and reference counts that is merged with lists of identifiers for each keyword. 3) Notes on the time complexity of the algorithm being proportional to n times the total number of document identifiers for all keywords.

CHAPTER 19

Information Retrieval

Solutions to Practice Exercises

19.1 We do not consider questions that contain neither keyword, since their
relevance to the keywords is zero. The word count for each question includes
stop words. We use the equations given in Section 19.2.1 to compute relevance;
the logarithm in the equation is taken to base 2.

Q#   #words  #"SQL"  #"relation"  "SQL"       "relation"  "SQL"   "relation"  Total
                                  term freq.  term freq.  relv.   relv.       relv.
 1     84      1         1        0.0170      0.0170      0.0002  0.0002      0.0004
 4     22      0         1        0.0000      0.0641      0.0000  0.0029      0.0029
 5     46      1         1        0.0310      0.0310      0.0006  0.0006      0.0013
 6     22      1         0        0.0641      0.0000      0.0029  0.0000      0.0029
 7     33      1         1        0.0430      0.0430      0.0013  0.0013      0.0026
 8     32      1         3        0.0443      0.1292      0.0013  0.0040      0.0054
 9     77      0         1        0.0000      0.0186      0.0000  0.0002      0.0002
14     30      1         0        0.0473      0.0000      0.0015  0.0000      0.0015
15     26      1         1        0.0544      0.0544      0.0020  0.0020      0.0041
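
The figures in the table appear consistent with a term frequency of log2(1 + occurrences / words) and a per-term relevance equal to that term frequency divided by the number of words in the question. This reading is inferred from the numbers themselves and is an assumption, not a quotation of Section 19.2.1. Under that assumption, a minimal Python sketch of the calculation (all function names are hypothetical) might look like this:

```python
import math


def term_frequency(occurrences: int, total_words: int) -> float:
    """Assumed form: log base 2 of (1 + occurrences / total_words)."""
    return math.log2(1 + occurrences / total_words)


def term_relevance(occurrences: int, total_words: int) -> float:
    """Assumed form: term frequency divided by the question's word count."""
    return term_frequency(occurrences, total_words) / total_words


def total_relevance(counts: list[int], total_words: int) -> float:
    """Sum of the per-term relevance values for one question."""
    return sum(term_relevance(c, total_words) for c in counts)


if __name__ == "__main__":
    # Question 8 in the table: 32 words, 1 "SQL", 3 "relation".
    print(round(term_frequency(1, 32), 4))
    print(round(term_frequency(3, 32), 4))
    print(round(total_relevance([1, 3], 32), 4))
```

The values produced this way agree with the table to within the last printed digit; some table entries look truncated rather than rounded.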

19.2 Let S be a set of n keywords. An algorithm to find all documents that contain
at least k of these keywords is given below.
This algorithm calculates a reference count for each document identifier. A
reference count of i for a document identifier d means that at least i of the
keywords in S occur in the document identified by d. The algorithm maintains a
list of records, each having two fields: a document identifier and the reference
count for this identifier. This list is kept sorted on the document
identifier field.
initialize the list L to the empty list;
for (each keyword c in S) do
begin
    D := the list of document identifiers corresponding to c;
    for (each document identifier d in D) do
        if (a record R with document identifier d is on list L) then
            R.reference_count := R.reference_count + 1;
        else begin
            make a new record R;
            R.document_id := d;
            R.reference_count := 1;
            add R to L;
        end;
end;
for (each record R in L) do
    if (R.reference_count >= k) then
        output R;

Note that executing the second for statement in effect merges the list D into
the list L. Since both lists are sorted, each merge takes time proportional to
the sum of the lengths of the two lists. The list L never holds more entries
than the total number of document identifiers across all keywords in S, and one
merge is performed per keyword, n in all. Thus the algorithm runs in time (at
most) proportional to n times the total number of document identifiers
corresponding to the keywords in S.
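
The merge-based procedure described above can be sketched in Python as follows. This is an illustrative sketch, not the textbook's code: the `postings` dictionary (keyword to sorted list of document identifiers), the function names, and the sample data are all assumptions made for the example, and each identifier list is assumed to be sorted and free of duplicates.

```python
from typing import Dict, List, Tuple

Record = Tuple[int, int]  # (document_id, reference_count)


def merge_counts(records: List[Record], doc_ids: List[int]) -> List[Record]:
    """Merge a sorted identifier list into the sorted record list.

    Runs in time proportional to len(records) + len(doc_ids).
    """
    merged: List[Record] = []
    i = j = 0
    while i < len(records) and j < len(doc_ids):
        doc_id, count = records[i]
        if doc_id == doc_ids[j]:
            # Document already on the list: bump its reference count.
            merged.append((doc_id, count + 1))
            i += 1
            j += 1
        elif doc_id < doc_ids[j]:
            merged.append(records[i])
            i += 1
        else:
            # New document: start its reference count at 1.
            merged.append((doc_ids[j], 1))
            j += 1
    merged.extend(records[i:])
    merged.extend((d, 1) for d in doc_ids[j:])
    return merged


def docs_with_at_least_k(
    postings: Dict[str, List[int]], keywords: List[str], k: int
) -> List[int]:
    """Return identifiers of documents containing at least k of the keywords."""
    records: List[Record] = []
    for keyword in keywords:
        records = merge_counts(records, postings.get(keyword, []))
    return [doc_id for doc_id, count in records if count >= k]


if __name__ == "__main__":
    # Hypothetical inverted index used only for illustration.
    postings = {"sql": [1, 3, 5], "relation": [1, 2, 5], "index": [2, 5, 7]}
    print(docs_with_at_least_k(postings, ["sql", "relation", "index"], 2))
```

Building a fresh merged list on each pass keeps the record list sorted without searching it, which is what gives each pass its linear cost.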
19.3 No answer
19.4 No answer
19.5 No answer
