0% found this document useful (0 votes)
315 views4 pages

CS F469 Handout

This document provides details about the Information Retrieval course offered at BIRLA INSTITUTE OF TECHNOLOGY AND SCIENCE, Pilani. The course covers key concepts in information retrieval including Boolean retrieval models, indexing, scoring and weighting, vector space models, probabilistic models, evaluation methods, text mining, web search, link analysis, cross-language IR, and recommender systems. The course is divided into 5 modules taught over 42 lectures. Student performance will be evaluated through a mid-semester test, quizzes/assignments, and a comprehensive final exam.

Uploaded by

Sania Agrawal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
315 views4 pages

CS F469 Handout

This document provides details about the Information Retrieval course offered at BIRLA INSTITUTE OF TECHNOLOGY AND SCIENCE, Pilani. The course covers key concepts in information retrieval including Boolean retrieval models, indexing, scoring and weighting, vector space models, probabilistic models, evaluation methods, text mining, web search, link analysis, cross-language IR, and recommender systems. The course is divided into 5 modules taught over 42 lectures. Student performance will be evaluated through a mid-semester test, quizzes/assignments, and a comprehensive final exam.

Uploaded by

Sania Agrawal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

BIRLA INSTITUTE OF TECHNOLOGY AND SCIENCE, Pilani

Pilani Campus
AUGS/ AGSR Division

SECOND SEMESTER 2019-20


COURSE HANDOUT
Date: 14.01.2020

In addition to part I (General Handout for all courses appended to the Time table) this portion gives further
specific details regarding the course.
Course No : CS F469
Course Title : Information Retrieval
Instructor-in-Charge : Abhishek ([email protected])

1. Course Description:
This course studies the theory, design, and implementation of text-based information systems. The
Information Retrieval core components of the course include statistical characteristics of text, representation
of information needs and documents, several important retrieval models (Boolean, vector space,
probabilistic, inference net, language modeling, link analysis), clustering algorithms, collaborative filtering,
automatic text categorization, and experimental evaluation. The software architecture components include
design and implementation of high-capacity text and multimedia retrieval and filtering systems.

2. Scope and Objective of the Course:

The course is designed to provide students with a broad understanding in the design and use of information
retrieval techniques. The course also aims at providing a holistic view of information retrieval, which
includes several retrieval concepts and techniques such as representation and indexing of data, text mining,
websearch: basics and advances, multimedia retrieval, etc.

3. Text Books​:

T1.​ C. D. Manning, P. Raghavan and H. Schutze. Introduction to Information Retrieval, Cambridge


University Press, 2008. ​http://nlp.stanford.edu/IR-book/

4. Reference Books:
R1:​ Modern Information Retrieval, Ricardo Baeza-Yates and Berthier Ribeiro-Neto, Addison-Wesley, 2000.
http://people.ischool.berkeley.edu/~hearst/irbook/
R2:​ Search Engines: Information Retrieval in Practice by Bruce Croft, Donald Metzler, and Trevor
Strohman, Addison-Wesley, 2009.
R3:​ Cross-Language Information Retrieval by By Jian-Yun Nie Morgan & Claypool Publisher series 2010
R4:​ Multimedia Information Retrieval by Stefan M. Rüger Morgan & Claypool Publisher series 2010.
R5​ Ricci, F.; Rokach, L.; Shapira, B.; Kantor, P.B. (Eds.), Recommender Systems Handbook. 1st Edition.,
2011, 845 p. 20 illus., Hardcover, ISBN: 978-0-387-85819-7

5. Course Plan:
1
BIRLA INSTITUTE OF TECHNOLOGY AND SCIENCE, Pilani
Pilani Campus
AUGS/ AGSR Division

Module No. Lecture Session Reference Learning outcomes

M1: Lecture 1: T1 Ch1 Introduction to the


Basic Course Overview course
Information
Retrieval ​ ectures 2-4:
L T1 Ch 1 & The term vocabulary
Concepts Boolean retrieval 2, R1 2.5 postings lists and
introduction to ad-hoc
search
Lectures 5-6: T1 Ch 3 Wildcard queries,
Dictionaries and Spelling correction, Edit
tolerant retrieval distances, Phonetic
correction
Lectures 7-9: T1 Ch 4 Blocked sort-based
Index construction indexing, Single-pass
in-
memory indexing,
Distributed indexing,
Dynamic indexing
Lectures 10-11: T1 Ch 6 Learning weights, Term
Scoring, term frequency and
weighting weighting, tf-idf
weighting
Lecture 12: T1 Ch 6 Dot products, Queries
The vector space model as vectors,
for scoring Variant tf-idf functions,
Document and query
weighting schemes
Lecture 13: T1 Ch 8 Evaluation of unranked
Evaluation of IR retrieval sets
Evaluation of ranked
retrieval sets
Lectures 14-15: T1 Ch 11 Probabilistic
Probabilistic Model Information retrieval
M2: Lectures 16-21: T1 Ch 13, Text Classification,
Text Mining Text Mining 14, 16,17 Text Clustering
M3: Lectures 22-25: T1 Ch 19 Search Engine
Web Search Web search basics R1 Ch13, Architecture
and Link R2 Ch2 Web characteristics
Analysis The search user
experienceIndex size
and
estimation
Lectures 26-28: T1 Ch 20 Crawling, Crawler
Web crawlers and R2 Ch 3 architecture,
indexes Distributing
2
BIRLA INSTITUTE OF TECHNOLOGY AND SCIENCE, Pilani
Pilani Campus
AUGS/ AGSR Division

indexes
Lectures 29-31: T1 Ch 21 The Web as a
Link Analysis graph,Google’s
Pagerank,
Hub and authorities
(HITS), Web spam
M4: Lectures 32-35: R3 Ch2 Language Problems in
Multimedia Cross Language IR, Translation
and Cross Information Retrieval Approaches for CLIR,
Lingual IR (CLIR) Handling many
Languages, Using
manually constructed
Translation systems and
resources for CLIR,
Research issues
Lectures 36-40: R4 Ch2,3 Basic Multimedia
Multimedia search technologies,
Information retrieval Content based Retrieval,
(MIR) Research issues in
MIR
M5: Lectures 40-42: R5 Introduction to
Recommender Recommender systems Ch1,2,3,4,5 recommendation
Systems systems,
Collaborative, Content,
Knowledge and
Hybrid recommendation
systems

6. Evaluation Scheme​:
Component Duration Weightag Date & Time Nature of component
e (%) (Close Book/ Open Book)
Mid-Semester Test 90 Min. 25 2/3 09:00 - 10:30 AM Closed Book
Quiz(es)/Assignments --- 35 To be announced
/Notes
Comprehensive 3h 40 01/05 FN Partly Open
Examination

7. Chamber Consultation Hour​:


To be announced in the class.
8. Notices:
All the notices concerning this course will be displayed on the CSIS notice board or course website.
9. Make-up Policy:
Prior Permission is a must and Make-up shall be granted only in genuine cases based on individual’s needs
and circumstances.
3
BIRLA INSTITUTE OF TECHNOLOGY AND SCIENCE, Pilani
Pilani Campus
AUGS/ AGSR Division

10. Note (if any):


Assignment(s) (programming/reading) will be given to the students. This will immensely help the students
in gaining a better understanding of the subject.

Instructor-in-charge
Course No. CS F469

You might also like