0% found this document useful (0 votes)
18 views9 pages

Information Retrieval: by Akanksha Singh M.Tech CS

The document discusses information retrieval and provides an overview of key concepts like structured vs unstructured data, keywords, repositories of documents, and the goal of information retrieval which is to find relevant documents from a large set. It uses examples like Google to illustrate main points.

Uploaded by

Akanksha Singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
18 views9 pages

Information Retrieval: by Akanksha Singh M.Tech CS

The document discusses information retrieval and provides an overview of key concepts like structured vs unstructured data, keywords, repositories of documents, and the goal of information retrieval which is to find relevant documents from a large set. It uses examples like Google to illustrate main points.

Uploaded by

Akanksha Singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 9

INFORMATION

RETRIEVAL
By Akanksha Singh
M.Tech CS
INTRODUCTION

 “Information retrieval (IR) is a technique of finding material (such as


documents) of an unstructured nature (usually text) that satisfies an
information need from large collection of data (usually stored on computers
or internet).”
WHY IR?

 Web sites increasing sharply


 Internet users increasing continuously
 Current web (1 billion users more than 1000 billion pages)
 Google
3 billion documents indexed
10-20 TB of text on web
Billion TB of information produced every year
KEYWORDS IN IR

 A large repository of documents are stored on computers(Corpus).


 There is topic about which I desire to get information (information need).
 Some of the documents may contain the information that satisfies my
need(relevance).
 How do I retrieve these documents?
 I communicate my information need in the form of a query.
STRUCTURED DATA

 Structured data allows for expressive queries like:


 Give me the social security number of all the employees who have stayed
with company for mare than 5 years, and whose yearly salaries are three
standard deviations above average salary.

Employees Manager Salary

Smith Hari 5000000

Honi Ravee 80000

Jones Ree 100000


UNSTRUCTURED DATA

 Unstructured data does not have clear, overt semantic structure (e.g, free
text on a web page, audio, video).
 Allow less expressive queries
 Give me all documents that have keywords “These Romans are crazy”.

Structured data Database Systems

Unstructured data Information retrieval


THE GOAL OF IR

 Goal : Find documents relevant to an information need from a large document


set
EXAMPLE

GOOGLE

WEB

You might also like