(Fhi) Chapter 3
• List components of IR
• Describe types of IR
Information Retrieval(IR) and EBM
• Introduction to information retrieval
• Information is processed data like news or
facts about something
• This information can be represented in
the form of:
• Text, image, audio, video
• XML and structured documents
• Source codes
• Applications/web services
• Retrieval
• Fetching something that has been stored
• Main objective of IR
• Provide the users with effective access to and
interaction with information resources
• Goal of IR
• To search large document collection to retrieve
small subsets relevant to user’s information need
• Purpose/role of an IR system
• An information retrieval system is designed to retrieve the
document or information required by the user community
• It should make the right information available to the right
user
• It aims at collecting and organizing information in one or
more subject areas in order to provide it to the user as soon
as possible.
• It serves as a bridge between the world of creators or
generators of information and the users of that information
• Application areas within Information
retrieval:
• Cross language retrieval
• Speech/broadcast retrieval
• Text categorization
• Text summarization
• Structured document element retrieval (XML)
• Information Retrieval vs information extraction
• Information retrieval
• given a set of query terms and a set of document
terms, select only the most relevant documents
• Information extraction
• extract from the text what the document
means
Difference between data retrieval and information retrieval
Parameter | Database/data retrieval | Information retrieval
Example | Database query | WWW search
What we are retrieving | Structured data | Mostly unstructured data
Queries we are posing | Formally defined, unambiguous queries | Expressed in natural language
Matching | Exact | Partial, best match
Inferences | Deduction | Induction
Model | Deterministic | Probabilistic
• Kinds of information retrieval systems
• In house information retrieval systems
• are set up by a particular library or information center
to serve mainly the users within the organization
• One particular type of database is the library catalogue
• Online information retrieval systems
• retrieve data from websites, web pages and servers
that may include databases, images, texts, tables, etc.
• Features of an information retrieval system
• an effective information retrieval system must
have provisions for:
• Prompt dissemination of information
• Filtering of information
• The right amount of information at the right time
• Active switching of information
• Receiving information in an economical way
• Browsing
• Getting information in an economical way
• Current literature
• Access to other information systems
• Interpersonal communications
• Personalized help
• What is information retrieval (IR)?
• A very broad term
• IR is finding material (usually documents) of an
unstructured nature (usually text) that satisfies
an information need from within large
collections (usually stored on computers)
[Figure: EBM diagram; surviving labels: "Patient values and expectation", "Best external evidence"]
• What is Evidence Based Medicine (EBM)?
• The three fundamental principles of EBM are:
A/ Optimal clinical decision making requires
awareness of the evidence
B/ EBM provides guidance to decide whether
evidence is more or less trustworthy
• How confident can we be:
• of the properties of diagnostic tests
• of our patients’ prognoses
• of the impact of our therapeutic options
C/ Evidence alone is never sufficient to make a
clinical decision
• Decision makers must always trade off:
• High heterogeneity
• Retrieval strategies
Types of Information Retrieval
• Retrieval
• the two broad approaches to information
retrieval are:
• Exact-match searching
• allows the user precise control over the items
retrieved
• Partial-match searching
• recognizes the inexact nature of both indexing and
retrieval
• attempts to return content ranked by how closely it
matches the user’s query
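The contrast between the two approaches can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the three documents, their IDs, and the function names are made up, and the score used here (the number of distinct query terms a document contains) is a deliberately simple stand-in for the ranking functions real systems use.

```python
# Toy collection; the documents and their IDs are hypothetical.
DOCS = {
    "d1": "aspirin reduces the risk of heart attack",
    "d2": "heart failure treatment with beta blockers",
    "d3": "aspirin and heart attack prevention in adults",
}


def tokenize(text):
    """Lowercase the text and split it on whitespace."""
    return text.lower().split()


def exact_match(query, docs):
    """Return IDs of documents that contain every query term."""
    terms = set(tokenize(query))
    return sorted(
        doc_id for doc_id, text in docs.items()
        if terms <= set(tokenize(text))
    )


def partial_match(query, docs):
    """Rank documents by how many distinct query terms they contain."""
    terms = set(tokenize(query))
    scored = []
    for doc_id, text in docs.items():
        overlap = len(terms & set(tokenize(text)))
        if overlap > 0:
            scored.append((doc_id, overlap))
    # Highest score first; ties are broken by document ID.
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))


query = "aspirin heart attack prevention"
print(exact_match(query, DOCS))    # only documents with all four terms
print(partial_match(query, DOCS))  # every overlapping document, best first
```

With this query, the exact-match function returns a single document, while the partial-match function also returns the documents that share only some of the terms, placed lower in the ranking.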
• Exact-Match Retrieval
• In exact-match searching, the IR system gives the
user all documents that exactly match the criteria
specified in the search statement(s).
• Uses Boolean searching - the Boolean operators
AND, OR, and NOT are usually required to create a
manageable set of documents.
• Most of the early operational IR systems in the
1950s through the 1970s used the exact-match
approach
• In modern times, exact-match searching tends to
be associated with retrieval from bibliographic
and annotated databases, while the partial-
match approach tends to be used with full-text
searching.
• Typically the first step in exact-match retrieval is
to select terms to build sets.
• use attributes, such as the author name,
publication type, or gene identifier (in the
secondary source identifier field of MEDLINE)
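As a rough sketch of this first step, the Python below builds one set of record IDs per search term and attribute. The records, the field names (`title`, `author`, `publication_type`), and the helper `build_set` are assumptions made for illustration; they do not reproduce MEDLINE's actual record structure or any real system's API.

```python
# Hypothetical bibliographic records; field names are assumptions.
RECORDS = [
    {"id": 1, "title": "Aspirin in heart disease",
     "author": "Smith J", "publication_type": "Review"},
    {"id": 2, "title": "Statins and stroke",
     "author": "Lee K", "publication_type": "Clinical Trial"},
    {"id": 3, "title": "Aspirin dosing trial",
     "author": "Smith J", "publication_type": "Clinical Trial"},
]


def build_set(records, field, value):
    """Return the IDs of records whose field contains value.

    The comparison is a case-insensitive substring test.
    """
    needle = value.lower()
    return {r["id"] for r in records if needle in r[field].lower()}


aspirin_set = build_set(RECORDS, "title", "aspirin")
smith_set = build_set(RECORDS, "author", "Smith J")
trial_set = build_set(RECORDS, "publication_type", "Clinical Trial")
print(aspirin_set, smith_set, trial_set)
```

Each call produces an independent set, which is what makes the later combination step with Boolean operators straightforward.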
• Once the search term(s) and attribute(s) have been selected,
they are combined with the Boolean operators.
• The Boolean AND operator is typically used to narrow a
retrieval set to contain only documents with two or more
concepts.
• The Boolean OR operator is usually used when there is more
than one way to express a concept.
• The Boolean NOT operator is often employed as a
subtraction operator that must be applied to another set.
• Some systems more accurately call this the ANDNOT
operator.
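The three operators map directly onto set operations, which the following Python sketch illustrates. The sets of document IDs are invented for the example and stand for retrieval sets that a system would have built from search terms.

```python
# Hypothetical retrieval sets: each holds the IDs of matching documents.
aspirin = {1, 3, 5, 8}
acetylsalicylic_acid = {3, 9}
heart_attack = {1, 3, 4, 9}
review = {3, 4}

# OR: more than one way to express a concept, so take the union.
either_name = aspirin | acetylsalicylic_acid

# AND: narrow to documents that contain both concepts (intersection).
both_concepts = either_name & heart_attack

# NOT (ANDNOT): subtract one set from another (set difference).
without_reviews = both_concepts - review

print(sorted(either_name))
print(sorted(both_concepts))
print(sorted(without_reviews))
```

Note that the difference operator needs a left-hand set to subtract from, which is why some systems prefer the name ANDNOT.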
• Some retrieval systems allow terms in searches to be
expanded by using the wild-card character, which adds
all words to the search that begin with the letters up
until the wild-card character. This approach is also called
truncation.
• Unfortunately, there is no standard approach to using
wild-card characters, so syntax for them varies from
system to system.
• PubMed, for example, allows a single asterisk at the end
of a word to signify a wild-card character. Thus, the
query word can* will lead to the words cancer and
Candida, among others, being added to the search.
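Truncation with a trailing asterisk can be sketched as a prefix match against the index vocabulary. The vocabulary and the function `expand_wildcard` below are hypothetical, and the sketch covers only a single asterisk at the end of a word; it does not claim to reproduce how PubMed or any other system implements the feature.

```python
# Hypothetical index vocabulary.
VOCABULARY = ["cancer", "candida", "canal", "cardiac", "scan"]


def expand_wildcard(term, vocabulary):
    """Expand a term ending in '*' to all vocabulary words with that prefix.

    A term without a trailing asterisk matches only itself.
    """
    if term.endswith("*"):
        prefix = term[:-1].lower()
        return sorted(w for w in vocabulary if w.lower().startswith(prefix))
    return [w for w in vocabulary if w.lower() == term.lower()]


print(expand_wildcard("can*", VOCABULARY))
print(expand_wildcard("scan", VOCABULARY))
```

The expanded words would then typically be combined with OR, since they are alternative matches for one search term.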
• Partial-Match Retrieval
• Although partial-match searching was
conceptualized very early, it did not see
widespread use in IR systems until the advent
of Web search engines in the 1990s.