Group 5 Assignment for IR
3. Desta Nigusse------------------------------------117/11
4. Tefera Misganaw--------------------------------325/11
5. Lemlem Kebede---------------------------------215/10
Submission date:- 01/11/21 G.C.
Submitted to:-
2 Introduction
There is a huge amount of data available on the Internet, and it is growing exponentially. This
unconstrained growth of information has not been accompanied by an analogous expansion of
approaches for extracting relevant information. Often, a web search does not yield relevant
results. There are multiple reasons for this. First, the keywords submitted by the user can be
related to multiple topics, so the search results are not focused on the topic of interest. Second,
the query can be too short to express appropriately what the user is looking for. This can happen
simply as a matter of habit (the average size of a web search is 2.4 words). Third, users are often
not sure what they are looking for until they see the results. Fourth, even when users know
what they are searching for, they may not know how to formulate the appropriate query.
Query expansion reformulates the user's original query to enhance information retrieval
effectiveness. Let a user query consist of n terms, $Q = \{t_1, t_2, \ldots, t_i, t_{i+1}, \ldots, t_n\}$.
The reformulated query can have two components: addition of new terms
$T' = \{t'_1, t'_2, \ldots, t'_m\}$ from the data source(s)
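The addition of new terms to a query can be illustrated with a minimal sketch. It assumes the expansion terms have already been selected from some data source; the function name `expand_query` and the sample terms are hypothetical and serve only as an illustration.

```python
def expand_query(query_terms, new_terms):
    """Return the original query terms followed by the new terms,
    skipping any term that is already present (case-insensitive)."""
    seen = set()
    expanded = []
    for term in list(query_terms) + list(new_terms):
        key = term.lower()
        if key not in seen:
            seen.add(key)
            expanded.append(term)
    return expanded


# Original user query (hypothetical example).
original_terms = ["car", "insurance"]
# New terms assumed to come from a data source (hypothetical example).
new_terms = ["automobile", "vehicle", "car"]

print(expand_query(original_terms, new_terms))
```

The original terms are kept first so that the user's own wording is preserved; a term that is already in the query is not added a second time.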
Content-based filtering
A content discovery platform is an implemented software recommendation platform which uses
recommender system tools. It utilizes user metadata in order to discover and recommend
appropriate content and
Collaborative filtering
Collaborative filtering (CF) is a technique used by recommender systems.[1] Collaborative
filtering has two senses, a narrow one and a more general one.[2]
In the newer, narrower sense, collaborative filtering is a method of making automatic
predictions (filtering) about the interests of a user by collecting preferences or taste
information from many users (collaborating).
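The idea of predicting one user's interest from the preferences of many users can be sketched as follows. This is only one simple variant (user-based filtering with cosine similarity computed over the items both users have rated), chosen here for illustration; the function names and the rating data are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity over the items that both users have rated."""
    common = set(a) & set(b)
    if not common:
        return 0.0
    dot = sum(a[i] * b[i] for i in common)
    norm_a = math.sqrt(sum(a[i] ** 2 for i in common))
    norm_b = math.sqrt(sum(b[i] ** 2 for i in common))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def predict_rating(ratings, user, item):
    """Similarity-weighted average of other users' ratings for `item`.

    Returns None when no other user with a non-zero similarity rated it.
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for other, other_ratings in ratings.items():
        if other == user or item not in other_ratings:
            continue
        sim = cosine_similarity(ratings[user], other_ratings)
        weighted_sum += sim * other_ratings[item]
        weight_total += sim
    if weight_total == 0:
        return None
    return weighted_sum / weight_total


# Hypothetical ratings: user -> {item: rating on a 1-5 scale}.
ratings = {
    "alice": {"a": 5, "b": 3},
    "bob": {"a": 5, "b": 3, "c": 4},
    "carol": {"a": 1, "b": 5, "c": 2},
}

print(predict_rating(ratings, "alice", "c"))
```

Because bob's ratings agree with alice's more closely than carol's do, bob's rating of item "c" receives the larger weight in the prediction.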
Plagiarism detection
Event search
Text classification
Patent retrieval
Dynamic processes in IoT
Classification in e-commerce, etc.
One-to-One Association. Uses a resource such as WordNet to find synonyms and similar terms
for the query terms.
One-to-Many Association. Correlates one query term to many expanded query terms.
Feature Distribution of Top Ranked Documents. Deals with the top retrieved documents
from the initial query and considers the top weighted terms from these documents.
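Looking up synonyms for each query term can be sketched as below. To keep the example self-contained, a small hand-built dictionary stands in for a resource such as WordNet; `TOY_THESAURUS`, its entries, and the function name `synonym_candidates` are assumptions made for illustration only.

```python
# Hypothetical hand-built thesaurus standing in for a resource like WordNet.
TOY_THESAURUS = {
    "car": ["automobile", "vehicle"],
    "cheap": ["inexpensive", "affordable"],
}


def synonym_candidates(query_terms, thesaurus):
    """Map each query term to the related terms listed for it.

    Terms that are missing from the thesaurus map to an empty list.
    """
    return {term: thesaurus.get(term, []) for term in query_terms}


candidates = synonym_candidates(["cheap", "car", "rental"], TOY_THESAURUS)
for term, related in candidates.items():
    print(term, "->", related)
```

Each candidate term is tied to the single query term it was looked up for, so a term with no entry (here "rental") simply contributes nothing to the expansion.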
1 Global analysis
In global analysis, query expansion techniques implicitly select expansion terms from hand-built
knowledge resources or from large corpora for expanding or reformulating the initial query.
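Selecting expansion terms from a corpus can be sketched with simple co-occurrence counts. This is a minimal illustration, assuming that terms appearing in the same document are related; the tiny corpus and the function names are hypothetical, and a real corpus would be far larger.

```python
from collections import Counter
from itertools import combinations


def build_cooccurrence(corpus):
    """Count how often two distinct terms appear in the same document."""
    counts = Counter()
    for doc in corpus:
        terms = sorted(set(doc.lower().split()))
        for a, b in combinations(terms, 2):
            counts[(a, b)] += 1
    return counts


def related_terms(term, counts, top_k=2):
    """Return the top_k terms that co-occur most often with `term`."""
    scores = Counter()
    for (a, b), n in counts.items():
        if a == term:
            scores[b] += n
        elif b == term:
            scores[a] += n
    # Sort by count (descending), then alphabetically for a stable order.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [t for t, _ in ranked[:top_k]]


# Hypothetical corpus; the counts are built once, independent of any query.
corpus = [
    "car engine repair",
    "car engine oil",
    "car rental price",
    "bank loan price",
]
counts = build_cooccurrence(corpus)

print(related_terms("car", counts, top_k=1))
```

The co-occurrence table is computed over the whole collection before any query arrives, which is what distinguishes this approach from methods that look only at the documents retrieved for a particular query.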
2 Local analysis
Local analysis includes query expansion techniques that select expansion terms from the
collection of documents retrieved in response to the user's initial (unmodified) query.
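Choosing expansion terms from the documents retrieved for the initial query can be sketched as follows. The sketch assumes the documents are already ranked by some retrieval system and uses raw term frequency as a stand-in for a proper term weight; the function name, parameters, and sample documents are hypothetical.

```python
from collections import Counter


def local_expansion_terms(query_terms, ranked_docs, top_docs=2, top_terms=2):
    """Pick the most frequent non-query terms from the top-ranked documents."""
    query = {t.lower() for t in query_terms}
    counts = Counter()
    for doc in ranked_docs[:top_docs]:
        for term in doc.lower().split():
            if term not in query:
                counts[term] += 1
    # Sort by frequency (descending), then alphabetically for a stable order.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [t for t, _ in ranked[:top_terms]]


# Hypothetical result list for the initial query "jaguar", best match first.
ranked_docs = [
    "jaguar speed cat habitat",
    "jaguar cat prey habitat",
    "jaguar car dealer",
]

print(local_expansion_terms(["jaguar"], ranked_docs))
```

Only the top-ranked documents contribute terms, so the third document is ignored here. A practical version would also filter stop words and use a weighting scheme rather than raw counts.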
9 References
https://en.wikipedia.org/wiki/Query_expansion
https://queryunderstanding.com/query-expansion-2d68d47cf9c8
https://nlp.stanford.edu/IR-book/html/htmledition/query-expansion-1.html
Abdulla, A.A.A., Lin, H., Xu, B., Banbhrani, S.K.: Improving biomedical information retrieval by linear
combinations of different query expansion techniques. BMC Bioinformatics 17(7), 238 (2016)