0% found this document useful (0 votes)
5 views4 pages

Information Retrieval Systems and Web Search Engin

The paper surveys information retrieval systems (IRS) and web search engines, highlighting their architectures and challenges in effectively retrieving relevant information. It discusses the importance of indexing, query processing, and user guidelines for optimizing search results. The authors also compare various search engines, including Google, Bing, and DuckDuckGo, emphasizing their unique features and functionalities.

Uploaded by

adekarjayesh27
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views4 pages

Information Retrieval Systems and Web Search Engin

The paper surveys information retrieval systems (IRS) and web search engines, highlighting their architectures and challenges in effectively retrieving relevant information. It discusses the importance of indexing, query processing, and user guidelines for optimizing search results. The authors also compare various search engines, including Google, Bing, and DuckDuckGo, emphasizing their unique features and functionalities.

Uploaded by

adekarjayesh27
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/319100659

Information Retrieval systems and Web Search Engines: A Survey

Conference Paper · January 2017


DOI: 10.22161/ijaers/nctet.2017.25

CITATIONS READS
2 4,538

3 authors, including:

Jabbar Akhil
Vardhaman College of Engineering
97 PUBLICATIONS 2,812 CITATIONS

SEE PROFILE

All content following this page was uploaded by Jabbar Akhil on 30 October 2017.

The user has requested enhancement of the downloaded file.


Trends in Engineering and Technology (NCTET-2K17)
International Journal of Advanced Engineering Research and Science (IJAERS) Special Issue-3
https://dx.doi.org/10.22161/ijaers/nctet.2017.25 ISSN: 2349-6495(P) | 2456-1908(O)

Information Retrieval systems and Web Search


Engines: A Survey
R. Arun Kumar, M. A. Jabbar, Y.V. Bhaskar Reddy
1
Department of CSE, Vardhaman College of engineering, Hyderabad-501218
Email: [email protected]
1
Department of CSE, Vardhaman College of engineering, Hyderabad-501218
Email: [email protected]
1
Department of CSE, Vardhaman College of engineering, Hyderabad501218
Email: [email protected]
Abstract: crawl the web and fetch web pages. The words
Information retrieval systems (IRS) are field used in these web pages are added to the index
concerned with retrieval of information. A search along with where the words came from.
engine is the application of IR techniques. A web
search engine is a tool to find information on the Web search engines, must be able to crawl, and
www. Search engines are updating their index to should provide respond to millions of queries .To
the World Wide Web. In this paper we review engineer a search engine is a challenging task.
search engines and their architectures in brief and Web Search engines index millions of web pages.
suggest some guidelines for the users. The World Wide Web Worm (WWWW) had an
index of 110,000 web pages [2]. With the
1. Introduction: increasing number of users on the web, and due to
automated systems which query search engines,
An information retrieval system (IRS) is a field top search engines are handling hundreds of
concerned with retrieval of information. millions of queries per day.
Information retrieval includes a wide range of Today, most of the search engines are based in
information. Applications of information retrieval the U.S. The users search documents by
involve multimedia documents with structure, keywords. However, there are other search
significant text content, and other media. engines in other languages such as Chinese,
Dimensions of IR are listed in table 1. Korean, and Japanese.
Table 1. Some dimensions of information
retrieval [1] Problems faced by users when facing search
Examples of Examples of Examples of engines.
Content Applications Tasks
Text Web search Adhoc search
1. The users generally do not know how to
Images Vertical search filtering
Video Enterprise search classification Search.
Scanned Question 2. The user cannot perform advanced searching
Desktop search
documents answering 4. Many users only look at the first page [3]
Audio Peer to peer search music Guidelines helping users to search.
1. Clearly specify the words
Relevance is an important concept in information 2. Provide as many particular terms as possible
retrieval. Search engines are constantly building 3. Some search engines are specialized in some
and updating their index to the World Wide Web. areas.
Spiders are used to 2. Search Engine Architectures

www.ijaers.com Page |123


Trends in Engineering and Technology (NCTET-2K17)
International Journal of Advanced Engineering Research and Science (IJAERS) Special Issue-3
https://dx.doi.org/10.22161/ijaers/nctet.2017.25 ISSN: 2349-6495(P) | 2456-1908(O)
This section discusses architecture of search Table 2: Search engines
engines.
Architecture is designed to satisfy the goals. Search
Description
Goals of a search engine Engine
• Effectiveness: retrieve the relevant set of Google is the most popular search
documents possible for a query. Google
engine globally.
• Efficiency: process queries from users [4] Launched by Microsoft.
Bing
Also delivers Yahoo’s results.
Actions performed by the search engine It was originally known as Ask
Ask
Jeeves.
1. looks for the keyword in the index It was launched by Digital
2. Web crawler search for the information. AltaVista Equipment Corporation, it is
3. search engine shows the relevant web powered by Yahoo
pages after web crawler finds the pages AOL.Search It is powered by Google.

Major functions supported by search engines 2.1. AltaVista Architecture

1) Indexing process 2) Query process. The crawler sends requests to remote Web
The indexing builds the structures, and the query servers. The index is used to reply queries from
process produce documents in ranked order users. Figure 3 shows software architecture of
AltaVista. Query engine and user interface are in
first part. The second part contains the indexer
and crawler [5].

Figure 1.Index process

Figure 3: Architecture of AltaVista search engine.


2.2. Google

The word Google comes from the word googol,


which means 10100. 64.0% of searches were
powered by Google. [6].
Figure 2 query process
Google is written in C/C++.
Figure 3 shows architecture of Google..
Following are the several search engines available
today:

www.ijaers.com Page |124


Trends in Engineering and Technology (NCTET-2K17)
International Journal of Advanced Engineering Research and Science (IJAERS) Special Issue-3
https://dx.doi.org/10.22161/ijaers/nctet.2017.25 ISSN: 2349-6495(P) | 2456-1908(O)
is a big challenge. In this research paper we found
that the users on average use two to three
keywords query for search and there is
vocabulary gap between user query and keywords
used in the document.

References

[1] W. Bruce Croft et.al Search Engines


Information Retrieval in Practice, Pearson
education 2015

[2] McBryan 94 Oliver A. McBryan. GENVL and


WWWW: Tools for Taming the Web. First
International Conference on the World Wide
Web. CERN, Geneva (Switzerland), May 25-26-
2.3. Bing 27 1994.
http://www.cs.colorado.edu/home/mcbryan/mypa
Bing is Microsoft’s attempt to challenge Google pers/www94.ps
in the area of search .
[3]Sunny lam,” The Overview of Web Search
2.4. Yahoo Engines”, February 2001

Yahoo search is powered by Bing. Yahoo is still [4] J Pei, “Information retrieval and web search
the most popular email provider. architecture”, Lecture slides, 2017

2.5. Ask.com [5] R. Baeza-Yates and B. Ribeiro-Neto.


Modern Information Retrieval. Addison Wesley,
Formerly known as Ask Jeeves. ASK is based on New York, NY, USA, 1999
a question/answer format. It lack quality [6] Comscore report, February 2016
compared to Google,Bing and Yahoo.

2.6. DuckDuckGo

Have a number of advantages over the other


search engines. 1) It has a clean interface 2) it
does not track users 3) it is not fully loaded with
ads

3. Conclusion

Information on the web is diverse in content and


catering to the different information need of users

www.ijaers.com Page |125

View publication stats

You might also like