Search Engine Comparison

For Advanced Database Systems, CS 2131-8070580-sec1, December 2013

A COMPARISON OF SEARCH ENGINES' FEATURES AND MECHANISMS

Amjad J. Khalil, Al-Quds University, Computer Science Department, [email protected]
Fadi K. Abu Alrub, Al-Quds University, Computer Science Department, [email protected]

Abstract

Search engine use is increasing dramatically for both personal and professional purposes. It has become necessary for users to understand the differences between search engines in order to achieve the highest satisfaction. It is therefore important to evaluate and compare search engines in pursuit of the one that best satisfies the needs of the user.

The aim of this paper is to provide information about how search engines work and the results they return. The paper presents various evaluation methodologies for estimating the capabilities of search engines. This should help users assess and choose the search engine appropriate to their own needs.

Keywords: Search Engine, comparison, Page Rank, WebCrawler, features, indexing.

1. Introduction

People share information over the World Wide Web, so web-based information forms a large global database. Users need to find what they are looking for easily and quickly, which is why many search engines are available today. Search engines essentially act as filters for the wealth of information available on the Internet; they allow users to quickly and easily find information that is of genuine interest or value to them [5]. Web content includes text and multimedia information (images, videos, sounds, and graphics) in many formats, so the key to the search process is a user-friendly interface with adequate features. The user writes a textual query in the search box of a search engine web page; the engine matches the query against the information gathered by its web crawler, retrieves it, and returns a list of results. Search engines are programs that search documents for specified keywords and return a list of the documents where the keywords were found [[1]]. Over 80% of Web searchers use Web search engines to locate online information services (Nielsen Media).

Not every result returned for a query is what the user is looking for, and because anyone can be an author on the web, some results are far from what the user needs. Bogus pages stuffed with fake keywords have led to the phenomenon of spamming. The term web spam refers to hyperlinked pages on the World Wide Web that are created with the intention of misleading search engines [7]. Other spammers create link farms to increase the popularity or reputation ("page rank") of a page for their own benefit. Malicious web spam is a serious threat because it tries to weaken the neutral searching and ranking facilities provided by the engines.

In this paper, we run experiments to check for significant differences in the result pages when Boolean operators and other features are used. The large-scale search engines we chose for our experiments are Google, Yahoo, Bing, Ask, and DuckDuckGo. We also describe how a search engine works, look at the architecture of two major web search engines, Google and Yahoo, and present some statistics that support our experimental results.

2. Research Questions

• Why are there various search engines?
• Which features does the average user look for most?
• What are the reasons behind the uniqueness and distinction of some search engines?
• Do search engines offer the same results?
• Where is the future of search engines heading?
3. Related Works

Most articles agree on the importance of search engines for web mining, given the huge growth of data on the World Wide Web. A variety of search engines offer diversified services to their users [10]. Many comparative studies have been carried out to clarify and explain the preference for one search engine over others, taking into account the many features that search engine companies compete to create and enhance in order to keep a head start. Popularity, usability, and website quality are the goals these engines pursue in order to excel.

Other studies focus on the mechanism by which search engines work; all of them share the main parts of Web crawling (or spidering), Web indexing, and searching.

[Figure 1 diagram labels: The Web; Crawl; Web document IDs; Create inverted index; Inverted index; Search engine servers; User query; Rank results]
Figure 1: Simple Mechanism Representation of How Search Engine Works.

Web mining consists of three branches: web content mining (WCM), web structure mining (WSM), and web usage mining (WUM) [1]. WCM explores the proper and relevant information in the contents of the web. WSM finds the relations between different web pages by processing the structure of the web. WUM records the user profile and user behavior in the log files of the web. As information on the web scales up, search engines must accommodate this scaling, which is why the architecture of large-scale search engines includes many distributed servers. Many papers discuss how to improve the performance of web search engines with respect to the user interface and query input, the filtering of output results, and the algorithms for web page spidering and collecting, indexing, and output [15].

4. Methodology

Five experiments covering different kinds of information were carried out. We picked five target pages containing different kinds of information and data files: text, audio, photo, article, and a personal information page. For each target page we collected five keywords and then applied two queries, one simple and one complex, in each of the five search engines being compared. One examination checks the effect of adding Boolean logic (conjunction AND, disjunction OR, inurl, intitle, …) to the query text. The following table shows the target pages and keywords selected for each query:

Experiment 1
Target page 1: http://en.wikipedia.org/wiki/Troy
Here we are looking for text information about Troy.
Keywords selected: [1. Greek 2. Homeric 3. war 4. ancient 5. troy]
First query: [troy]
Second query: [troy Greek Homeric war ancient]

Experiment 2
Target page 2: http://www.englishbaby.com/findfriends/view_photo/683675?_sp_album=19680
Here we are looking for a photo file of Yassir Arafat and Ahmad Yaseen.
Keywords: [1. yassir arafat 2. ahmed yaseen 3. photo 4. Palestine 5. shymaa]
First query: [yassir arafat ahmed yaseen]
Second query: [yassir Arafat ahmed yaseen photo Palestine shymaa]

Experiment 3
Target page 3: http://www.mp3quran.net/maher.html
Here we are looking for an mp3 file of Quran chapter 96, searching in Arabic.
Keywords: [1. المعيقلي 2. العلق 3. قران 4. mp3 5. تحميل] (in English: Al-Muaiqly, Al-Alaq, Quran, mp3, download)
First query: [المعيقلي العلق]
Second query: [قران العلق المعيقلي mp3 تحميل]

Experiment 4
Target page 4: http://www.alquds.edu/en/faculties/faculty-of-science-technology/department-of-computer-science-it/135-staff/9790-rashid-jayousi.html
Here we are looking for information about a person.
Keywords: [1. Rashid Jayousi 2. Dr 3. Alquds University 4. Staff 5. Palestine]
First query: [Rashid Jayousi Staff]
Second query: [Rashid Jayousi Dr Alquds University Staff]

Experiment 5
Target page 5: ijcer.org/index.php/ojs/article/download/110/37
Here we are looking for a PDF file that contains scientific research.
Keywords: [1. ijcer 2. Vijaya Kumar 3. search engine 4. pdf 5. 2013]
First query: [ijcer search engine Vijaya Kumar pdf 2013]
Second query: using Boolean operators

Table 1: Five Experiments Done to Compare Search Engine Features.

The results obtained are analyzed and compared with respect to several attributes and features, as shown in Table 2.
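As background for the Boolean operators used in the queries of the methodology, the following minimal sketch shows how an inverted index such as the one in Figure 1 can answer AND and OR queries by intersecting or uniting posting sets. It is an illustration only, not the implementation of any of the engines compared here; the function names and the three-document collection are hypothetical.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each lowercased word to the set of document IDs containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in text.lower().split():
            index[word].add(doc_id)
    return index


def search_and(index, terms):
    """Conjunction: documents that contain every term."""
    postings = [index.get(term.lower(), set()) for term in terms]
    return set.intersection(*postings) if postings else set()


def search_or(index, terms):
    """Disjunction: documents that contain at least one term."""
    result = set()
    for term in terms:
        result |= index.get(term.lower(), set())
    return result


# Hypothetical three-document collection, used only for illustration.
DOCS = {
    1: "Troy ancient Greek city",
    2: "Homeric war at Troy",
    3: "ancient Greek war history",
}
INDEX = build_inverted_index(DOCS)

print(search_and(INDEX, ["troy", "war"]))        # documents with both words
print(search_or(INDEX, ["homeric", "history"]))  # documents with either word
```

Adding terms to an AND query can only shrink the result set, which is consistent with the complex queries in the experiments returning far fewer results than the simple ones.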
5. Experiments for Search Engines: Google, Yahoo, Bing, Ask, DuckDuckGo.

Search Engine: Google | Yahoo | Bing | Ask | DuckDuckGo
URL: www.google.com | www.yahoo.com | www.bing.com | www.ask.com | www.duckduckgo.com
Boolean Operators: AND, OR, -, intitle:, inurl:, intext: | AND, OR, (), " ", NOT, Domain: | AND, OR, -, intitle:, site:, " " | OR, -, +, site: | AND, OR, -, "", intitle:, inbody:, site:

Experiment1 About 45,400,000 results (0.40 s) 87,500,000 results 47,700,000 RESULTS Target webpage is 3rd
Target webpage is 1st choice
Target webpage is 2nd choice Target webpage is 2nd choice Target webpage is 6th choice choice
Q1: [troy]
Q2: [troy Greek Homeric About 1,650,000 results (0.27 s)
Target webpage is 1st choice
1,930,000 results
Target webpage is 1st choice
3,010,000 RESULTS
Target webpage is 1st choice
Target webpage is 1st
choice
Target webpage is 1st choice
war ancient]

Experiment2 About 42,700 results (0.16 s)Target 34,200 results

32,200 RESULTS Target webpage is 2nd choice
Target webpage is 6th choice There is Image Category No Image Category but there is
webpage is 11th choice Target webpage is 1st choice
Q1: [yassir Arafat ahmed In Image category, 1st choice In Image category, no results
In Image category not found in Not in the first 10 results category options that take you
the first 20 results to other search engines
yaseen] photo
4,210 RESULTS
Q2:[yassirarafatahmedyas About 7,040 results (0.41 s)
Target webpage is 1st choice
3,900 results
Target webpage is 1st choice
Target webpage is 1th choice There is Image Category Target webpage is 1st choice
eenphoto Palestine shymaa] In Image category, 1st choice also In Image category, no results
In Image category not there 6 Not in the first 10 results
results

Experiment3 About 90,100 results (0.26 s) 3,600 results

Target webpage is 8st choice
134,000 RESULTS
Target webpage is 9th choice
Target webpage is 8st
choice
First choice is video results
Target webpage is 7st choice
Target webpage is 16th choice
Q1: [‫ ]المعيقلي العلق‬mp3 file No Audio category
No Audio category No Audio category No Audio category No Audio category
About 251,000 results (0.16 s)
Q2: [‫ قران العلق المعيقلي‬mp3 Target webpage is 3rd choice 200,000 results 162,000 RESULTS
Not in the first 10 results Target webpage is 2nd choice
‫ ]تحميل‬mp3 file we can specify your search language Target webpage is 2nd choice Target webpage is 1st choice
in Google Preferences
Target webpage is not in
Experiment4 3,580 results 2,050 RESULTS
the first 20 results, but in
About 375 results (0.36 seconds) the Helpful Resources Target webpage is 1st choice
Q1: [Rashid Jayousi Staff] Target webpage is 1st choice
Target webpage is 1st choice Target webpage is 1st choice
option on Ask it gives a
link of target webpage. 1st
Target webpage is not in
Q2: [Rashid Jayousi Dr About 1,650 results (0.32 seconds) 31,400 results 31,100 RESULTS
the first 20 results, but in
Alquds University Staff] the Helpful Resources Target webpage is 1st choice
Target webpage is 1st choice Target webpage is 1st choice Target webpage is 1st choice
option on Ask it gives a
link of target webpage. 2nd

Experiment5 About 104 results (0.40 s)

30,500 results 21,800 RESULTS
Not in the first 20 results
Target web link contain the file is Target web link contain the file is Not in the first 20 resultsand
Q1: [ijcer search Target web link contain the file is not in the first 20 results, and first not in the first 20 results, but first
and first results are a web
links that are not close to
first results are a web links that
not in results, firsts results are a web results are a web links that are not 2 results are a web link very are not close to target.
engineVijaya Kumar pdf target.
links very close to target. close to target. close to target.
2013]
2 RESULTS
Q2: 8 results (0.24 seconds)
Target web link of pdf file is the 1st
23 results
Target webpage is in 1stchoice but
2 RESULTS
Target webpage is in 1stchoice
Target web link of pdf file Target webpage is in 1stchoice
UsingBoolean Operators is the 3rd choice but not a direct link.
choice not a direct link. but not a direct link.
Q2:[pdfinurl:ijcerAND Q2:[ijcer+ "vijayakumar" + Q2:[ijcer "vijayakumar"
Q2:[ ijcer ("vijayakumar" AND Q2:[ijcer ("vijayakumar" AND
"search engine" site:org] site:org AND inbody:WEB
intitle:search engine] "search engine" ) domain:org] "search engine" ) site:org]
SEARCH]

Table 2: Results of the Five Experiments of Searching Different Queries Over the World Wide Web.

The results are analyzed and evaluated by taking into account the total number of results and the speed of each query, and the position of the target page in the results. For each query, a mark of 10 is given for an excellent result, 7 for an average result, and 4 for a poor result. With ten queries, the total results and the target page choice can each contribute up to 100 marks. A mark of up to 60 is given for the availability of categorizations and search tools, and up to 40 for the effectiveness of Boolean operators (Experiment 5, Q2). The total mark for each search engine is then expressed as a ratio out of 300 (the optimal search engine).

Evaluation for                          Google   Yahoo   Bing   Ask      DuckDuckGo
Total results                           96       97      94     70 (a)   70 (a)
Target page choice                      98       91      91     67       91
Categorizations and search tools        60       56      54     55       54
Effectiveness of Boolean operators      40       38      37     38       37
Total                                   294      282     276    230      252
Total result (out of 300)               98%      94%     92%    77%      84%

(a) No statistics for total results; a mark of 70 is given.

Per-query marks for total results:
Google: 9+9+10+10+10+10+9+9+10+10 = 96
Yahoo: 10+9+10+10+9+9+10+10+10+10 = 97
Bing: 9+10+10+10+9+9+10+10+10+7 = 94

Per-query marks for target page choice:
Google: 10+10+9+10+9+10+10+10+10+10 = 98
Yahoo: 10+10+10+10+9+10+10+10+4+8 = 91
Bing: 9+10+9+10+9+10+10+10+7+7 = 91
Ask: 10+10+4+4+7+4+7+7+4+10 = 67
DuckDuckGo: 10+10+10+10+9+10+10+10+4+8 = 91

Totals:
Google: 96+98+60+40 = 294; 294/300 = 98%
Yahoo: 97+91+56+38 = 282; 282/300 = 94%
Bing: 94+91+54+37 = 276; 276/300 = 92%
Ask: 70+67+55+38 = 230; 230/300 = 77%
DuckDuckGo: 70+91+54+37 = 252; 252/300 = 84%

Table 3: Table of Evaluation and Total Results.
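The totals in Table 3 can be rechecked with a few lines of code. The sketch below sums the four per-criterion marks listed for each engine and expresses the sum as a rounded percentage of the 300-mark maximum. The variable and function names are illustrative assumptions; the numbers are the marks stated in the table.

```python
# Marks per engine, in the order: total results, target page choice,
# categorizations and search tools, effectiveness of Boolean operators.
MARKS = {
    "Google": (96, 98, 60, 40),
    "Yahoo": (97, 91, 56, 38),
    "Bing": (94, 91, 54, 37),
    "Ask": (70, 67, 55, 38),
    "DuckDuckGo": (70, 91, 54, 37),
}
MAX_TOTAL = 300  # 100 + 100 + 60 + 40


def total_score(marks):
    """Return (sum of marks, rounded percentage of the 300-mark maximum)."""
    total = sum(marks)
    return total, round(100 * total / MAX_TOTAL)


SCORES = {engine: total_score(marks) for engine, marks in MARKS.items()}

for engine, (total, percent) in SCORES.items():
    print(f"{engine}: {total}/{MAX_TOTAL} = {percent}%")
```

Only the Ask percentage involves rounding (230/300 is about 76.7%); the other four ratios are exact.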
6. Methods and Structure of Search Engines and How They Work

There are differences in the ways various search engines work, but they all perform the following activities:

Figure 2: Characteristics of Search Engines.

6.1 Web Crawling
The first step for a search engine is to browse the World Wide Web in an automated manner and see what is there, based on important words. This is done by a program called a web crawler or spider. Crawlers follow the links on each visited page and index everything they encounter. Web crawlers are mainly used to create a copy of all visited pages for later processing by a search engine, which indexes the downloaded pages to provide fast searches. In general, a crawler starts with a list of URLs to visit, called the seeds. As the crawler visits these URLs, it identifies all the hyperlinks in the page and adds them to the list of URLs to visit, called the crawl frontier. URLs from the frontier are recursively visited according to a set of policies. Because of the large number of pages on the web (more than 20 billion), a web crawler cannot visit all of them daily to check whether new pages have appeared or existing pages have been modified. Web crawlers are an essential part of search engines, and details about their architecture and algorithms are kept secret.

The behavior of a web crawler is defined by a combination of the following policies [10], [11]:
• Selection policy, which states which pages are downloaded to a database,
• Re-visit policy, which states when to check for changes to pages,
• Politeness policy, which states how to avoid overloading Web sites, and
• Parallelization policy, which states how to coordinate distributed Web crawlers.

6.2 Indexing
After a page is crawled, the search engine parses the document to generate an index that points to the corresponding result. The indexed pages are stored in a huge database from which they can be retrieved later. Indexing is a process that identifies the words and phrases that best describe the page and assigns the page to particular keywords. The purpose of storing an index is to improve the speed and performance of finding the documents relevant to a search query. Without an index, the search engine would have to scan every document in the corpus, which would require considerable time and computing power.

6.3 Searching and Processing
When a user submits a search request for words found in the index, the search engine processes it by comparing the search string in the request with the indexed pages in the database. Since millions of pages may match the search string, the search engine calculates the relevancy of each page in its index to the search string. There are different algorithms for calculating relevancy. Each of these algorithms assigns different relative weights to common factors such as keyword density, links, or meta tags.

6.4 Result matching
The search engine uses a matching method to match the user's query with similar web pages in the database. Many different techniques are used to match searches so that strongly relevant results are presented. However, there can be challenges when matching results. Some of these are shown below [10]:
• Parsing: Parsing algorithms may run into difficulties if they encounter complex HyperText Markup Language (HTML) in some web pages. Such difficulties create instances where the engine may fail to extract some useful results for display to the user.
• Filtering: A search engine needs to filter URLs effectively in order to show the most relevant ones to searchers. It is important to show each user unique results by minimizing the chances of repetition.

6.5 Result ranking
Ranking defines the order in which search results are displayed to the user. There may be thousands of results that could be shown, so care must be taken that they appear in order of importance. Search engines use a sorting algorithm to rank the results. This algorithm is based on two factors:
• Location: The search engine looks for the search keywords near the top of the web page, for example in the title of the web page.
• Frequency: The algorithm looks at how often the keywords are repeated in the content of the search results. Keyword frequency is not considered an ideal factor because it is biased toward content-rich pages.

6.6 Retrieving
Retrieving the results simply means displaying them in the browser, sorted from the most relevant to the least relevant sites.

7. Search Engine Architecture: Google, Yahoo
There are a number of search engines available on the Internet. The most popular search engines are Google and Yahoo.

There are some measurements that differ between available search engines [16]:
• The contents of the database and the success in finding the desired result. When we search, we are not searching the Web itself; instead, we are searching a cache of the web, a database that contains information about all the Web sites visited by the search engine's spider or crawler.
• The size, meaning how many Web pages the spider has visited and stored in the database. Some of the largest search engines have databases that cover more than three billion Web pages, while smaller ones cover half a billion or less.
• How up to date the database is. The Internet is constantly changing and growing: new websites appear, old sites disappear, and the content of existing sites is modified. The information stored in the database therefore becomes out of date unless the search engine's spider keeps up with these changes.
• The ranking algorithm used by the search engine, which determines whether the most relevant results appear at the top of the results list.

Yahoo's algorithm is not far from Google's but differs at some points; Yahoo gives considerable weight to its web directory as part of its ranking algorithm.

7.1 Google: An Overview
Google was founded in 1998 by Larry Page and Sergey Brin while they were PhD students at Stanford University, and was officially launched in the fall of 1999 [[5]]. It is a straightforward engine that does not support advanced search syntax, which makes it very easy to use, and it retrieves pages ranked on the basis of the number of sites linking to them and how often they are visited, indicating their popularity (ibid). It claims that 97% of users find what they are looking for. Google's brand has become so universally recognizable that nowadays people use it as a verb. For example, if someone asks you a question you cannot answer, the reply is "Ask Google" or "Google it".

Features
Google includes the following most important features:
• Cached page archives
• Results clustered by indentation
• Results-per-page display option, from 10 to 100
"Google Search" supports:
• Implied Boolean (+) sign and (-) sign
• Double quotes ("") for phrases; stop words.

Figure 3: Google Architecture.

The components:
• Crawler: There are several distributed crawlers; they parse the pages and extract links and keywords.
• URL Server: Provides the crawlers with a list of URLs to scan.
• Store Server: The crawlers send collected data to a store server. It compresses the pages and places them in the repository. Each page is stored with an identifier, a docID.
• Repository: Contains a copy of the pages and images, allowing comparisons and caching.
• Indexer: It indexes pages. It decompresses documents and converts them into sets of words called "hits". It distributes hits among a set of "barrels", which provides a partially sorted index. It also creates a list of URLs on each page. A hit contains the following information: the word, its position in the document, font size, and capitalization.
• Barrels: These "barrels" are databases that classify documents by docID. They are created by the indexer and used by the sorter.
• Anchors: The bank of anchors created by the indexer contains internal links and the text associated with each link.
• URL Resolver: It takes the contents of anchors, converts relative URLs into absolute addresses, and finds or creates a docID. It builds an index of documents and a database of links.
• Doc Index: Contains the text relative to each URL.
• Links: The database of links associates each one with a docID (and so with a real document on the Web).
• Page Rank: The software uses the database of links to define the PageRank of each page.
• Sorter: It interacts with the barrels. It takes documents classified by docID and creates an inverted list sorted by wordID.
• Lexicon: A program called DumpLexicon takes the list provided by the sorter (classified by wordID), together with the lexicon created by the indexer (the sets of keywords in each page), and produces a new lexicon for the searcher.
• Searcher: It runs on a web server in a datacenter, uses the lexicon built by DumpLexicon in combination with the index classified by wordID, takes the PageRank into account, and produces a results page.

7.2 Yahoo: An Overview
Yahoo, which began in mid-1994, is the oldest and also the largest directory on the Internet. It is one of the most frequently accessed tools, and although most people consider it a search engine, it is classified as a directory.

Structure
• Yahoo is a hierarchically organized subject catalogue or directory of the web, which is browsable and searchable.
• Yahoo indexes web pages, UseNet, and e-mail addresses.

Features
• Topic- and region-specific "Yahoos!"
• Automatic truncation
• No case sensitivity and stop words. The syntax that Yahoo follows for searching is fairly standard among all search engines.

Figure 4: Yahoo Architecture.

The components:
Data Acquisition -- Web Crawling
• follow hyperlinks to download pages
• spam detection
• (near) duplicate detection
• Link analysis -- e.g., PageRank
• prepares input for crawling and query processing
Index Construction and Updates
• build inverted index structure in bulk, similar to mining but updates trickier
Query Processing
Boolean queries:
• compute unions/intersections of lists
Ranked queries:
• give scores to all docs in union

8. STATISTICAL OVERVIEW

Figure 5 shows that Google is still the king of search traffic, accounting for 70.38% of all search traffic in October 2013. Bing and Yahoo! follow further behind with 10.63% and 7.31% respectively, while Ask is at 3.44% and AOL Search is at 1.24% [[3]].
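Both component lists in Section 7 name PageRank-style link analysis. As a rough illustration of the idea only, and not the production algorithm of Google or Yahoo, the sketch below computes PageRank by power iteration over a small link graph. The damping factor of 0.85 is the value commonly quoted for the original algorithm; the function name, the iteration count, the handling of pages without out-links, and the four-page graph are assumptions made for this example.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict {page: [pages it links to]}."""
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}
    for _ in range(iterations):
        # Every page receives the same small baseline share.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page passes its damped rank equally to the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page without out-links spreads its rank over all pages.
                share = damping * rank[page] / n
                for other in pages:
                    new_rank[other] += share
        rank = new_rank
    return rank


# Hypothetical four-page link graph, used only for illustration.
TOY_LINKS = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
RANKS = pagerank(TOY_LINKS)

for page, score in sorted(RANKS.items(), key=lambda item: -item[1]):
    print(f"{page}: {score:.4f}")
```

In this toy graph, page C collects links from three pages and ends up with the highest score, while D, which nothing links to, keeps only the baseline share. This mirrors the idea in Section 7.1 that pages are ranked by the number of sites linking to them.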
Figure 5: Search Engine Usage For October 2013.

9. SUMMARY & CONCLUSIONS
• On average, a user needs to find the target page while searching with few keywords and a simple query.
• A simple user interface that enables users to make sufficient use of search tools or categorization features makes a search engine effective.
• As the analysis and results show, some search engines (Google, Yahoo, Bing) have advantages over others because of their accuracy in placing the result the user is looking for on the first page. This is due to exhaustive analysis of the algorithms behind the server processes for crawling, ranking, and searching.
• Further advantages come from a user-friendly interface and from features and tools that are well presented in the interface.
• Advanced users look for more advanced tools and adequate features, which are available in some search engines but in different forms or with different techniques.
• Boolean operators make a difference when searching for a unique target link; as seen in Experiment 5, by using Boolean operators we can get the unique target among the first results.
• For more accurate results, each search engine needs to be examined with more query experiments, and the category features and search tools need to be checked in more detail.
• The basic principles of operation are the same for all search engines, namely crawling, indexing, and ranking, with some differences that may lead to significant changes in the accuracy of the results.

10. REFERENCES

[1] Tauqeer Ahmad Usmani et al., "A Comparative Study of Google and Bing Search Engines in Context of Precision and Relative Recall Parameter", International Journal on Computer Science and Engineering (IJCSE), Vol. 4 No. 01, January 2012.
[2] Neelam Tyagi, Simple Sharma, "Weighted Page Rank Algorithm Based on Number of Visits of Links of Web Page", International Journal of Soft Computing and Engineering (IJSCE), Volume-2, Issue-3, July 2012.
[3] Krishan Kant Lavania, Sapna Jain, Madhur Kumar Gupta, and Nicy Sharma, "Google: A Case Study (Web Searching and Crawling)", International Journal of Computer Theory and Engineering, Vol. 5, No. 2, April 2013.
[4] Dilip Kumar Sharma et al., "A Comparative Analysis of Web Page Ranking Algorithms", (IJCSE) International Journal on Computer Science and Engineering, Vol. 02, No. 08, 2010.
[5] P.N. Vijaya Kumar et al., "A PRACTICAL APPROACH TO WORKING OF WEB SEARCH ENGINE", International Journal of Computer and Electronics Research, Volume 2, Issue 1, February 2013.
[6] B. Barla Cambazoglu, Flavio P. Junqueira, Vassilis Plachouras, "A Refreshing Perspective of Search Engine Caching", International World Wide Web Conference Committee (IW3C2), April 26-30, 2010.
[7] Zoltán Gyöngyi, Hector Garcia-Molina, Jan Pedersen, "Combating Web Spam with TrustRank", 30th VLDB Conference, Toronto, Canada, 2004.
[8] Ziv Bar-Yossef, "Efficient Search Engine Measurements", International World Wide Web Conference Committee (IW3C2), May 2007.
[9] Bernard J. Jansen, Amanda Spink, "How are we searching the World Wide Web? A comparison of nine search engine transaction logs", Information Processing and Management 42 (2006), ScienceDirect.
[10] Inderjeet Singh Oberoi, Mridul Chopra, "Web Search Engines - A Comparative Study", Mälardalen University, 2010.
[11] P.N. Vijaya Kumar et al., "A PRACTICAL APPROACH TO WORKING OF WEB SEARCH ENGINE", International Journal of Computer and Electronics Research, Volume 2, Issue 1, February 2013.
[12] Rasmita Mohanty, K S Chudamani, "A Comparative Study of Google and Yahoo Web Resources on the Search term 'Physics India'", International CALIBER-2008.
[13] Dr. Khanna Samrat Vivekanand Omprakash, "CONCEPT OF SEARCH ENGINE OPTIMIZATION IN WEB SEARCH ENGINE", International Journal of Advanced Engineering Research and Studies, E-ISSN 2249-8974.
[14] Inderjeet Singh Oberoi, Mridul Chopra, "Web Search Engines - A Comparative Study".
[15] Monica Peshave, "HOW SEARCH ENGINES WORK AND A WEB CRAWLER APPLICATION", University of Illinois at Springfield, IL 62703.
[16] Krishan Kant Lavania et al., "Google: A Case Study (Web Searching and Crawling)", International Journal of Computer Theory and Engineering, Vol. 5, No. 2, April 2013.
[17] Sergey Brin and Lawrence Page, "The Anatomy of a Large-Scale Hypertextual Web Search Engine", Computer Science Department, Stanford University, Stanford, CA 94305, US.

[[1]] http://www.webopedia.com/TERM/S/search_engine.html
[[2]] http://www.rba.co.uk/search/compare.shtml
[[3]] http://www.thesearchguru.com/search-stats.asp
[[4]] http://searchenginewatch.com/article/2067276/Searches-Per-Day
[[5]] http://www.google.com/about/company/

29 pages
Comparative Study On Semantic Search Engines
No ratings yet
Comparative Study On Semantic Search Engines
9 pages
Browse Fonts - Google Fonts
No ratings yet
Browse Fonts - Google Fonts
70 pages
Darknet Report
No ratings yet
Darknet Report
27 pages
Search Engine: Amit Kamath Ancy Alphonso
No ratings yet
Search Engine: Amit Kamath Ancy Alphonso
22 pages
BA4029 SOCIAL MEDIA WEB ANALYTICS Unit 5
No ratings yet
BA4029 SOCIAL MEDIA WEB ANALYTICS Unit 5
23 pages
SPPM 1002 Web Searching
No ratings yet
SPPM 1002 Web Searching
12 pages
Module 2
No ratings yet
Module 2
18 pages
Web Search-Engines: Preksha Mangal B-Tech CS-3 Year
No ratings yet
Web Search-Engines: Preksha Mangal B-Tech CS-3 Year
43 pages
09 Can Doctype
100% (1)
09 Can Doctype
162 pages
Web Search Engine
No ratings yet
Web Search Engine
26 pages
Search Engine Optimization - Using Data Mining Approach
No ratings yet
Search Engine Optimization - Using Data Mining Approach
5 pages
How Do Search Engines Work
No ratings yet
How Do Search Engines Work
3 pages
Search Engine Description
No ratings yet
Search Engine Description
17 pages
Working of Search Engines: Avinash Kumar Widhani, Ankit Tripathi and Rohit Sharma Lnmiit
No ratings yet
Working of Search Engines: Avinash Kumar Widhani, Ankit Tripathi and Rohit Sharma Lnmiit
10 pages
Working of Search Engines: Avinash Kumar Widhani, Ankit Tripathi and Rohit Sharma Lnmiit
No ratings yet
Working of Search Engines: Avinash Kumar Widhani, Ankit Tripathi and Rohit Sharma Lnmiit
13 pages
Ensoniq Eps
No ratings yet
Ensoniq Eps
57 pages
Keyword Research and Competition Analysis
No ratings yet
Keyword Research and Competition Analysis
12 pages
Database & Search Engine
No ratings yet
Database & Search Engine
17 pages
Indexing and Search Engines For The Intranets: by Suvarsha Walters (Suvarsha@ncsi - Iisc.ernet - In)
No ratings yet
Indexing and Search Engines For The Intranets: by Suvarsha Walters (Suvarsha@ncsi - Iisc.ernet - In)
33 pages
SEO Notes
No ratings yet
SEO Notes
8 pages
B16
No ratings yet
B16
197 pages
Doraemon Characters Chinese Names
No ratings yet
Doraemon Characters Chinese Names
3 pages
SEO-Site Audit-Template
No ratings yet
SEO-Site Audit-Template
8 pages
Search Engine Problems and Solutions
No ratings yet
Search Engine Problems and Solutions
2 pages
Digital Marketing Terms
No ratings yet
Digital Marketing Terms
12 pages
24MCA20120 Vishal
No ratings yet
24MCA20120 Vishal
9 pages
Werkplaats Handboek DAF DD575 DF615 DT 615
No ratings yet
Werkplaats Handboek DAF DD575 DF615 DT 615
87 pages
Downloaded From Manuals Search Engine
No ratings yet
Downloaded From Manuals Search Engine
50 pages
Preparation
No ratings yet
Preparation
10 pages
Batch-11 SEO101 1
No ratings yet
Batch-11 SEO101 1
3 pages
Multilingual SEO Services
No ratings yet
Multilingual SEO Services
6 pages
Yashi Resume
No ratings yet
Yashi Resume
1 page
Professional SEO Services - AmazingDuniya7 - Grow Your Online Presence
No ratings yet
Professional SEO Services - AmazingDuniya7 - Grow Your Online Presence
3 pages
Google Search Revealed: Mastering the Algorithm for Search Dominance
From Everand
Google Search Revealed: Mastering the Algorithm for Search Dominance
Azhar ul Haque Sario
No ratings yet
Search Engine Testing
From Everand
Search Engine Testing
Abhinav Vaid
No ratings yet
Mastering Search Engine Marketing: A Guide for SEM Campaign Success
From Everand
Mastering Search Engine Marketing: A Guide for SEM Campaign Success
Rebecca Cox
No ratings yet
Seo Learning Guide
From Everand
Seo Learning Guide
ngencoband
No ratings yet
Image Retrieval: Unlocking the Power of Visual Data
From Everand
Image Retrieval: Unlocking the Power of Visual Data
Fouad Sabry
No ratings yet


