0% found this document useful (0 votes)

109 views10 pages

Working of Search Engines: Avinash Kumar Widhani, Ankit Tripathi and Rohit Sharma Lnmiit

The document discusses how search engines work by crawling websites, indexing their content into databases, and then searching those databases to return relevant results for user queries. It describes how search engine spiders follow links to discover new pages for indexing, and how those pages are analyzed and stored to enable keyword searches. The document also provides background on the history of search engines and discusses common search engine algorithms and ranking systems.

Uploaded by

avi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

109 views10 pages

Working of Search Engines: Avinash Kumar Widhani, Ankit Tripathi and Rohit Sharma Lnmiit

Uploaded by

avi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 10

Working of Search

Engines
A2-39
Avinash Kumar Widhani, Ankit Tripathi and Rohit
Sharma
LNMIIT
[email protected], [email protected] ,
[email protected]
Abstract
The measure of data on the web is expanding step
by step every day, and also the no. of new clients
unpracticed in the craft of web research. It is
Search Engine which empowers the client for
looking the data about their issues. A search
engines crawl the web, and after that produce their
listings by utilizing a few calculations (also known
as algorithms). Search engine results are sorted
out in a way that is controlled by an uncommon
calculation to rank the comes about so that the
best results recorded first. On the off chance that
you change your site pages then likewise web
index crawler will effectively discover these
progressions, and that can influence listing. There
are no. of ways to run a search engines crawlers
and change a site to help improve its rankings. The
best case is Google Web Search Engine which we
utilize every day in our life. This research paper
goes through the different generations of web
search engines, the simplified algorithm used
and a general overview of the search engine
Architecture. It is critical to know how a web
crawler Works, what sort of systems it utilizes and
what are the terms identified with it.
Introduction
Discovering key information from tremendous World Wide
Web is like discover a needle lost in hay stack. For this
reason we would utilize a special magnet that would
automatically, attract that needle for us. In this situation
magnet is search engine. Internet search engine is a tool
that helps us find information on the World Wide Web or
we can say a search engine is a product program or script
accessible through the Internet that scans records and
documents for watchwords and returns the consequences of any
documents containing those keywords. In short a reference book
which can tell everything, WhatIs.com, gives a precise meaning
of a search engine. A web search tool is a blend of no. of
projects and calculations which incorporates.
A spider (also called a crawler) that visits each page or
agent pages on each Web website that needs to be searchable
and understands it, utilizing hypertext connects on each page to
find and give the outcome.
A program that makes a gigantic record (called a
catalog) from the pages that have been perused.
A program that receives your search request, thinks about it
to the passages in the file, and returns results to you
So, the search engine visits the site pages and utilizes
connections to help them to go to other website pages.
The search engine then records those pages into its
database. At the point when a searcher sends a pursuit
demand, the web search tool looks at the website pages
in the record (database) to discover archives that are like
the hunt inquiry and with the assistance of a few
algorithms, the search engine provides results to the
searcher in the search engine result page also known as
SERP .The search engine algorithms are set of programs
and rules that a search engine follows, to locate the most
applicable outcomes for inquiry question. Sometimes
search engines fail to return relevant results, and thats
why they need to improve its algorithm constantly time to
time. The algorithms decide the situation of online
records in the natural list items, which are typically displayed
on the left side of the screen in the SERPs, as illustrated in
the Figure 1Search engine algorithms are very closely kept a
secrets, because of the tough competition in the field.
One more purpose behind search engines to keep their
algorithms mystery is search engine spam. If someone
knew the exact algorithm of a search engine, they could
manipulate the results in their favor very easily. By
testing different-different techniques, website owners
sometimes find out the algorithms and act accordingly to
boost their ranking in the SERPs. Thats why changes in
the algorithms are made oftenly due to increased search
engine spam. There are many search engines which are utilized
by a large number of individuals consistently which incorporate
well known ones like Google, Yahoo, and Bing. The web
creates new challenges for information retrieval. The
amount of information on the web is increasing rapidly,
similarly the people are likely to surf the web, often
starting with high
Quality human maintained indices such as Yahoo.

LITERATURE
REVIEW
Brief History of search engines

1st Generation (1994):

AltaVista, Excite
Ranking in light of Content
The more rare words two documents share the more similar
they are
Documents are dealt with as "sacks of words" (no effort to
understand the contents)

2nd Generation (1996):

Lycos
Ranking in light of Content + Structure

Site Popularity

3rd Generation (1998):

Google, Yahoo, Bing

Ranking based on Content + Structure + Value

Page Reputation

In the Works

Ranking based on the need behind the question

Search Engineers
Information retrieval research includes the improvement
of scientific models of content and dialect, huge scale
explores different avenues regarding test accumulations
or clients, and a considerable measure of insightful paper
composing. These people are mainly trained in computer
science and information technology in spite of the fact
that data science, arithmetic, and every so often,
sociology and computational etymology are additionally
spoken to. So who works with search engines? To a large
extent, it is the same sort of people but with a more
practical emphasis. The computing industry has started
to use the term search engineer to depict such sort of
individual. Search engineers are primarily people trained
in computer science, mostly with a systems or database
background. The people who work in the web search
companies, designing and implementing new lineament
in search engines are search engineers, but the majority
of search engineers are the general population who alter,
create, keep up, or change calculations of subsisting
search engine for an extensive variety of business
applications. People who design content for search
engines are also search engineers.

METHODOLOGY
If I were to conduct this study I think the best way to do
so would be by a combination of quantitative and
qualitative methods. I would choose to use survey
research as well as focus groups in order to study the
working of search engine. By using survey research I
would be able to uncover whether or not people are
actually inclined to know about how a search engine
works. By using the two different types of research it also
will allow for the study to be more diverse and look at
different angles of search engine, which will result in
having a better understanding.
SUMMARY
Search engines never seek the World Wide Web
straightforwardly. They seeks a database of the full
content of website pages chose from the large number of
pages out there set on servers. When you search for
something using a search engine, you are always
searching for a copy of the actual web page. When you
click on links provided in a search engine's results list,
you retrieve the actual version of the page from the
server. Search engine databases are chosen and worked
by PC robot programs called spiders. Although it is said
they "crawl" the web in their search for pages to find
them but genuine truth is that they remain at one place
as it were. They find the pages for potential inclusion by
following the links in the pages they already have
registered in database. They can't think or sort a URL or
utilize judgment to choose to go look into something.

In the event that a website page is never connected to in

whatever other page, web index can never discover it.
The only way a brand new page, that no other page has
ever linked to, can get into a search engine is for its URL
to be sent by some person to the search engine
companies as a request that the new site be included. All
search engine companies offer to do this way.
After spiders discover pages, they pass them on to
another PC program for "indexing." This program
recognize the links and other content in the page and
stores it in the search engine database's files so that the
database can be searched by keyword and the page will
be found if your search matches with the content.

REFERENCES
Dreilinger, D., and Howe, A. 1996. An Information-
Gathering Agent for Querying Web Search Engines,
Technical Report, TR 96-11, Computer Science
Department, Colorado State University.

Brin, S., & Page, L. (1998). The anatomy of a large-scale

hyper textual web search engine. Computer networks
and ISDN systems, 30(1), 107-117.

Langville, A. N., & Meyer, C. D. (2011). Google's

PageRank and beyond: The science of search engine
rankings. Princeton University Press.

McCandless, M., Hatcher, E., & Gospodnetic, O.

(2010). Lucene in Action: Covers Apache Lucene 3.0.
Manning Publications Co.

https://www.scribd.com/presentation/89353754/Working-of-
Search-Engines
https://www.scribd.com/document/12885521/Search-Engine
https://www.cnlp.org/publications/02HowASearchEngineWorks.pdf
http://www.tandfonline.com/doi/abs/10.1080/01972240050133634

https://pdfs.semanticscholar.org/4c9f/afa3b1bed97bb00b8bc68db39a9ad48490f1.p
df

http://www.aaai.org/ojs/index.php/aimagazine/article/view/1290

http://dl.acm.org/citation.cfm?id=256164

http://david-hawking.net/pubs/overview_trecweb2003.pdf

http://ieeexplore.ieee.org/abstract/document/4522561/?reload=true

Search Engines
83% (6)
Search Engines
23 pages
Types of Search Engines and How It Works
100% (2)
Types of Search Engines and How It Works
42 pages
Search Engines
No ratings yet
Search Engines
24 pages
Seminar Report
100% (4)
Seminar Report
44 pages
Search Engine
100% (1)
Search Engine
22 pages
CH - 01 - Final How Google Works
No ratings yet
CH - 01 - Final How Google Works
51 pages
Jaff Seminar
No ratings yet
Jaff Seminar
31 pages
Search Engine: Submitted By, E.Priyan, Pondicherry University
No ratings yet
Search Engine: Submitted By, E.Priyan, Pondicherry University
13 pages
Search Engine
No ratings yet
Search Engine
4 pages
Cali) Ngasan - Search Engine
No ratings yet
Cali) Ngasan - Search Engine
98 pages
Search Engines: Submitted To: Submitted by
No ratings yet
Search Engines: Submitted To: Submitted by
16 pages
Chapter 1 Search Engine 1. Objective
No ratings yet
Chapter 1 Search Engine 1. Objective
63 pages
Meta Search Engines
No ratings yet
Meta Search Engines
48 pages
Unit 1
No ratings yet
Unit 1
47 pages
Seo CH1
No ratings yet
Seo CH1
45 pages
Seo CH1
No ratings yet
Seo CH1
45 pages
Search Engine Student Documents
No ratings yet
Search Engine Student Documents
6 pages
9.database Application PT 05213 - June 2023 Search Search Engines
No ratings yet
9.database Application PT 05213 - June 2023 Search Search Engines
24 pages
Anatomy of Search Engine
No ratings yet
Anatomy of Search Engine
22 pages
BA4029 SOCIAL MEDIA WEB ANALYTICS Unit 5
No ratings yet
BA4029 SOCIAL MEDIA WEB ANALYTICS Unit 5
23 pages
Search Engine
No ratings yet
Search Engine
15 pages
Search Engine
No ratings yet
Search Engine
20 pages
Meeting 14 OK
No ratings yet
Meeting 14 OK
12 pages
ST. JOSEPH SCHO-WPS Office
No ratings yet
ST. JOSEPH SCHO-WPS Office
15 pages
Search Engines: Sara Khalid Suliman
No ratings yet
Search Engines: Sara Khalid Suliman
34 pages
Unit 4
No ratings yet
Unit 4
47 pages
Working of Webb Search Engines
No ratings yet
Working of Webb Search Engines
29 pages
Web Search-Engines: Preksha Mangal B-Tech CS-3 Year
No ratings yet
Web Search-Engines: Preksha Mangal B-Tech CS-3 Year
43 pages
Search Engines: Presented By, Aswathy Gopinadhan 2 Sem Mba
No ratings yet
Search Engines: Presented By, Aswathy Gopinadhan 2 Sem Mba
30 pages
Computer - Search Engines
No ratings yet
Computer - Search Engines
10 pages
Lab Manual: Web Technology
No ratings yet
Lab Manual: Web Technology
39 pages
Social Media
No ratings yet
Social Media
10 pages
Search Engine: by Bhupendra Ratha, Lecturer
No ratings yet
Search Engine: by Bhupendra Ratha, Lecturer
22 pages
WEB BROWSERS+search Engine
No ratings yet
WEB BROWSERS+search Engine
10 pages
Module 2
No ratings yet
Module 2
18 pages
How Google Works
No ratings yet
How Google Works
61 pages
Search Engines .: Presented By: Rasik Mevada Vishal Dabhi Vimal Nair Ravi Mathai
No ratings yet
Search Engines .: Presented By: Rasik Mevada Vishal Dabhi Vimal Nair Ravi Mathai
25 pages
Search Engine Description
No ratings yet
Search Engine Description
17 pages
Prashant Mathur Neha Gupta Monu K. Verma Mohd. Shoaib
No ratings yet
Prashant Mathur Neha Gupta Monu K. Verma Mohd. Shoaib
31 pages
Unit 8 - Search Engines
No ratings yet
Unit 8 - Search Engines
8 pages
Search Tools: Presented By: ISHA
No ratings yet
Search Tools: Presented By: ISHA
22 pages
Technical Seminar Report ON Search Engine: Computer Science and Engineering
No ratings yet
Technical Seminar Report ON Search Engine: Computer Science and Engineering
39 pages
Darknet Report
No ratings yet
Darknet Report
27 pages
Database & Search Engine
No ratings yet
Database & Search Engine
17 pages
005-001-000-024 Search Engines
No ratings yet
005-001-000-024 Search Engines
11 pages
SEARCH ENGINES and PAGERANK
No ratings yet
SEARCH ENGINES and PAGERANK
29 pages
Duplichecker Plagiarism Report
No ratings yet
Duplichecker Plagiarism Report
4 pages
Social Media
No ratings yet
Social Media
10 pages
Search Engine Optimization - Using Data Mining Approach
No ratings yet
Search Engine Optimization - Using Data Mining Approach
5 pages
Search Engine: Programs Keywords
No ratings yet
Search Engine: Programs Keywords
10 pages
The Evolution and Functionality of Search Engines
No ratings yet
The Evolution and Functionality of Search Engines
3 pages
Anatomy of A Search Engine
No ratings yet
Anatomy of A Search Engine
17 pages
Working of Search Engines: Avinash Kumar Widhani, Ankit Tripathi and Rohit Sharma Lnmiit
No ratings yet
Working of Search Engines: Avinash Kumar Widhani, Ankit Tripathi and Rohit Sharma Lnmiit
13 pages
How Do Search Engines Work
No ratings yet
How Do Search Engines Work
3 pages
Preparation
No ratings yet
Preparation
10 pages
Search Engine R1 PDF
No ratings yet
Search Engine R1 PDF
5 pages
Search Engine Powerpoint
No ratings yet
Search Engine Powerpoint
2 pages
Google Search Revealed: Mastering the Algorithm for Search Dominance
From Everand
Google Search Revealed: Mastering the Algorithm for Search Dominance
Azhar ul Haque Sario
No ratings yet
An Introduction To SEO
From Everand
An Introduction To SEO
Nirmalya Roy
No ratings yet
The Pocket Guide to SEO for Authors: Pocket Guides
From Everand
The Pocket Guide to SEO for Authors: Pocket Guides
Troy Lambert
No ratings yet

Working of Search Engines: Avinash Kumar Widhani, Ankit Tripathi and Rohit Sharma Lnmiit

Uploaded by

Working of Search Engines: Avinash Kumar Widhani, Ankit Tripathi and Rohit Sharma Lnmiit

Uploaded by

Working of Search

1st Generation (1994):

2nd Generation (1996):

3rd Generation (1998):

Google, Yahoo, Bing

Ranking based on the need behind the question

In the event that a website page is never connected to in

Brin, S., & Page, L. (1998). The anatomy of a large-scale

Langville, A. N., & Meyer, C. D. (2011). Google's

McCandless, M., Hatcher, E., & Gospodnetic, O.

You might also like