0% found this document useful (0 votes)
31 views3 pages

ABSTRACT1

Internet search engines allow users to find information across the vast and decentralized internet. Early search engines indexed hundreds of thousands of pages and received a few thousand queries per day, while modern top search engines index hundreds of millions of pages and receive tens of millions of queries daily. The first search engine was Archie, created in 1990, which helped solve the problem of scattered data by combining a script to gather data with regular expressions to match filenames to user queries. Later, Veronica and Jughead provided similar search capabilities for files transferred via Gopher.

Uploaded by

shiv900
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
31 views3 pages

ABSTRACT1

Internet search engines allow users to find information across the vast and decentralized internet. Early search engines indexed hundreds of thousands of pages and received a few thousand queries per day, while modern top search engines index hundreds of millions of pages and receive tens of millions of queries daily. The first search engine was Archie, created in 1990, which helped solve the problem of scattered data by combining a script to gather data with regular expressions to match filenames to user queries. Later, Veronica and Jughead provided similar search capabilities for files transferred via Gopher.

Uploaded by

shiv900
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

ABSTRACT:

The good news about the Internet and its most visible component, the World Wide Web, is that there are
hundreds of millions of pages available, waiting to present information on an amazing variety of topics. The
bad news about the Internet is that there are hundreds of millions of pages available, most of them titled
according to the whim of their author, almost all of them sitting on servers with cryptic names. When you need
to know about a particular subject, how do you know which pages to read? If you're like most people, you visit
an Internet search engine.

Internet search engines are special sites on the Web that are designed to help people find information stored
on other sites. There are differences in the ways various search engines work, but they all perform three basic
tasks:

 They search the Internet -- or select pieces of the Internet -- based on important words.
 They keep an index of the words they find, and where they find them.
 They allow users to look for words or combinations of words found in that index.

Early search engines held an index of a few hundred thousand pages and documents, and received
maybe one or two thousand inquiries each day. Today, a top search engine will index hundreds of
millions of pages, and respond to tens of millions of queries per day. In this article, we'll tell you how
these major tasks are performed, and how Internet search engines put the pieces together in order to
let you find the information you need on the Web.
HISTORY OF Search Engine

Gerard Salton (1960s - 1990s):


Gerard Salton, who died on August 28th of 1995, was the father of modern search technology. His teams
at Harvard and Cornell developed the SMART informational retrieval system. Salton’s Magic Automatic
Retriever of Text included important concepts like the vector space model, Inverse Document Frequency
(IDF), Term Frequency (TF), term discrimination values, and relevancy feedback mechanisms.
Ted Nelson:
Ted Nelson created Project Xanadu in 1960 and coined the term hypertext in 1963. His goal with Project
Xanadu was to create a computer network with a simple user interface that solved many social problems
like attribution.

While Ted was against complex markup code, broken links, and many other problems associated with
traditional HTML on the WWW, much of the inspiration to create the WWW was drawn from Ted's work.

There is still conflict surrounding the exact reasons why Project Xanadu failed to take off.

Advanced Research Projects Agency Network:


ARPANet is the network which eventually led to the internet. The Wikipedia has a  great background
article on ARPANet and Google Video has a free interesting video about ARPANet from 1972.
Archie (1990):

The first few hundred web sites began in 1993 and most of them were at colleges, but long before most of
them existed came Archie. The first search engine created was Archie, created in 1990 by Alan Emtage,
a student at McGill University in Montreal. The original intent of the name was "archives," but it was
shortened to Archie.

Archie helped solve this data scatter problem by combining a script-based data gatherer with a regular
expression matcher for retrieving file names matching a user query. Essentially Archie became a
database of web filenames which it would match with the users queries.

Bill Slawski has more background on Archie here.


Veronica & Jughead:
As word of mouth about Archie spread, it started to become word of computer and Archie had such
popularity that the University of Nevada System Computing Services group developed Veronica. Veronica
served the same purpose as Archie, but it worked on plain text files. Soon another user interface name
Jughead appeared with the same purpose as Veronica, both of these were used for files sent via Gopher,
which was created as an Archie alternative by Mark McCahill at the University of Minnesota in 1991.
File Transfer Protocol:
Tim Burners-Lee existed at this point, however there was no World Wide Web. The main way people
shared data back then was via File Transfer Protocol (FTP).
If you had a file you wanted to share you would set up an FTP server. If someone was interested in
retrieving the data they could using an FTP client. This process worked effectively in small groups, but the
data became as much fragmented as it was collected.

You might also like