0% found this document useful (0 votes)

31 views3 pages

ABSTRACT1

Internet search engines allow users to find information across the vast and decentralized internet. Early search engines indexed hundreds of thousands of pages and received a few thousand queries per day, while modern top search engines index hundreds of millions of pages and receive tens of millions of queries daily. The first search engine was Archie, created in 1990, which helped solve the problem of scattered data by combining a script to gather data with regular expressions to match filenames to user queries. Later, Veronica and Jughead provided similar search capabilities for files transferred via Gopher.

Uploaded by

shiv900

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views3 pages

ABSTRACT1

Uploaded by

shiv900

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

ABSTRACT:

The good news about the Internet and its most visible component, the World Wide Web, is that there are
hundreds of millions of pages available, waiting to present information on an amazing variety of topics. The
bad news about the Internet is that there are hundreds of millions of pages available, most of them titled
according to the whim of their author, almost all of them sitting on servers with cryptic names. When you need
to know about a particular subject, how do you know which pages to read? If you're like most people, you visit
an Internet search engine.

Internet search engines are special sites on the Web that are designed to help people find information stored
on other sites. There are differences in the ways various search engines work, but they all perform three basic
tasks:

 They search the Internet -- or select pieces of the Internet -- based on important words.
 They keep an index of the words they find, and where they find them.
 They allow users to look for words or combinations of words found in that index.

Early search engines held an index of a few hundred thousand pages and documents, and received
maybe one or two thousand inquiries each day. Today, a top search engine will index hundreds of
millions of pages, and respond to tens of millions of queries per day. In this article, we'll tell you how
these major tasks are performed, and how Internet search engines put the pieces together in order to
let you find the information you need on the Web.
HISTORY OF Search Engine

Gerard Salton (1960s - 1990s):

Gerard Salton, who died on August 28th of 1995, was the father of modern search technology. His teams
at Harvard and Cornell developed the SMART informational retrieval system. Salton’s Magic Automatic
Retriever of Text included important concepts like the vector space model, Inverse Document Frequency
(IDF), Term Frequency (TF), term discrimination values, and relevancy feedback mechanisms.
Ted Nelson:
Ted Nelson created Project Xanadu in 1960 and coined the term hypertext in 1963. His goal with Project
Xanadu was to create a computer network with a simple user interface that solved many social problems
like attribution.

While Ted was against complex markup code, broken links, and many other problems associated with
traditional HTML on the WWW, much of the inspiration to create the WWW was drawn from Ted's work.

There is still conflict surrounding the exact reasons why Project Xanadu failed to take off.

Advanced Research Projects Agency Network:

ARPANet is the network which eventually led to the internet. The Wikipedia has a great background
article on ARPANet and Google Video has a free interesting video about ARPANet from 1972.
Archie (1990):

The first few hundred web sites began in 1993 and most of them were at colleges, but long before most of
them existed came Archie. The first search engine created was Archie, created in 1990 by Alan Emtage,
a student at McGill University in Montreal. The original intent of the name was "archives," but it was
shortened to Archie.

Archie helped solve this data scatter problem by combining a script-based data gatherer with a regular
expression matcher for retrieving file names matching a user query. Essentially Archie became a
database of web filenames which it would match with the users queries.

Bill Slawski has more background on Archie here.

Veronica & Jughead:
As word of mouth about Archie spread, it started to become word of computer and Archie had such
popularity that the University of Nevada System Computing Services group developed Veronica. Veronica
served the same purpose as Archie, but it worked on plain text files. Soon another user interface name
Jughead appeared with the same purpose as Veronica, both of these were used for files sent via Gopher,
which was created as an Archie alternative by Mark McCahill at the University of Minnesota in 1991.
File Transfer Protocol:
Tim Burners-Lee existed at this point, however there was no World Wide Web. The main way people
shared data back then was via File Transfer Protocol (FTP).
If you had a file you wanted to share you would set up an FTP server. If someone was interested in
retrieving the data they could using an FTP client. This process worked effectively in small groups, but the
data became as much fragmented as it was collected.

Internet and Internet Protocols
No ratings yet
Internet and Internet Protocols
21 pages
Web Search Engine
No ratings yet
Web Search Engine
26 pages
Search Engine
100% (2)
Search Engine
42 pages
Week 9 - Introduction To Internet
No ratings yet
Week 9 - Introduction To Internet
24 pages
Web Search Engine
No ratings yet
Web Search Engine
66 pages
Proxy Server, Firewall & VPN Proxy Server (Pdfdrive)
No ratings yet
Proxy Server, Firewall & VPN Proxy Server (Pdfdrive)
164 pages
Module 1
No ratings yet
Module 1
124 pages
Web Search. Web Spidering
No ratings yet
Web Search. Web Spidering
44 pages
Chapter02 120214224530 Phpapp01
No ratings yet
Chapter02 120214224530 Phpapp01
51 pages
Module 1
No ratings yet
Module 1
53 pages
Search Engine
No ratings yet
Search Engine
19 pages
Animated Final Seminar
No ratings yet
Animated Final Seminar
22 pages
Chap It Re 2 The Internet
No ratings yet
Chap It Re 2 The Internet
8 pages
Search Engine
No ratings yet
Search Engine
42 pages
Search Engine - Wikipedia
No ratings yet
Search Engine - Wikipedia
25 pages
Lecture 1 - On Internet
No ratings yet
Lecture 1 - On Internet
56 pages
Accenture CheatSheet PrepInsta
No ratings yet
Accenture CheatSheet PrepInsta
34 pages
How A Search Engine Works - Slide
No ratings yet
How A Search Engine Works - Slide
40 pages
Internet and Search Engines
No ratings yet
Internet and Search Engines
33 pages
G4 1st Lesson
No ratings yet
G4 1st Lesson
17 pages
Notes Introduction Unit 1 & Unit 2 Full (WT)
No ratings yet
Notes Introduction Unit 1 & Unit 2 Full (WT)
67 pages
Search Engine
No ratings yet
Search Engine
35 pages
Internet
No ratings yet
Internet
18 pages
Case 9 Google in 2011
No ratings yet
Case 9 Google in 2011
15 pages
Ip Unit-2 Notes
No ratings yet
Ip Unit-2 Notes
57 pages
CSCI 587 SEC 1220 - Final Project - Kotha0746
No ratings yet
CSCI 587 SEC 1220 - Final Project - Kotha0746
40 pages
Search Engines: Ranjan Patra B.Tech (CSE) IV TH Sem HNB Garhwal University, Uttarakhand, India
No ratings yet
Search Engines: Ranjan Patra B.Tech (CSE) IV TH Sem HNB Garhwal University, Uttarakhand, India
17 pages
By: Abd Rashid Bin HJ Shafie Penyelaras Bestari SMK Gunung Rapat, Ipoh
No ratings yet
By: Abd Rashid Bin HJ Shafie Penyelaras Bestari SMK Gunung Rapat, Ipoh
6 pages
Week7 1
No ratings yet
Week7 1
48 pages
Unit 1: Search Engine Optimisation
No ratings yet
Unit 1: Search Engine Optimisation
10 pages
Innovation Case Study
No ratings yet
Innovation Case Study
9 pages
Search Engines Sunday
No ratings yet
Search Engines Sunday
17 pages
Hands On AWS Penetration Testing 1672316211
No ratings yet
Hands On AWS Penetration Testing 1672316211
129 pages
Unit IV. Internet Fundamentals GROUP 6
No ratings yet
Unit IV. Internet Fundamentals GROUP 6
24 pages
On Internet
No ratings yet
On Internet
38 pages
Div HTML
No ratings yet
Div HTML
6 pages
The Anatomy of A Large-Scale Hypertextual
No ratings yet
The Anatomy of A Large-Scale Hypertextual
41 pages
Unit I
No ratings yet
Unit I
12 pages
Web Technology Lab Assignment 1
No ratings yet
Web Technology Lab Assignment 1
14 pages
Lab Manual: Web Technology
No ratings yet
Lab Manual: Web Technology
39 pages
Using The Internet For Research and Academic Work
No ratings yet
Using The Internet For Research and Academic Work
56 pages
MCS 1 Cautare Pe Net
No ratings yet
MCS 1 Cautare Pe Net
22 pages
Web Crawler A Review
No ratings yet
Web Crawler A Review
5 pages
Module 1 MAX150 WI25v1 en Exercise Guide
No ratings yet
Module 1 MAX150 WI25v1 en Exercise Guide
259 pages
CS101 Final Term by AR Lucky Term
No ratings yet
CS101 Final Term by AR Lucky Term
256 pages
History: Timeline Year Engine Event
No ratings yet
History: Timeline Year Engine Event
6 pages
History: WP:Search Engine Test Search Engine (Disambiguation)
No ratings yet
History: WP:Search Engine Test Search Engine (Disambiguation)
5 pages
How Internet Search Engines Work
No ratings yet
How Internet Search Engines Work
6 pages
Archie Was The First Search Engine Ever Invented
No ratings yet
Archie Was The First Search Engine Ever Invented
4 pages
Internet Searching: Crawling Is Conceptually Quite Simple: Starting at Some Well-Known Sites On The Web
No ratings yet
Internet Searching: Crawling Is Conceptually Quite Simple: Starting at Some Well-Known Sites On The Web
4 pages
Search Engine: The Results of A Search For The Term "Lunar Eclipse" in A Web-Based Image Search Engine
No ratings yet
Search Engine: The Results of A Search For The Term "Lunar Eclipse" in A Web-Based Image Search Engine
10 pages
Search Engine
No ratings yet
Search Engine
17 pages
Web Search Engines: Practice and Experience: Content Analysis Query Prcessing Search Log
No ratings yet
Web Search Engines: Practice and Experience: Content Analysis Query Prcessing Search Log
21 pages
Seminar Formatkhjj
No ratings yet
Seminar Formatkhjj
24 pages
History and Working of Web Crawlers
No ratings yet
History and Working of Web Crawlers
3 pages
5 - APIM - Development - Policy Studio - Lab - Mashup
No ratings yet
5 - APIM - Development - Policy Studio - Lab - Mashup
53 pages
Oc 2 RJPGT 2023
No ratings yet
Oc 2 RJPGT 2023
13 pages
BA4029 SOCIAL MEDIA WEB ANALYTICS Unit 5
No ratings yet
BA4029 SOCIAL MEDIA WEB ANALYTICS Unit 5
23 pages
Websearch
No ratings yet
Websearch
21 pages
Darknet Report
No ratings yet
Darknet Report
27 pages
Online Quiz App
No ratings yet
Online Quiz App
16 pages
Prinect Signa Station - Install en
No ratings yet
Prinect Signa Station - Install en
41 pages
Port Forwarding - A Practical Hands-On Guide
No ratings yet
Port Forwarding - A Practical Hands-On Guide
14 pages
Ict Week 15 16
No ratings yet
Ict Week 15 16
6 pages
Flask Book
No ratings yet
Flask Book
21 pages
(ODataMetadata) Initial Loading of Metadata Failed... - SAP Community
No ratings yet
(ODataMetadata) Initial Loading of Metadata Failed... - SAP Community
8 pages
Internet: M.Sc. 2 Sem
No ratings yet
Internet: M.Sc. 2 Sem
19 pages
ITEC50 Lesson 1
No ratings yet
ITEC50 Lesson 1
29 pages
Unit II - III
No ratings yet
Unit II - III
28 pages
1.2 A Brief History of The Web and The Internet
No ratings yet
1.2 A Brief History of The Web and The Internet
6 pages
Unit II Ui Design
No ratings yet
Unit II Ui Design
28 pages
MS Word Notes PDF Download
No ratings yet
MS Word Notes PDF Download
47 pages
Search Engines .: Presented By: Rasik Mevada Vishal Dabhi Vimal Nair Ravi Mathai
No ratings yet
Search Engines .: Presented By: Rasik Mevada Vishal Dabhi Vimal Nair Ravi Mathai
25 pages
Webpage Design MODULE
No ratings yet
Webpage Design MODULE
129 pages
Javascript Cheat Sheet
No ratings yet
Javascript Cheat Sheet
2 pages
835 Companion Guide
No ratings yet
835 Companion Guide
17 pages
CNCH Lo2-Lo3
No ratings yet
CNCH Lo2-Lo3
44 pages
A Step by Step Guide For Operations Orchestration-NA
No ratings yet
A Step by Step Guide For Operations Orchestration-NA
18 pages
Ambulance Booking System (Project)
No ratings yet
Ambulance Booking System (Project)
3 pages
Industrial Training Presentation
No ratings yet
Industrial Training Presentation
13 pages
Release Notes: Analyzer Update October 2019
No ratings yet
Release Notes: Analyzer Update October 2019
18 pages
Downloaded From Downloaded From
No ratings yet
Downloaded From Downloaded From
9 pages
Introduction To Growing Instagram Pages PDF
No ratings yet
Introduction To Growing Instagram Pages PDF
7 pages
CV Himdyuti
No ratings yet
CV Himdyuti
1 page
Preparation
No ratings yet
Preparation
10 pages
SWOT AND TECHNICAL ANALYSIS OF CHANDIGARH UNIVERSITY With LOVELY PROFESSION UNIVERISTY AND CHITKARA UNIVERSITY
No ratings yet
SWOT AND TECHNICAL ANALYSIS OF CHANDIGARH UNIVERSITY With LOVELY PROFESSION UNIVERISTY AND CHITKARA UNIVERSITY
2 pages
1 Header and Footer
No ratings yet
1 Header and Footer
1 page
3 Different Ways To Display Progress in An ASP - Net AJAX
No ratings yet
3 Different Ways To Display Progress in An ASP - Net AJAX
7 pages
OA Framework Training Contents
No ratings yet
OA Framework Training Contents
2 pages
The State of the Internet: Living on the Network of Networks
From Everand
The State of the Internet: Living on the Network of Networks
Ryan Richardson Barrett
No ratings yet

ABSTRACT1

Uploaded by

ABSTRACT1

Uploaded by

ABSTRACT:

Gerard Salton (1960s - 1990s):

Advanced Research Projects Agency Network:

Bill Slawski has more background on Archie here.

You might also like