Unit 5 World Wide Web
Unit-V
World Wide Web
The World Wide Web (WWW) is an architectural framework for accessing linked documents, called
web pages, that are spread over thousands of computers all over the world.
The WWW is a set of programs, standards and protocols that allow text, images,
animations, sounds and videos to be stored, accessed and linked together in the form of
websites.
The WWW project was developed at CERN, the European Center for Nuclear
Research in 1989.
From the user's point of view, the web consists of a vast worldwide collection of
documents called web pages. These web pages reside on different sites or
machines all over the world.
Each web page can contain links to other pages anywhere in the world.
A piece of text or an image that lets the user jump to another web page is
called a hyperlink.
WWW Architecture
A string of text that points to another web page is called hypertext. Unlike normal text, hypertext changes
the mouse pointer into a hand-shaped cursor when the pointer moves over it. Such text is often
underlined and blue in color.
Hypermedia is an enhanced form of hyperlink that not only links to other pages or to other sections within
the same page but can also link to various media such as sound, animation and movie clips. Hypermedia is
a grouping of different media, like sound, graphics, animations and text, in a single file.
These hyperlinks are created with the help of a specialized language called HyperText Markup Language
(HTML).
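As an illustration of how hyperlinks appear in HTML, the sketch below pulls the link targets out of a small page using Python's standard html.parser module. The class name LinkExtractor, the helper extract_links and the sample markup are assumptions made for this example, not part of the course material.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collects the href target of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the opening tag
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(html_text):
    """Return the hyperlink targets found in html_text, in document order."""
    parser = LinkExtractor()
    parser.feed(html_text)
    parser.close()
    return parser.links


# Hypothetical sample page with one link to another page and one within the page
SAMPLE_PAGE = (
    '<p>See the <a href="https://example.com/about.html">about page</a> '
    'or go back to the <a href="#top">top</a>.</p>'
)

print(extract_links(SAMPLE_PAGE))
```

The first link points to another page, while the second points to a section within the same page, matching the two kinds of link described above.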
To make these web pages accessible across different sites, each page has a specific address called a Uniform
Resource Locator (URL).
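A URL is made up of several parts. The following sketch splits one into its components using urllib.parse from the Python standard library. The function name describe_url and the example address are assumptions chosen for illustration.

```python
from urllib.parse import urlsplit


def describe_url(url):
    """Split a URL into its main components and return them as a dict."""
    parts = urlsplit(url)
    return {
        "scheme": parts.scheme,      # protocol used to fetch the page
        "host": parts.hostname,      # machine on which the page resides
        "port": parts.port,          # None when the URL does not name a port
        "path": parts.path,          # location of the page on that machine
        "query": parts.query,        # optional parameters sent to the server
        "fragment": parts.fragment,  # optional section within the page
    }


# Hypothetical address used only for this example
info = describe_url("http://www.example.com:8080/docs/index.html?lang=en#intro")
for name, value in info.items():
    print(f"{name:9}{value}")
```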
Static web pages are created using only HTML. Static web pages are
used only when the information no longer needs to be modified.
Dynamic Web page
SERVER-SIDE DYNAMIC WEB PAGE
It is created using server-side scripting. Parameters passed to the
server-side script determine how a new web page is assembled,
which can also include setting up further client-side processing.
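A minimal sketch of the idea, assuming a server-side script written in Python: the page does not exist as a file but is assembled from parameters each time it is requested. The function name render_greeting_page and its parameters are hypothetical.

```python
from html import escape


def render_greeting_page(visitor, items):
    """Assemble a new HTML page from the parameters of one request."""
    # escape() prevents visitor-supplied text from being read as markup
    rows = "\n".join(f"      <li>{escape(item)}</li>" for item in items)
    return (
        "<html>\n"
        "  <body>\n"
        f"    <h1>Welcome, {escape(visitor)}</h1>\n"
        "    <ul>\n"
        f"{rows}\n"
        "    </ul>\n"
        "  </body>\n"
        "</html>"
    )


# Two different requests produce two different pages from the same script
print(render_greeting_page("Asha", ["Unit 4 notes", "Unit 5 notes"]))
print(render_greeting_page("Ravi <guest>", ["Unit 5 notes"]))
```

A static page, by contrast, would be the same stored HTML text for every visitor.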
A web browser can show text, audio, video, animation and more. It is
the responsibility of the web browser to interpret the text and commands
contained in the web page.
Web Browser
Browser     Vendor
Safari      Apple
K-Meleon    K-Meleon
Web Browser Architecture
The browser's high level structure
The user interface: this includes the address bar, back/forward buttons, bookmarking menu, etc. It covers every part of the
browser display except the window where you see the requested page.
The browser engine: marshals actions between the UI and the rendering engine.
The rendering engine: responsible for displaying requested content. For example, if the requested content is HTML, the
rendering engine parses HTML and CSS, and displays the parsed content on the screen.
Networking: for network calls such as HTTP requests, using different implementations for different platforms behind a
platform-independent interface.
UI backend: used for drawing basic widgets like combo boxes and windows. This backend exposes a generic interface that is
not platform specific. Underneath, it uses operating system user interface methods.
Data storage: this is a persistence layer. The browser may need to save all sorts of data locally, such as cookies. Browsers also
support storage mechanisms such as localStorage, IndexedDB, WebSQL and FileSystem.
Browser Engine
plug-ins
extensions
add-ons
Rendering Engine
Gecko (Firefox)
Presto (Opera)
Trident (MSIE)
JavaScript programs must be executed immediately when the engine encounters the
JS source code (possibly loaded from an external URL).
A website is a collection of web pages, while a web server is software that responds
to requests for web resources.
Example
S.N. Web Server Description
1. Apache HTTP Server
This is the most popular web server in the world, developed by the Apache Software Foundation. The Apache
web server is open source software and can be installed on almost all operating systems, including Linux,
UNIX, Windows, FreeBSD, Mac OS X and more. About 60% of web server machines run the Apache
Web Server.
2. Internet Information Services (IIS)
Internet Information Services (IIS) is a high-performance web server from Microsoft. This web server
runs on Windows NT-family platforms, including Windows Server 2012 (and possibly on newer Windows versions as well). IIS
comes bundled with Windows; because it is tightly integrated with the operating system, it is relatively
easy to administer.
Web Server Working
When a client sends a request for a web page, the web server searches for the
requested page. If the page is found, the server sends it to the client with
an HTTP response.
If the requested web page is not found, the web server will then send an HTTP
response: Error 404 Not Found.
If the client has requested some other resource, the web server will
contact the application server and data store to construct the HTTP
response.
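The three cases above can be sketched as a single function. This is a simplified model rather than a real server: the pages live in an in-memory dictionary, and a plain Python function stands in for the application server and data store. All names here (STATIC_PAGES, DYNAMIC_RESOURCES, current_stock, handle_request) are hypothetical.

```python
# Stored web pages, keyed by the requested path (stand-in for files on disk)
STATIC_PAGES = {
    "/index.html": "<h1>Home</h1>",
    "/about.html": "<h1>About</h1>",
}


def current_stock():
    """Stand-in for an application server that consults a data store."""
    items_in_store = 3  # assumed value; a real system would query a database
    return f"<p>Items in stock: {items_in_store}</p>"


# Other resources whose response has to be constructed on request
DYNAMIC_RESOURCES = {"/stock": current_stock}


def handle_request(path):
    """Return (status code, reason phrase, body) for a requested path."""
    if path in STATIC_PAGES:
        # Case 1: the requested page is found and sent back as it is
        return 200, "OK", STATIC_PAGES[path]
    if path in DYNAMIC_RESOURCES:
        # Case 3: the response is constructed by the application server
        return 200, "OK", DYNAMIC_RESOURCES[path]()
    # Case 2: nothing matches the request
    return 404, "Not Found", "<h1>404 Not Found</h1>"


for requested in ("/index.html", "/stock", "/missing.html"):
    status, reason, body = handle_request(requested)
    print(requested, "->", status, reason)
```

A production web server such as Apache or IIS does far more (connection handling, headers, security), so this only mirrors the decision flow described in the text.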
Search Engine
A search engine refers to a huge database of internet resources such as
web pages, newsgroups, programs, images etc. It helps to locate
information on the World Wide Web.
Web Crawler
Database
Search Interfaces
Search Engine Components
Web crawler
It is also known as a spider or bot. It is a software component that traverses the web to gather information.
Database
All the information gathered from the web is stored in a database. It holds a huge collection of web resources.
Search Interfaces
This component is an interface between the user and the database. It helps the user to search through the
database.
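To make the crawler and database components concrete, the following sketch traverses a tiny in-memory "web" by following hyperlinks and stores every page it reaches. It is a toy under stated assumptions: TOY_WEB, its page names and the crawl function are invented for this example, links are found with a simple regular expression, and no network access takes place.

```python
import re
from collections import deque

# Hypothetical miniature web: page name -> HTML text of that page
TOY_WEB = {
    "a.html": '<a href="b.html">B</a> and <a href="c.html">C</a>',
    "b.html": '<a href="a.html">back to A</a>',
    "c.html": "a page without links",
    "d.html": '<a href="a.html">A</a>',  # nothing links here, so it is never found
}

# Simplified link pattern; real HTML needs a proper parser
HREF = re.compile(r'href="([^"]+)"')


def crawl(start, web):
    """Follow links breadth-first from start and return the pages gathered."""
    seen = {start}          # pages already queued, so none is visited twice
    queue = deque([start])  # pages waiting to be visited
    database = {}           # the crawler's store: page name -> page text
    while queue:
        url = queue.popleft()
        text = web.get(url)
        if text is None:
            continue  # broken link: the page does not exist
        database[url] = text
        for link in HREF.findall(text):
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return database


gathered = crawl("a.html", TOY_WEB)
print(list(gathered))
```

Pages that no visited page links to, like d.html here, never enter the database, which is why crawlers depend on hyperlinks to discover content.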
Search Engine Working
The search engine looks for the keyword in the index of a predefined
database instead of going directly to the web to search for the keyword.
It then uses software to search for the information in the database. This
software component is known as a web crawler.
Once the web crawler finds the pages, the search engine shows the
relevant web pages as a result. The retrieved web pages generally
include the title of the page, the size of the text portion, the first several sentences etc.
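Looking a keyword up in an index rather than scanning the web can be sketched with an inverted index, which maps each word to the set of pages containing it. Combining keywords with AND, OR and NOT then becomes a set operation. The sample pages and the names build_index and lookup are assumptions for this example, and real search engines use far more elaborate indexing and ranking.

```python
import re
from collections import defaultdict

# Hypothetical pages already gathered into the database
PAGES = {
    "p1": "Apache web server",
    "p2": "Web browser engine",
    "p3": "Search engine index",
}


def build_index(pages):
    """Map every lowercased word to the set of pages that contain it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in re.findall(r"[a-z0-9]+", text.lower()):
            index[word].add(url)
    return index


def lookup(index, word):
    """Return the pages containing word, or an empty set if there are none."""
    return index.get(word.lower(), set())


index = build_index(PAGES)
web = lookup(index, "web")
engine = lookup(index, "engine")

print(sorted(web))           # pages containing "web"
print(sorted(web & engine))  # web AND engine: narrows the result
print(sorted(web | engine))  # web OR engine: widens the result
print(sorted(web - engine))  # web NOT engine: excludes pages
```

The index is built once in advance, so answering a query costs only a dictionary lookup and a few set operations.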
The web crawler, the database and the search interface are the major components
of a search engine that actually make it work. Search
engines make use of the Boolean operators AND, OR and NOT to restrict and
widen the results of a search. Following are the steps that are performed
by the search engine: