0% found this document useful (0 votes)
14 views

CH 1 - Intro

The document discusses various topics related to web technology including the Internet, World Wide Web (WWW), HTTP protocol, web browsers, web servers, and features of Web 2.0. It provides definitions and explanations of key concepts such as what the Internet is, how it originated from ARPANET, what the WWW is and who invented it. It also covers URL structure, domain name addressing, HTTP as a request-response protocol, and file transfer using FTP.

Uploaded by

urmi desai
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views

CH 1 - Intro

The document discusses various topics related to web technology including the Internet, World Wide Web (WWW), HTTP protocol, web browsers, web servers, and features of Web 2.0. It provides definitions and explanations of key concepts such as what the Internet is, how it originated from ARPANET, what the WWW is and who invented it. It also covers URL structure, domain name addressing, HTTP as a request-response protocol, and file transfer using FTP.

Uploaded by

urmi desai
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 31

Web Technology

CH 1. Introduction
Syllabus
– Concept of WWW
– Internet and WWW
– HTTP Protocol: Request and Response
– Web Browser and Web servers
– Features of Web 2.0
What is Internet
• Internet is an interconnection between the
computers cited at the remote locations in the
world wide web (WWW).
• Every person who makes a connection owns a
slice of the internet
• Internet is a network of networks or collection of
LANs which share a common set of protocols for
communication between the LANs.
• The internet has it’s root in the ARPANET
system of the Advanced Research Project
Agency of the US department of Defence.
What is Internet actually ?
 ARPANET (Advance Research Project Agency Network) was the
first WAN and had only four sites in 1969.

 In 1980s the national Science foundation of USA Officially created


the internet.

 A Community of people Who use and develop the networks.

 A Collection of resources that can be reached from those networks.

 A setup to facilitate collaboration among members of the research


and educational communities , world wide. Email , bulletin board

 The connected networks uses the TCP/IP Protocol communication


standard for binding networks for communication .
What is WWW ?
 Inventor of WWW is Tim Berners- Lee , he proposed idea of a global
hypertext space in which any network accessible information could
be referred to by a single universal document identifier.

 The WWW (World Wide Web) consists of information organized


into Web pages containing text and graphic images.

 It contains hypertext links, or highlighted keywords and images that


lead to related information.

 A collection of linked Web pages that has a common theme or focus


is called a Web site.
Domain Name Addressing
 Most web browsers do not use the IP address to locate Web sites and
individual pages.

 They use domain name addressing.

 A domain name is a unique name associated with a specific IP


address by a program that runs on an Internet host computer.

 This program, which coordinates the IP addresses and domain


names for all computers attached to it, is called DNS (Domain
Name System ) software.

 The host computer that runs this software is called a domain name
server.
Uniform Resource Locators
 To identify a Web pages exact location, Web browsers rely on
Uniform Resource Locator (URL).

 URL is a four-part addressing scheme that tells the Web


browser:

 What transfer protocol to use for transporting the file

 The domain name of the computer on which the file resides

 The pathname of the folder or directory on the computer on


which the file resides

 The name of the file


Structure of Uniform Resource
Locators

pathname
protocol

http://www.chicagosymphony.org/civicconcerts/index.htm

filename
Domain name
HTTP
 HTTP is an application layer protocol for WWW.

 HTTP is stateless object oriented protocol

 HTTP is stateless as neither client nor server remembers information


according to this protocol.

 HTTP is request – response protocol with following steps

 HTTP Client establish TCP connection to HTTP server

 Client sends an HTTP request to the server specifying resource it wants


to access

 Server sends an HTTP response containing desired information.


File Transfer Protocol
 To Transfer File from one machine to another machine we can
use File Transfer Protocol.

 We can load a file on ftp server from there it can be easliy


downloadable when necessary.

Commands
ftp – starts an ftp session
Get – download file from server
Put – upload file to the server
Bin – binary mode transfer ( JPEG , GIF , MS WORD )
Asc – Ascii text mode Transfer ( text based files needs to be sent )
Open – open a new connection
Close – close the connection
File Transfer Protocol
Upload myIndex.doc on FTP server

 ftp myserver.co.in // Connected to myserver.co.in

220 FTP server ready

 Name : jayesh

331 Password required for jayesh

 Password :

230user jayesh logged in

 ftp>put myindex.doc
Problems with FTP
 Firewall may be set up to stop access either outward bound or
inward toward your computer on available ports. This includes the
FTP port 21. So for Connection problem check port 21 is blocked or
not.

 FTP on 21 is not secure and it is possible to intercept and view data


that is sent as well as any passwords because data is plain text.

 If you use FTP to transfer file of your website any newly uploaded
pages may take a time to appear to browser due to factors such as
local caching on individual machines and servers.
Email
STEPS for Sending Mail

• User writing message using mail client ( MUA – mail user agent)

• Program uses SMTP to send message to the local (MTA) mail Transfer
agent which exist on the user’s ISP.

• MTA decipher email address of receipent.

• MTA uses DNS to find appropriate mail exchange server accepting


messages for the domain.

• Once mail server is found , the message is sent on using SMTP and from
there it is placed using local name to find correct mail box.

• User’s client retrieves from mail box via MUA using post office protocol
(POP3)
HTTP Message Format
• HTTP is used for Communication between browser and web server.

• First Enter URL in the browser window

http://www.yahoo.com/index.html

• URL is converted to IP address which is then used to make a connection to


server at location via port 80.

• Once the connection is established , the client application extracts the file
name from URL and sends the request down the established connection.

• When receive the server looks up the request.

• Two types of HTTP messages. Request Message and Response Message


Request Message
• Request Message contains four parts
1. Request Line
2. Header
3. An Empty Line
4. An Optional Body
Request Line : It consists of three parts . Request Type , URL and HTTP Version.
GET /index.html HTTP/1.1
– Request Type mostly used are GET and POST methods.
GET
– Get is specified when a client wants to retrieve a resource from the server. Message
body is empty for the Get Method.
– Get May also be used to send information to server for processing without using
HTML form.
Request Message
– By Get Information is sent by appending it to the URL using name-value pair.
POST
– When we send information to server it is stored in body part of request message
– Header Describes the message body such as content type and content length.
– The result of post method may be updates of an existing resource or creation of
a new resource or both.
– It is slower then GET Method
Headers :
Header consist of single line or multiple line
Header-name : Header-value
– Header name is not case sensitive but header value may be.
– If a header line starts with a space it is considered as part of the previous header
line
General Header
Request Message
It is used to specify how messages should be processed and handled.
Examples
Cache-control : no-cache // Shows whether caching should be used
Connection : Close
Date : // Shows date time when message is originated.
Request Header
It contains information about the client sending the request as well as data format that
client expects.
Examples
Accept: text/plain // the format of media the client can accept
Accept-Language : en // Natural Language the client prefers
If-Modified-Since : Wed, 14 Oct 2009 // only return is resource is modified after
specified date
Response Header
Request Message
It contains information about server and data being sent.

Examples

Age: 18 // Age of the resource in the proxy cache in seconds

Server : Apache/2.0.40 ( Red Hat Linux)

Public : GET , POST // List of Supported methods.

Entity Header

It Contains information about message body.

Content-Length : 2453 // length of response body in bytes

Expires : Thu, 15 Oct 2009 // the date and time content will modified
Response Message
Response message is sent by a server to the client. It Consists of following
paths.

– A status line

– A header

– An empty line

– An optional body

Status Line : It Consists of three parts: HTTP version , Status code, Status
phrase. Two Consecutive parts are separated by a space.

e.g.

GET /index.html HTTP/1.1 // Request LIne

HTTP/1.1 200 OK // Response Line


Status Code
Response Message
It is three digit code that indicates status of the response.

Status codes are classified into five groups

1xx series (Informational)


Status Status Description
Code Phrase
100 Continue The Server has received initial part of the
request and client may proceed further

101 Switchin The server switches the protocol on


g receiving a request from the client to do
same
102 Processin The server has received the request which
g is currently under process and no response
is available yet.
Response
2xx series (Success)
Message
Status Status Phrase Description
Code
200 OK Indicates that request was valid
201 Created The Request was successful and
desired resource was created.
202 Accepted The Request is accepted for
further processing
203 Non – The server is not Authoritative
Authoritative for information being sent back
information
204 No Content The request is processed
successfully but the body of the
response has no content
Response
3xx series (Redirection)
Message
Status Status Phrase Description
Code
300 Multiple Choices The server May provide Resources
in multiple choice like video in
multiple formats.
301 Moved Permanently The resource request no longer
exists.
4xx series (Client Error)

Status Status Phrase Description


Code
400 Bad Request The Request Contains Syntax error
and cannot be fulfilled.
401 Unauthorized The request has failed to authorized
403 Forbidden The request was valid but server is
refusing to respond it
408 Request Timeout The server timed out waiting for the
Response
5xx series (Server Error)
Message
Status Status Phrase Description
Code
500 Internal Server Error An Error message indicating that
problem has occurred in server
501 Not Implemented The server is unable to recognize the
method specified and unable to
fulfill request
502 Bad Gateway There Server is a gateway or proxy
and has received an invalid response
from downstream server.
Protocols and Programs: Web Pages
•The main protocol used for communication between a browser and a
web server is HTTP.
•For web documents the HTTP protocol works by sending commands over
a TCP connection.
•Main model used for the web is client-server.

Client
Server
(Browser)
http://www.mysports.com/index.html

•First stage: user types a URL in the browser address window.

24
Protocols and Programs: Web Pages

192.22.18.32
Port 80
Client
Server
(Browser)
http://www.mysports.com/index.html

192.22.18.32
•Second stage: URL is converted into IP address, which is then used to
make a connection to the server via port 80, the one used for HTTP and
web.

25
Protocols and Programs: Web Pages

Get index.html
Client
Server
(Browser)
HTTP/1.1
http://www.mysports.com/index.html

•Third stage: client extracts the file name from the URL and sends the
request. When received, the server looks up the request.

26
Protocols and Programs: Web Pages

HTTP/1.1 200 OK
Client
Server
(Browser)
[contents of document]
http://www.mysports.com/index.html

•Final stage: if all is well, the HTTP message is sent saying that the page
was found, followed by the page itself. When the page has been sent the
connection is dropped.

27
Web Browser
•One of the main tools used while developing web applications is
the Web Browser.

•Browser connects over the internet to a web server.

•Uses HTTP protocol

•The format of incoming web pages is HTML which is interpreted


by the browser and displayed.

•In addition to HTML, most browsers support other types of files


such as JPEG, GIF and PNG.

•Many modern browsers work within the standard versions of


HTML and XHTML.
28
Features of Web Browser
•Main function of a browser is to show text and graphics and
possibly other media.

•‘Preferences’ or ‘Tools’ section of browser allows to customize or


set up special options. (ex: set home page)

•‘Cache’, a local memory area, speeds up loading of regularly visited


pages, by storing a copy in the client computer, which it will show
when the page is requested again.

•‘Clear the Cache’ option allows to empty the cache memory at


times.
•For example, while developing web pages, if the page is cached, it
may not update when you refresh the current page view.
•We can turn caching off or put a cache size of 0.
29
Features of Web Browser
•Preferences has option of cookies which are small pieces of
information to identify you when you revisit the site.
•In some cases, you can switch off the acceptance of cookies.

•Both cached pages and cookies can be cleared from memory,


either individually or all together.

•Auto filling of forms is other option if you want your details to be


automatically added when the site gives you a form to fill out.

•Download location option allows you to select where the files you
want to download will go.

30
Web 2.0
• 2nd generation of the WWW.
• Some example of features considered to be part
of web 2.0 are listed below..
• 1)Blogs
• 2) Wikis
• 3) Social Networking
• 4) Web application
• 5) User Participation
• 6) Long tail
• 7) Rich user experience

You might also like