Unit 1 - Web Essentials
Unit 1 - Web Essentials
The Internet
The internet is a globally connected network system that uses TCP/IP to transmit data via
various types of media. The internet is a network of global exchanges – including private,
public, business, academic and government networks. It is a network of networks around
the globe.
The history of Internet was started with Department of Defense’s project called
Advanced Research Projects Agency, or ARPA in late 1960s. The ARPANET project
was intended to support DoD research on computer networking. The ARPANET
computer network was launched in 1969 and by year’s end consisted of four computers at
four sites (UCLA, Stanford Research Institute, UCSB, and the University of Utah)
running four different operating systems.
Other similar networks were developed both internationally and regionally. For
example, SURAnet (Southeastern University Research Association Network) was
organized by the University of Maryland beginning in 1982 and eventually included
essentially all of the major universities and research institutions in the southeastern
United States. Another of these networks, CSNET (Computer Science Network), was
partially funded by the U.S. National Science Foundation (NSF) to aid scientists at
universities without ARPANET access, laying the groundwork for future network
developments.
While these other networks were springing up, the ARPANET project continued to
fund research on networking. Several of the most widely used Internet protocols—
including the File Transfer Protocol (FTP) and Simple Mail Transfer Protocol (SMTP),
which underlie many of the Internet’s file transfer and e-mail operations, respectively—
were initially developed under ARPANET.
ARPANET switched from using an earlier protocol to TCP/IP during 1982. At
around the same time, an ARPA Internet was created, allowing computers on some
outside networks such as CSNET to communicate via TCP/IP with computers on the
ARPANET.
Beginning in 1985, the NSF began work on a new network based on TCP/IP, called
NSFNET. In 1988, networks in Canada and France were connected to NSFNET.
NSFNET quickly supplanted ARPANET, which was officially decommissioned in 1990.
At this point, NSFNET was at the center of the Internet, that is, the collection of
computer networks connected via the public backbone and communicating across
networks using TCP/IP.
Page 1
Chapter 1 – Web Essentials
Application Layer – This layer pprovides logic needed to support varieties of
applications. Separate module is needed for each type of application like DNS, HTTP,
SMTP etc.
Transport (Host to Host) Layer – This layer is responsible for end-to-end
communication and error-free delivery of data. Two main protocols used in this layer are
TCP and UDP.
Internet Layer – This layer is responsible for moving data among different networks.
Internet protocol (IP) is used at this layer for addressing and routing implemented in
computers and routers. A router is a processor that connects two networks and whose
primary function is to relay data from one network to the other on its route from the
source to the destination end system.
Network Access Layer – This layer defines details of how data is physically sent
through the network, including how bits are signaled by hardware devices that interface
directly with a network medium. The protocols included in this layer are Ethernet, Token
Ring, FDDI (Fiber Distributed Data Interface), Frame Relay etc.
TCP, UDP, DNS, and Domain Names
Transmission Control Protocol (TCP) – It is known to provide reliable and error-free
communication between end systems. It performs sequencing and segmentation of data.
It also has acknowledgement feature and controls the flow of the data through flow
control mechanism. It is a very effective protocol but has a lot of overhead due to such
features. Increased overhead leads to increased cost.
User Datagram Protocol (UDP) – On the other hand does not provide any such features.
It is the go to protocol if your application does not require reliable transport as it is very
cost-effective. Unlike TCP, which is connection-oriented protocol, UDP is
connectionless.
Domain Name Service (DNS) and Domain Names
While every device on the Internet has an IP address such as 192.168.0.10, humans
generally find it easier to refer to machines by names such as www.facebook.com. DNS
provides a mechanism for mapping back and forth between IP addresses and host names.
Basically, there are a number of DNS servers on the Internet, each listening through
UDP. When a computer on the Internet needs DNS services—for example, to convert a
host name such as www.facebook.com to a corresponding IP address – it uses the UDP
software running on its system to send a UDP message to one of these DNS servers,
requesting the IP address. If all goes well, this server will then send back a UDP message
containing the IP address.
Internet host names consist of a sequence of labels separated by dots. The final
label in a host name is a top-level domain. There are two standard types of top-level
domain: generic (such as .com, .edu, .org, and .biz) and country-code (such as .de, .np,
and .mx). The top-level domain names are assigned by the Internet Corporation for
Assigned Names and Numbers (ICANN), a private non-profit organization formed to
take over technical Internet functions that were originally funded by the U.S.
government.
Each top-level domain is divided into sub-domains (second-level domains), which
may in turn be further divided, and so on. The assignment of second-level domains within
each top-level domain is performed by a registry operator selected by ICANN. The
Page 2
Chapter 1 – Web Essentials
owner of a second-level domain can then further divide that domain into sub-domains,
and so on. Ultimately, the sub-domains of a domain are individual computers.
Higher Level Protocols
A variety of higher-level protocols are used to communicate once a TCP connection has
been established. SMTP and FTP are two examples of widely used higher-level protocols
that are used to communicate over TCP connections. SMTP supports transfer of e-mail
between different e-mail servers, while FTP is used for transferring files between
machines. Another higher-level TCP protocol, Telnet, is used to execute commands typed
into one computer on a remote computer. The protocol that will be used to communicate
over a TCP connection is normally determined by the port number used to establish the
connection. The primary TCP-based protocol used for communication between web
servers and browsers is called the Hypertext Transport Protocol (HTTP).
Page 3
Chapter 1 – Web Essentials
send these messages, but does expect that the request and response are both sent within a
TCP-style connection between the client and the server. So, most HTTP implementations
send these messages using TCP.
HTTP is an application layer protocol which is the foundation for data
communication for the World Wide Web. The data transferred by the protocol can be
plaintext, hypertext, audio, images, video or any Internet-accessible information. Basic
features of HTTP are:
• HTTP is connectionless: The HTTP client, i.e., a browser initiates an HTTP request
and after a request is made, the client disconnects from the server and waits for a
response. The server processes the request and re-establishes the connection with the
client to send a response back.
• HTTP is media independent: It means, any type of data can be sent by HTTP as
long as both the client and the server know how to handle the data content. It is
required for the client as well as the server to specify the content type using
appropriate MIME (Multipurpose Internet Mail Exchange) – type.
• HTTP is stateless: As mentioned above, HTTP is connectionless and it is a direct
result of HTTP being a stateless protocol. The server and client are aware of each
other only during a current request. Afterwards, both of them forget about each other.
Due to this nature of the protocol, neither the client nor the browser can retain
information between different requests across the web pages.
Page 4
Chapter 1 – Web Essentials
• Entity Body: The body of the message.
A response message has the same structure as a request message, but substitutes the
following headers for the request line and the request headers:
• Status Line: Indicates the version of HTTP used in this message and provides status
information about this response. For example, “OK” means that the request was
successfully completed.
• Response Headers: Provides additional data that expand on the status information in
the status line.
Page 5
Chapter 1 – Web Essentials
to handing out a printed brochure to customers or clients, a static website will generally
provide consistent, standard information for an extended period of time. Although the
website owner may make updates periodically, it is a manual process to edit the text,
photos and other content and may require basic website design skills and software. In
summary, visitors are not able to control what information they receive via a static
website, and must instead settle for whatever content the website owner has decided to
offer at that time.
A dynamic website is one that changes or customizes itself frequently and
automatically, based on certain criteria. Dynamic websites can have two types of
dynamic activity: Code and Content. Dynamic code is invisible or behind the scenes
and dynamic content is visible or fully displayed. In the first type, the code is constructed
dynamically on the fly using active programming language instead of plain, static HTML.
In the second type, website displays dynamic content in plain view. Variable content is
displayed dynamically on the fly based on certain criteria, usually by retrieving content
stored in a database
Page 6
Chapter 1 – Web Essentials
Page 7
Chapter 1 – Web Essentials
Page 8