Wordwibe Web HTML Search Engines Lecture-13

Download as ppt, pdf, or txt
Download as ppt, pdf, or txt
You are on page 1of 23

CS-100

ICT
MEHREEN TAHIR
More about webs

LECTURE-13
How the Browser Finds Things:
URLs
• URLS: ADDRESSES FOR WEB PAGES
– Before your browser can connect with a website,
it needs to know the site’s address, the URL.
– The URL (Uniform Resource Locator) is a string of
characters that points to a specific piece of
information anywhere on the web.
– In other words, the URL is the website’s unique
address.
How the Browser Finds Things:
URLs
• A URL consists of
– (1) the web protocol
– (2) the domain name or web server name
– (3) the directory (or folder) on that server, and
– (4) the file within that directory (perhaps with an
extension such as html or htm ).
How the Browser Finds Things:
URLs
• Consider the following example of a URL for a
website offered by the National Park Service
for Yosemite National Park:
How the Browser Finds Things:
URLs
• The protocol: http://
– It stands for HyperText Transfer Protocol (HTTP) ,
the communications rules that allow browsers to
connect with web servers.
• Note: Most browsers assume that all web addresses
begin with http://, and so you don’t need to type this
part; just start with whatever follows, such as www.
How the Browser Finds Things:
URLs
• The domain name (web server name):
– www.nps.gov/A domain is simply a location on
the internet, the particular web server.
– Domain names tell the location and the type of
address.
– Domain-name components are separated by
periods (called “dots”).
How the Browser Finds Things:
URLs
– The last part of the domain, called the top-level
domain, is a three-letter extension that describes
the domain type: .
gov, .com, .net, .edu, .org, .mil, .int— government,
commercial, network, educational, nonprofit,
military, or international organization.
How the Browser Finds Things:
URLs
– In our example, the www stands for “World Wide
Web,” of course; the . nps stands for “National
Park Service,” and . gov is the top-level domain
name indicating that this is a government website.
– Some top-level domain names also include a two-
letter code extension for the country—for
example, . Us for United States, . ca for Canada, .
mx for Mexico, . uk for United Kingdom, . jp for
Japan, . in for India, . cn for China. These country
codes are optional.
The Nuts & Bolts of the Web: HTML
& Hyperlinks
• The basic communications protocol that
makes the internet work, as we described, is
TCP/IP.
• The communications protocol used to access
that part of the internet called the World
Wide Web, we pointed out, is called
HyperText Transfer Protocol (HTTP).
The Nuts & Bolts of the Web: HTML
& Hyperlinks
• A hypertext document uses hypertext markup
language (HTML), which uses hypertext links,
to connect with other documents.
• The foundations of the World Wide Web,
then, are HTML and hypertext links.
The Nuts & Bolts of the Web: HTML
& Hyperlinks
• HYPERTEXT MARKUP LANGUAGE (HTML)
– Hypertext markup language (HTML) is the set of
special instructions (called “tags” or “markups”)
that are used to specify document structure,
formatting, and links to other multimedia
documents on the web.
– Extensible hypertext markup language (XHTML) is
the successor to and the current version of HTML.
The Nuts & Bolts of the Web: HTML
& Hyperlinks
– The need for a stricter version of HTML was
perceived primarily because World Wide Web
content now needs to be delivered to many
devices (such as mobile phones) that have fewer
resources than traditional computers have.
The Nuts & Bolts of the Web: HTML
& Hyperlinks
• HYPERTEXT LINKS
– Hypertext links —also called hyperlinks, hotlinks,
or just links —are connections to other
documents or web pages that contain related
information; a word or phrase in one document
becomes a connection to a document in a
different place.
– Hyperlinks appear as underlined or colored words
and phrases.
The Nuts & Bolts of the Web: HTML
& Hyperlinks
– On a home page, for instance, the hyperlinks
serve to connect the main page with other pages
throughout the website. Other hyperlinks will
connect to pages on other websites, whether
located on a computer next door or one on the
other side of the world
Browser
Web Portal
• A web portal, or simply portal, is a type of
gateway website that functions as an
“anchor site” and offers a broad array of
resources and services, online shopping
malls, email support, community forums,
current news and weather, stock quotes,
travel information, and links to other popular
subject categories.
Web Portal
• Portals may be general public portals
(horizontal portals or megaportals), such as
Yahoo!, Google, Bing (formerly MSN), Lycos,
and AOL
• There are also specialized portals—called
vertical portals, or vortals, which focus on
specific narrow audiences or communities.
Search Engines
• Search services are organizations that
maintain databases accessible through
websites to help you find information on the
internet.
• Examples are not only parts of portals such as
Yahoo! and Bing/MSN but also Google,
Ask.com, and Gigablast, to name just a few.
Search Engines
• Search services also maintain search engines,
programs that enable you to ask questions or
use keywords to help locate information on
the web.
Search Engines
• Search services compile their databases by
using special programs called spiders —also
known as crawlers, bots (for “robots”), or
agents —that crawl through the World Wide
Web, following links from one web page to
another and indexing the words on that site.
Search Engines
• This method of gathering information has two
important implications:
– A search never covers the entire web
– Search engines differ in what they cover

You might also like