Data Base System: UNIT - 16
Data Base System: UNIT - 16
UNIT 16
Structure
16.1.
Introduction
16.2.
What is a Database?
16.3.
16.4.
Database Technology
16.5.
Types of Databases
16.6.
16.7.
Summary
16.8.
Review Questions
16.1. Introduction
Data is a vital organizational resource which is an important input in an
information system. This data resource is traditionally called the database. It is
from this database that data is processed and converted into information to
satisfy information needs of the organisation. Nowadays we find that the
9
internal and external information resources available to organisations are
increasing at a rapid rate, due to which databases are becoming larger and
larger in size. At the same time, business environment has forced the
businesses to take quick and right decisions for which databases are required
to be queried frequently. Queries may be varied, e.g. one manager may be
interested to know the names of all those products for which sales in the
current year exceed that of the previous year, one may require information on
the total amount outstanding, or one may require the list of products having a
market share greater than 30 per cent and so on. To correctly process varied
types of queries and to ensure a fast response time the use of computer-based
information systems have become a necessity of any business.
To meet the objective of fast retrieval of data, computer-based information
system should be able to organize, store and manage data effectively and
efficiently. There are two main methods to organize data on computer media,
which are known as files and databases.
10
analyze or print data in a specific layout using reports. The advantage of
storing data in a database is that data are stored only once in a single location,
can be viewed from multiple locations, and if updated at a single point, it is
automatically updated everywhere it appears.
A database is a collection of records in a structured format. Data in a
structured format are easily retrievable for analysis and calculations. It is
possible to query data that exist in a structured format. For example, if a
manager wants to select the best customer, he need not ask the information
systems department for a report on sales figures, but can simply compare the
total amounts and customer ID fields for all the records in the sales order
database and select the best customer, as shown in Table 16-1.
Table 16-1 Database of online sales orders
Order
ID
1
2
3
4
5
6
7
8
9
10
Customer ID
Stop N Shop
Stop N Shop
Stop N Shop
Stop N Shop
Old World
Old World
Old World
Canyon Grocery
Canyon Grocery
Canyon Grocery
Employee
ID
Steven
Steven
Nancy
Andrew
Steven
Nancy
Andrew
Andrew
Janet
Andrew
Order date
2/2/07
3/14/00
4/18/07
4/21/07
2/1/07
4/9/07
4/11/07
2/18/07
3/26/07
4/1/07
PO
number
52
32
87
125
145
444
555
55
222
666
Total
amount
231.95
87.3
144.65
11.3
42.45
31.6
52.3
52.3
137.1
136.65
Objectives of Database
They provide a guide to the database requirements process and data modelling.
The database objectives are:
11
i)
ii)
12
logical views. The database management software makes the physical
database available for different logical views presented for various application
programs. For example, an employee retirement benefits program might use a
logical view of the human resources database that requires only the
employees name, address, social security number, pension plan, and
retirement benefits data.
A database management system has three components:
i)
ii)
13
locations, and so on. A data element represents a field. In addition to listing the
standard name, the dictionary lists the names that reference this element in
specific systems and identifies the individuals, business functions, programs,
and reports that use this data element.
By creating an inventory of data contained in the database, the data dictionary
serves as an important data management tool. For instance, business users
could consult the dictionary to find out exactly what pieces of data are
maintained for the sales or marketing function or even to determine all the
information maintained by the entire enterprise. The dictionary could supply
business users with the name, format, and specifications required to access
data for reports. Technical staff could use the dictionary to determine what
data elements and files must be changed if a program is changed.
Most data dictionaries are entirely passive; they simply report. More advanced
types are active; changes in the dictionary can be automatically used by related
programs. For instance, to change ZIP codes from five to nine digits, one
could simply enter the change the dictionary without having to modify all
application programs using ZIP codes.
In an ideal database environment, the data in the database are defined only
once and used for all applications whose data reside in the database, thereby
eliminating data redundancy and inconsistency. Application programs, which
are written using a combination of the data manipulation language of the
DBMS and a conventional programming language, request data elements from
the database. Data elements called for by the application programs are found
and delivered by the DBMS. The programmer does not have to specify in
detail how or where the data are to be found.
A DBMS can reduce program-data dependence along with program
development and maintenance costs. Access and availability of information
can be increased because users and programmers can perform ad hoc quires of
data in the database. The DBMS allows the organization to centrally manage
data, their use and security. The Window on Management illustrates some of
these benefits.
14
15
microcomputers and powerful telecommunication systems will further boost
the growth of distributed systems.
Client-Server Systems
These systems are closely related to the concept of distributed database. In the
client/server model, the database and processing power are distributed over the
organisation rather than having a centralized database. This model splits
processing between clients and servers on a network, assigning these
functions to the machine that it is most able to perform.
Servers, in general, are high performance machines that support heavy
transaction processing known as server processes, whereas the clients are lowend microcomputers with rich graphical user interface (GUI). Client-servers
are growing in popularity these days and are being used by a large number of
organisations.
Operational Databases
Operational databases store detailed data needed to support the business
processes and operations of a company. They are also called subject area
databases (SADB), transaction databases, and production databases. Examples
are a customer database, human resource database, inventory database, and
other database containing data generated by business operations. This includes
databases of Internet and electronic commerce activity, such as click stream
data describing the online behavior of customers or visitors to a companys
website.
16
Distributed
Databases
on Intranets
and Other
Networks
Client PC
or NC
External
Databases on
the Internet
and Online
Network
Server
Services
Operational
Databases of
the
Organization
End User
Databases
Data
Warehouse
Data
Marts
Distributed Databases
Many organisations replicate and distribute copies or parts of databases to
network servers at a variety of sites. These distributed databases can reside on
network servers on the World Wide Web, on corporate intranets or extranets,
or on other company networks. Distributed databases may be copies of
operational or analytical databases, hypermedia or discussion databases, or
any other type of database. Replication and distribution of databases is done to
improve database performance and security. Ensuring that all of the data in an
organizations distributed databases are consistently and concurrently updated
is a major challenge of distributed database management.
External Databases
Access to a wealth of information from external databases is available for a
fee from commercial online services, and with or without charge from many
sources on the Internet, especially the World Wide Web. Websites provide an
endless variety of hyperlinked pages of multimedia documents in hypermedia
databases for you to access. Data are available in the form of statistics on
17
economic and demographic activity from statistical data banks. Or you can
view or download abstracts or complete copies of hundreds of newspapers,
magazines, newsletters, research papers, and other published material and
other periodicals from bibliographic and full text databases.
Hypermedia Databases
The rapid growth of websites on the Internet and corporate intranets and
extranets has dramatically increased the use of databases of hypertext and
hypermedia documents. A website stores such information in a hypermedia
database consisting of hyperlinked pages of multimedia (text, graphic, and
photographic images, video clips, audio segments, and so on). That is, from a
database of interrelated hypermedia page elements, rather than interrelated
data records.
Data Warehouses
They are designed for strategic decision support and built from operational
databases. They contain a vast amount of data. Smaller, local data warehouses
are called data marts. A data warehouse is a must where high volumes of data
processing is required, cross-functional flow of information is required, single
and centralized data source is a necessity, and increased quality and
consistency of an organizations data are a must.
Unstructured Databases
Nowadays most office tasks are computerized. Documents exist in the text
format; GIS store data in the form of maps and locations. A database for
software engineering might store program statements, while a chemical
database might store protein structures. Multimedia data exist in the form of
audio, video, images, etc. All these kinds of data are known as unstructured
data. The web is emerging as a universal data repository, offering access to
sources whose data organization varies from strictly structured databases to
almost completely unstructured pages. Consequently, much research has
recently focused on data integration and data translation systems, whose goals
18
are to allow applications to utilize data from many sources, with possibly
widely varying formats.
Customer ID
Order Date
Order ID
Product Code
PO Number
Sales Tax Rate
Ship Date
Shipping Method
Authorized signatory
Figure 16-2 A sales order online form.
Unit Price
Quantity
Amount
Order Subtotal
Sales Tax
Total Amount
Payment Made
Balance Amount Due
19
For example, a manager might want to create a database to store order details
from a sales order form as shown in Figure 16-2. Here the facts about which
he will need to store data will be the fields in Table 16-1.
The process of database designing starts with understanding the traditional
method of work. People who will use the database need to be consulted. A
manager should brainstorm about the questions he would like the database to
answer. He should sketch out the reports he would like it to produce. The
forms currently used to record data should be analysed, and similar, welldesigned databases should be examined.
20
customer table may include customer name, address, city, and phone number
fields. Some points to remember while creating a table are:
Relate each field directly to the subject of the table.
Do not include derived or calculated data (data the result from an
expression).
Include all the information needed.
Store information in its smallest logical parts (for example, first name and
last name, rather than name).
RDBMS
Not necessary for prior definition of
relationship.
User can interact with database in a
non-procedural way.
Re-organisation of database is
flexible.
21
16.7. Summary
A database is a collection of information in the digital form related to a
particular subject or purpose, such as tracking customer orders or maintaining
a music collection. A database is a collection of records in a structured format.
A database management system (DBMS) is simply the software that permits
an organization to centralize data, manage them efficiently, and provide access
to the stored data by application programs. The DBMS acts as an interface
between application programs and the physical data files. A database
management system has three components: A data definition language, A data
manipulation language, and A data dictionary.
There are the basic steps in designing a database: i) Determine the purpose of
the database. ii) Determine the tables needed by the organization in the
database. iii) Determine the fields needed in the tables. iv) Identify the primary
unique keys and v) Determine the relationships between tables.