DBMS Unit - 1
DBMS Unit - 1
2/6 B. TECH+M.TCH(CSE-3)
II YEAR – II SEMESTER
(2022-2023)
Introduction
The information storage and retrieval has become very important in our day-to-day life. The old era
of manual system is no longer used in most of the places. For example, to book your airline tickets or
to deposit your money in the bank the database systems may be used. The database system makes
most of the operations automated. A very good example for this is the billing system used for the
items purchased in a super market. Obviously, this is done with the help of a database application
package. Inventory systems used in a drug store or in a manufacturing industry are some more
examples of database. We can add similar kind of examples to this list.
Apart from these traditional database systems, more sophisticated database systems are used in the
Internet where a large amount of information is stored and retrieved with efficient search engines. For
instance, http://www.google.com is a famous web site that enables users to search for their favorite
information on the net. In a database we can store starting from text data to very complex data like
audio, video, etc.
Data
Data is the raw material from which useful information is derived. The word data is the plural of
Datum. Data is commonly used in both singular and plural forms. It is defined as raw facts or
observations. It takes variety of forms, including numeric data, text and voice and images. Data is a
collection of facts, which is unorganized but can be made organized into useful information. The term
Data and Information come across in our daily life and are often interchanged.
Example: Weights, prices, costs, number of items sold etc.
Information
Data that have been processed in such a way as to increase the knowledge of the person who uses the
data. The term data and information are closely related. Data are raw material resources that are
processed into finished information products. The information as data that has been processed in such
way that it can increase the knowledge of the person who uses it.
In practice, the database today may contain either data or information.
Data Processing
The process of converting the facts into meaningful information is known as data processing. Data
processing is also known as information processing.
Metadata
Data that describe the properties or characteristics of other data.
Data is only become useful when placed in some context. The primary mechanism for providing
context for data is Metadata. Metadata are data that describe the properties, or characteristics of other data.
Some of these properties include data definition, data structures and rules or constraints. The Metadata
describes the properties of data but do not include that data.
1. Not enough primary memory to process large data sets. If data is maintained in other storage
devices like disks, tapes and bringing relevant data to main memory, it increases the cost of
performance. Problem in accessing the large data due to addressing the data using 32 bit or 64 bit
mode addressing mechanism.
2. Programs must be written to process the user request to process the data stored in files which are
complex in nature because of large volume of data to be searched.
3. Inconsistent data and complexity in providing concurrent accesses.
4. Not sufficiently flexible to enforce security policies in which different users have permission to
access different subsets of the data.
A DBMS is a piece of software that is designed to make the preceding tasks easier. By storing data in a
DBMS, rather than as a collection of operating system Files, we can use the DBMS's features to manage
the data in a robust and efficient manner.
1.4 Advantages of DBMS
One of the main advantages of using a database management system is that the organization can exert
via the DBA, centralized management, and control over the data. The database administrator is the focus
of the centralized control.
The following are the major advantages of using a Database Management System (DBMS): Data
independence: Application programs should be as independent as possible from details of data
representation and storage. The DBMS can provide an abstract view of the data to insulate application
code from such details.
Efficient data access: A DBMS utilizes a variety of sophisticated techniques to store and retrieve data
efficiently. This feature is especially important if the data is stored on external storage devices.
Data integrity and security: The DBMS can enforce integrity constraints on the data. The DBMS can
enforce access controls that govern what data is visible to different classes of users.
Data administration: When several users share the data, centralizing the administration of data can offer
significant improvements. It can be used for organizing the data representation to minimize redundancy
and for fine-tuning the storage of the data to make retrieval efficient.
Concurrent access and crash recovery: A DBMS schedules concurrent accesses to the data in such a
manner that users can think of the data as being accessed by only one user at a time. Further, the DBMS
protects users from the effects of system failures. .
Reduced application development time: Clearly, the DBMS supports many important functions that
are common to many applications accessing data stored in the DBMS.
The disadvantage of the DBMS system is overhead cost. The processing overhead introduced by the
DBMS to implement security, integrity, and sharing of the data causes a degradation of the response
and throughput times. An additional cost is that of migration from a traditionally separate application
environment to an integrated one.
Even though centralization reduces duplication, the lack of duplication requires that the database be
adequately backup so that in the case of failure the data can be recovered.
Backup and recovery operations are complex in a DBMS environment, and this is an increment in a
concurrent multi-user database system. A database system requires a certain amount of controlled
redundancies and duplication to enable access to related data items.
A schema is a description of a particular collection of data, using the given data model. The relational
model of data is the most widely used model today.
Main concept: relation, basically a table with rows and columns. Every relation has a schema,
which describes the columns, or fields.
Data Model is a collection of high-level data description constructs that hide many low-level storage
details. A DBMS allows a user to define the data to be stored in terms of a data model. Most database
management systems today are based on the Relational data model. Relational models include –
IBM’s DB2, Informix, Oracle, Sybase, Microsoft’s Access, Foxbase, Paradox, Tandem and Teradata.
1.7 Categories of data models
Conceptual (high-level, semantic) data models: Provide concepts that are close to the way many
users perceive data (Also called entity-based or object-based data models).
Physical (low-level, internal) data models: Provide concepts that describe details of how data is
stored in the computer.
Implementation (representational) data models: Provide concepts that fall between the above
two.
1. Hierarchical models:
Advantages:
Hierarchical model is simple to construct and operate on.
Corresponds to a number of natural hierarchical organized domains – e.g., assemblies in
manufacturing, personal organization in companies.
Language is simple; uses constructs like GET, GET UNIQUE, GET NEXT, GET NEXT
WITHIN PARENT etc.,
Disadvantages:
Navigational and procedural nature of processing.
Database is visualized as a linear arrangement of records.
Little scope for “query optimization”.
One-to-many relationships.
2. Network model:
Advantages:
Network model is able to model complex relationships and represents semantics of add/delete
on the relationships.
Can handle most situations for modeling using record types and relationship types.
Language is navigational; uses constructs like FIND, FIND member, FIND owner, FIND
NEXT within set, GET etc. Programmers can do optimal navigation through the database.
Disadvantages:
Conceptual schema:
The conceptual schema(also called as logical schema) describes the stored data in terms of the data
model of the DBMS.
In a relational DBMS, the conceptual schema describes all relations that are stored in the
database.
In our sample university database, these relations contain information about entities, such as
students and faculty, and about relationships, such as students’ enrollment in courses.
Students(sid: string, name: string, login: string, age: integer, gpa:real)
Faculty(fid: string, fname: string, salary : real) Courses(cid: string, cname: string, credits: integer)
Rooms(nw: integer, address: string, capacity: integer)Enrolled (sid: string, cid: string, grade: string)
Teaches (fid: string, cid: string)
The choice of relations, and the choice of fields for each relation, is not always obvious, and
the process of arriving at a good conceptual schema is called conceptual database design.
Physical Schema:
External Schema:
This schema allows data access to be customized at the level of individual users or groups of
users.
A database has exactly one conceptual schema and one physical schema, but it may have
several external schemas.
An external schema is a collection of one or more views and relations from the conceptual
schema.
A view is conceptually a relation, but the records in a view are not stored in the DBMS.
The task of designing and maintaining the database is entrusted to a professional called the
database administrator.
Design of the conceptual and physical schemas: The DBA is responsible for interacting with the
users of the system to understand what data is to be stored in the DBMS and how it is likely to be
used. Based on this knowledge, the DBA must design the conceptual schema (decide what
relations to store) and the physical schema (decide how to store them).
Security and authorization: The DBA is responsible for ensuring that unauthorized data access is
not permitted. In general, not everyone should be able to access all the data. In a relational DBMS,
users can be granted permission to access only certain views and relations.
Data availability and recovery from failures: The DBA must take steps to ensure that if the system
fails, users can continue to access as much of the uncorrupted data as possible.
Database tuning: The needs of users are likely to evolve with time. The DBA is responsible for
modifying the database, the conceptual and physical schemas, to ensure adequate performance as
user requirements change.
The architecture of a DBMS can be seen as either single tier or multi-tier. The tiers are classified as
follows:
1- tier architecture:
One-tier architecture involves putting all of the required components for a software application or
technology on a single server or platform.
2- tier architecture:
The two-tier is based on Client Server architecture. The two-tier architecture is like client server
application. The direct communication takes place between client and server. There is no intermediate
between client and server.
3- tier architecture:
A 3-tier architecture separates its tiers from each other based on the complexity of the users and how
they use the data present in the database. It is the most widely used architecture to design a DBMS.
Gradually, DBMS systems started to exploit the available processing power at the user side, which
led to client/server DBMS architectures.
Review Questions