Chapter 1
Chapter 1
CHAPTER 1
INTRODUCTION TO DBMS
WHAT IS DATABASE
The database is a collection of inter-related data which is used to retrieve, insert and delete
the data efficiently. It is also used to organize the data in the form of a table, schema, views,
and reports, etc.
For example: The college Database organizes the data about the admin, staff, students and
faculty etc.
Using the database, you can easily retrieve, insert, and delete the information.
o It provides protection and security to the database. In the case of multiple users, it
also maintains data consistency.
1|Page
Chapter 1 Introduction to DBMS
1.1 INFORMATION:
Information is delineate because the structured, organized, and processed data, conferred
inside context, that makes it relevant and helpful to the one who desires it. Data suggests
that raw facts and figures regarding individuals, places, or the other issue, that is expressed
within the type of numbers, letters or symbols.
Information is the knowledge that is remodeled and classified into an intelligible type, which
may be utilized in the method of deciding. In short, once knowledge end up to be purposeful
when conversion, it’s referred to as info. It’s one thing that informs, in essence, it provides
a solution to a specific question. It may be obtained from numerous sources like
newspapers, the internet, television, people, books, etc.
Information can be defined as “stimuli that has meaning in some context for its receiver.
When information is entered into and stored in a computer, it is generally referred to as
data. After processing — such as formatting and printing — output data can again be
perceived as information. When information is compiled or used to better understand
something or to do something, it becomes knowledge.”
1.2 DATA:
Data is a raw and unorganized fact that is required to be processed to make it meaningful.
It can be considered as facts and statistics collected together for reference or analysis. Data
are individual units of information. In analytical processes, data are represented by
variables. Data is always interpreted, by a human or machine, to derive meaning. So, data
is meaningless. Data contains numbers, statements, and characters in a raw form.
Let’s understand it with an example, see it is a fact (or data) that an apple a day can keep
the doctor away. But why it is not told, means this doesn’t tell us about what makes apple
healthy but if it tells that apple provides different vitamins, minerals, and fiber, which keeps
us healthy and are needed by our body; then it is information as it conveys the full meaning
of the fact.
2|Page
Chapter 1 Introduction to DBMS
Data in its most basic, standalone digital format does not provide information. But when
it’s combined with other data or is manipulated in some way, that’s when the organization
derives value from the information — which then leads to knowledge.
When information is compiled or used to better understand something or to do something,
it becomes knowledge.
DIFFERENCE BETWEEN INFORMATION AND DATA:
Bits and Bytes are the measuring unit of Information is measured in meaningful
4
data. units like time, quantity, etc.
3|Page
Chapter 1 Introduction to DBMS
Data does not directly helps in decision Information directly helps in decision
8
making. making.
Data is collection of facts, which it self Information puts those facts into
9
have no meaning. context.
Data management is a functional system for collecting and analyzing raw datasets. The
main goal of this process is to help people, and organizations and optimize the use of data
by maintaining the policy and regulations. There are some key components of data
management include as:
It is the first and foremost process, where you need to collect and gather the data from
various sources in a raw format. They could be structured or unstructured. You need to sort
the data in a secure and organized manner. This step involves selecting appropriate storage
technology based on the data volume.
Collection of data is the first and foremost process, where you need to collect and gather the
data from various sources in a raw format. They could be structured or unstructured. You
need to sort the data in a secure and organized manner. This step involves selecting
appropriate storage technology based on the data volume. Processing the data is an
important process to filter the data into a structured format. It involves data cleaning,
aggregating, and enhancing the dataset to make it more meaningful. When you need to
assure the accuracy and reliability of the data, you should maintain the data quality which
involves the validation rules, and error-checking processes.
4|Page
Chapter 1 Introduction to DBMS
Maintaining data security and privacy processes implement security measures to protect
the data from unauthorized access, security breaches, and data loss by imposing data
encryption and access control. Furthermore, it is important to analyze the data by applying
the data mining process, machine learning, and data visualization processes. There are
different types of data management lifecycles in the market, by which organizations can
maintain the business and regulatory requirements to develop a durable data set. It leads
to managing the process of metadata and provides detailed information about
the structure, mining process, and data usage to perform the process effectively.
An organizational program that manages the Ensures data that underlies an organization
people, processes and technology that is available, accurate, complete and secure.
provide control over the structure,
processing, delivery and usage of information
5|Page
Chapter 1 Introduction to DBMS
The file system is basically a way of arranging the files in a storage medium like a hard disk.
The file system organizes the files and helps in the retrieval of files when they are required.
File systems consist of different files which are grouped into directories. The directories
further contain other folders and files. The file system performs basic operations like
management, file naming, giving access rules, etc.
File based systems were an early attempt to computerize the manual system. It is also called
a traditional based approach in which a decentralized approach was taken where each
department stored and controlled its own data with the help of a data processing specialist.
The main role of a data processing specialist was to create the necessary computer file
6|Page
Chapter 1 Introduction to DBMS
structures, and also manage the data within structures and design some application programs
that create reports based on file data.
Consider an example of a student's file system. The student file will contain information
regarding the student (i.e. roll no, student name, course etc.). Similarly, we have a subject file
that contains information about the subject and the result file which contains the information
regarding the result.
Some fields are duplicated in more than one file, which leads to data redundancy. So to
overcome this problem, we need to create a centralized system, i.e. DBMS approach.
7|Page
Chapter 1 Introduction to DBMS
8|Page
Chapter 1 Introduction to DBMS
Only one user can access data at a Multiple users can access data
User Access time. at a time.
Database Management System (DBMS) is a software system that is designed to manage and
organize data in a structured manner. It allows users to create, modify, and query a
database, as well as manage the security and access controls for that database.
9|Page
Chapter 1 Introduction to DBMS
DBMS provides an environment to store and retrieve the data in coinvent and efficient
manner.
DBMS allows users the following tasks:
• Data Definition: It helps in the creation, modification, and removal of definitions
that define the organization of data in the database.
• Data Updation: It helps in the insertion, modification, and deletion of the actual
data in the database.
• Data Retrieval: It helps in the retrieval of data from the database which can be
used by applications for various purposes.
• User Administration: It helps in registering and monitoring users, enforcing data
security, monitoring performance, maintaining data integrity, dealing with
concurrency control, and recovering information corrupted by unexpected
failure.
• Data modeling: A DBMS provides tools for creating and modifying data models,
which define the structure and relationships of the data in a database.
• Data storage and retrieval: A DBMS is responsible for storing and retrieving data
from the database, and can provide various methods for searching and querying
the data.
• Concurrency control: A DBMS provides mechanisms for controlling concurrent
access to the database, to ensure that multiple users can access the data without
conflicting with each other.
• Data integrity and security: A DBMS provides tools for enforcing data integrity
and security constraints, such as constraints on the values of data and access
controls that restrict who can access the data.
• Backup and recovery: A DBMS provides mechanisms for backing up and
recovering the data in the event of a system failure.
• DBMS can be classified into two types: Relational Database Management
System (RDBMS) and Non-Relational Database Management System (NoSQL or
Non-SQL)
10 | P a g e
Chapter 1 Introduction to DBMS
• RDBMS: Data is organized in the form of tables and each table has a set of rows
and columns. The data are related to each other through primary and foreign
keys.
• NoSQL: Data is organized in the form of key-value pairs, documents, graphs, or
column-based. These are designed to handle large-scale, high-performance
scenarios.
A database is a collection of interrelated data which helps in the efficient retrieval, insertion,
and deletion of data from the database and organizes the data in the form of tables, views,
schemas, reports, etc. For Example, a university database organizes the data about
students, faculty, admin staff, etc. which helps in the efficient retrieval, insertion, and
deletion of data from it.
Applications of DBMS:
• Enterprise Information: Sales, accounting, human resources, Manufacturing,
online retailers.
• Banking and Finance Sector: Banks maintaining the customer details, accounts,
loans, banking transactions, credit card transactions. Finance: Storing the
information about sales and holdings, purchasing of financial stocks and bonds.
• University: Maintaining the information about student course enrolled
information, student grades, staff roles.
• Airlines: Reservations and schedules.
• Telecommunications: Prepaid, postpaid bills
11 | P a g e
Chapter 1 Introduction to DBMS
Advantages of DBMS
o Controls database redundancy: It can control data redundancy because it stores all
the data in one single database file and that recorded data is placed in the database.
o Data sharing: In DBMS, the authorized users of an organization can share the data
among multiple users.
o Easily Maintenance: It can be easily maintainable due to the centralized nature of the
database system.
o Backup: It provides backup and recovery subsystems which create automatic backup
of data from hardware and software failures and restores the data if required.
o multiple user interface: It provides different types of user interfaces like graphical
user interfaces, application program interfaces
Disadvantages of DBMS
o Cost of Hardware and Software: It requires a high speed of data processor and large
memory size to run DBMS software.
o Size: It occupies a large space of disks and large memory to run them efficiently.
o Higher impact of failure: Failure is highly impacted the database because in most of
the organization, all the data stored in a single database and if the database is
damaged due to electric failure or database corruption then the data may be lost
forever.
There are many components available in the DBMS. Each component has a significant task in
the DBMS. A database environment is a collection of components that regulates the use of
data, management, and a group of data. These components consist of people, the technique
of Handel the database, data, hardware, software, etc. there are several components
available for the DBMS. We are going to explain five main topics of the database below.
12 | P a g e
Chapter 1 Introduction to DBMS
1. Hardware
o Here the hardware means the physical part of the DBMS. Here the hardware includes
output devices like a printer, monitor, etc., and storage devices like a hard disk.
o In DBMS, information hardware is the most important visible part. The equipment
which is used for the visibility of the data is the printer, computer, scanner, etc. This
equipment is used to capture the data and present the output to the user.
o With the help of hardware, the DBMS can access and update the database.
o The server can store a large amount of data, which can be shared with the help of the
user's own system.
o The database can be run in any system that ranges from microcomputers to mainframe
computers. And this database also provides an interface between the real worlds to
the database.
o When we try to run any database software like MySQL, we can type any commands
with the help of our keyboards, and RAM, ROM, and processor are part of our
computer system.
2. Software
o Software is the main component of the DBMS.
o Software is defined as the collection of programs that are used to instruct the
computer about its work. The software consists of a set of procedures, programs, and
routines associated with the computer system's operation and performance. Also, we
can say that computer software is a set of instructions that is used to instruct the
computer hardware for the operation of the computers.
13 | P a g e
Chapter 1 Introduction to DBMS
o The software includes so many software like network software and operating software.
The database software is used to access the database, and the database application
performs the task.
o This software has the ability to understand the database accessing language and then
convert these languages to real database commands and then execute the database.
o This is the main component as the total database operation works on a software or
application. We can also be called as database software the wrapper of the whole
physical database, which provides an easy interface for the user to store, update and
delete the data from the database.
o Some examples of DBMS software include MySQL, Oracle, SQL Server, dBase,
FileMaker, Clipper, Foxpro, Microsoft Access, etc.
3. Data
o The term data means the collection of any raw fact stored in the database. Here the
data are any type of raw material from which meaningful information is generated.
o The database can store any form of data, such as structural data, non-structural data,
and logical data.
o The structured data are highly specific in the database and have a structured format.
But in the case of non-structural data, it is a collection of different types of data, and
these data are stored in their native format.
o We also call the database the structure of the DBMS. With the help of the database,
we can create and construct the DBMS. After the creation of the database, we can
create, access, and update that database.
o The main reason behind discovering the database is to create and manage the data
within the database.
o Data is the most important part of the DBMS. Here the database contains the actual
data and metadata. Here metadata means data about data.
o For example, when the user stores the data in a database, some data, such as the size
of the data, the name of the data, and some data related to the user, are stored within
the database. These data are called metadata.
4. Procedures
o The procedure is a type of general instruction or guidelines for the use of DBMS. This
instruction includes how to set up the database, how to install the database, how to
log in and log out of the database, how to manage the database, how to take a backup
of the database, and how to generate the report of the database.
14 | P a g e
Chapter 1 Introduction to DBMS
o In DBMS, with the help of procedure, we can validate the data, control the access and
reduce the traffic between the server and the clients. The DBMS can offer better
performance to extensive or complex business logic when the user follows all the
procedures correctly.
o The main purpose of the procedure is to guide the user during the management and
operation of the database.
o The procedure of the databases is so similar to the function of the database. The major
difference between the database procedure and database function is that the
database function acts the same as the SQL statement. In contrast, the database
procedure is invoked using the CALL statement of the DBMS.
o Database procedures can be created in two ways in enterprise architecture. These two
ways are as below.
o The individual object or the default object.
o The operations in a container.
<Datatype>,...)
IS
Declaration section<variable, constant> ;
BEGIN
Execution section
EXCEPTION
Exception section
END
15 | P a g e
Chapter 1 Introduction to DBMS
The following commands serve as the base for all DDL commands:
o ALTER<object>
o COMMENT
o CREATE<object>
o DESCRIBE<object>
o DROP<object>
o SHOW<object>
o USE<object>
2. Data Manipulation Language(DML): It is used to access a database. The DML provides the
statements to retrieve, modify, insert and delete the data from the database.
The following commands serve as the base for all DML commands:
o INSERT
o UPDATE
o DELETE
o LOCK
o CALL
o EXPLAIN PLAN
6. People
o The people who control and manage the databases and perform different types of
operations on the database in the DBMS.
o The people include database administrator, software developer, and End-user.
o Database administrator-database administrator is the one who manages the complete
database management system. DBA takes care of the security of the DBMS, its
availability, managing the license keys, managing user accounts and access, etc.
o Software developer- theThis user group is involved in developing and designing the
parts of DBMS. They can handle massive quantities of data, modify and edit databases,
design and develop new databases, and troubleshoot database issues.
16 | P a g e
Chapter 1 Introduction to DBMS
o End user - These days, all modern web or mobile applications store user data. How do
you think they do it? Yes, applications are programmed in such a way that they collect
user data and store the data on a DBMS system running on their server. End users are
the ones who store, retrieve, update and delete data.
o The users of the database can be classified into different groups.
i. Native Users
ii. Online Users
iii. Sophisticated Users
iv. Specialized Users
v. Application Users
vi. DBA - Database Administrator
Database users interact with data to update, read and modify the given information on a daily
basis. There are various types of database users and we will learn in detail about them.
• End Users
o Naive users / Parametric users
o Sophisticated users
• Application Programmer or Specialized users or Back-End Developer
• System Analyst
• Database Administrator (DBA)
• Temporary Users or Casual Users
These users can access the database and recover the data using various applications.
• End Users/Parametric Users − These users access the database from the front end
with the help of a pre-developed application. They have little knowledge about the
design and working of databases.
There are two types of end users −
17 | P a g e
Chapter 1 Introduction to DBMS
o Naive Users − These naive users are those users who don’t have any database
knowledge. They depend on pre-developed applications like Bank
Management Systems, Library Management Systems, Hospital Management
Systems, and Railway Ticket Booking Systems(IRCTC) and get the desired
result.
o Sophisticated Users − These users interact with the system without writing a
program and have separate databases for personal use. In the database, the
user passes each query to the query processor.
There are two ways to interact with a system −
▪ They use the structure query language to run the query on the
database.
▪ They use the tool of data analysis software. For example, data
engineers and data scientists are familiar with databases.
• Application programmers/Specialized programmers/Back-End Developer − These
programmers write the code for an application program that uses the database. The
application programmer can make the application according to user requirements and
control software that runs an entire computer system. The application program is
written in any programming language like C#, .net, JAVA, etc., and focuses on business,
engineering, and science program.
Application Programmers are divided into four different types −
o Web Developers
o Computer Hardware Programmers
o Database Developers
o Software Developers
Examples of Application programmers develop software like −
o Content access software
o Educational Software
o Information Worker Software
o Media Development Software
o Product Engineering Software
o Enterprise Software
18 | P a g e
Chapter 1 Introduction to DBMS
• System Analyst − A System Analyst has also known as a business technology analyst.
These professionals are responsible for the design, structure, and properties of
databases. The application programmer uses the specifications provided by the
system analyst to construct the software that is used by end users. The analyst will
gather information from the shareholders as well as end users to understand their
requirements and translate it into functional specifications for the new system.
Examples of System Analysts −
o They serve as team leaders.
o They are responsible for managing projects.
o They are the supervisor who manages the lower-level information Staff.
• Database Administrator (DBA) − The DBA is the group of people that includes
everything required to manage and solve every complex. The DBA can easily use the
database to find the information they need and to plan the goal of the database. To
meet future needs, they are ready for future scope and provide solutions for end
users. Therefore, they are known for high-level management.
For example −
o To handle the data loss.
o To secure the privacy of data.
o Monitor the recovery and backup of the database.
• Temporary Users/Casual Users − These users utilize the database for testing and are
only accessible for a limited time. According to business requirements, these users
update a little or new information to the database with the help of a database
administrator. It helps to maintain the security and integrity of data.
For example
High-level management people are temporary users with little knowledge of DBMS.
o The DBMS design depends upon its architecture. The basic client/server architecture
is used to deal with a large number of PCs, web servers, database servers and other
components that are connected with networks.
o The client/server architecture consists of many PCs and a workstation which are
connected via the network.
19 | P a g e
Chapter 1 Introduction to DBMS
o DBMS architecture depends upon how users are connected to the database to get
their request done.
Database architecture can be seen as a single tier or multi-tier. But logically, database
architecture is of two types like: 2-tier architecture and 3-tier architecture.
1-Tier Architecture
o In this architecture, the database is directly available to the user. It means the user can
directly sit on the DBMS and uses it.
o Any changes done here will directly be done on the database itself. It doesn't provide
a handy tool for end users.
o The 1-Tier architecture is used for development of the local application, where
programmers can directly communicate with the database for the quick response.
2-Tier Architecture
o The user interfaces and application programs are run on the client-side.
o The server side is responsible to provide the functionalities like: query processing and
transaction management.
20 | P a g e
Chapter 1 Introduction to DBMS
3-Tier Architecture
o The 3-Tier architecture contains another layer between the client and server. In this
architecture, client can't directly communicate with the server.
o The application on the client-end interacts with an application server which further
communicates with the database system.
o End user has no idea about the existence of the database beyond the application
server. The database also has no idea about any other user beyond the application.
o The 3-Tier architecture is used in case of large web application.
21 | P a g e
Chapter 1 Introduction to DBMS
Database systems comprise complex data structures. In order to make the system efficient
in terms of retrieval of data, and reduce complexity in terms of usability of users, developers
use abstraction i.e. hide irrelevant details from the users. This approach simplifies database
design.
This is the lowest level of data abstraction. It tells us how the data is actually stored in
memory. Access methods like sequential or random access and file organization methods
like B+ trees and hashing are used for the same. Usability, size of memory, and the number
of times the records are factors that we need to know while designing the database.
Suppose we need to store the details of an employee. Blocks of storage and the amount of
memory used for these purposes are kept hidden from the user.
This level comprises the information that is actually stored in the database in the form of
tables. It also stores the relationship among the data entities in relatively simple structures.
At this level, the information available to the user at the view level is unknown.
We can store the various attributes of an employee and relationships, e.g. with the manager
can also be stored.
22 | P a g e
Chapter 1 Introduction to DBMS
The logical level thus describes the entire database in terms of a small number of relatively
simple structures. Although implementation of the simple structures at the logical level may
involve complex physical-level structures, the user of the logical level does not need to be
aware of this complexity. This is referred to as physical data independence. Database
administrators, who must decide what information to keep in the database, use the logical
level of abstraction.
This is the highest level of abstraction. Only a part of the actual database is viewed by the
users. This level exists to ease the accessibility of the database by an individual user. Users
view data in the form of rows and columns. Tables and relations are used to store data.
Multiple views of the same database may exist. Users can just view the data and interact
with the database, storage and implementation details are hidden from them. Even though
the logical level uses simpler structures, complexity remains because of the variety of
information stored in a large database. Many users of the database system do not need all
this information; instead, they need to access only a part of the database. The view level of
abstraction exists to simplify their interaction with the system
Example: In case of storing customer data,
The main purpose of data abstraction is to achieve data independence in order to save the
time and cost required when the database is modified or altered.
23 | P a g e
Chapter 1 Introduction to DBMS
Data Independence is mainly defined as a property of DBMS that helps you to change the
database schema at one level of a system without requiring to change the schema at the
next level. it helps to keep the data separated from all program that makes use of it.
We have namely two levels of data independence arising from these levels of abstraction:
• Physical level data independence
• Logical level data independence
o Logical data independence refers characteristic of being able to change the conceptual
schema without having to change the external schema.
o Logical data independence is used to separate the external level from the conceptual
view.
24 | P a g e
Chapter 1 Introduction to DBMS
o If we do any changes in the conceptual view of the data, then the user view of the
data would not be affected.
o Physical data independence can be defined as the capacity to change the internal
schema without having to change the conceptual schema.
o If we do any changes in the storage size of the database system server, then the
Conceptual structure of the database will not be affected.
o Physical data independence is used to separate conceptual levels from the internal
levels.
25 | P a g e
Chapter 1 Introduction to DBMS
It mainly concern about how the data is It mainly concerned about the structure or
stored into the system. the changing data definition.
Any change at the physical level, does not The change in the logical level requires a
require to change at the application level. change at the application level.
The modifications made at the internal The modifications made at the logical level
level may or may not be needed to improve is significant whenever the logical structure
the performance of the structure. of the database is to be changed.
It is concerned with the internal schema. It is concerned with the conceptual schema.
26 | P a g e
Chapter 1 Introduction to DBMS
REVIEW QUESTIONS
Short Questions
1. Define DBMS.
2. What is the difference between data and information?
3. Explain file-based data management.
4. What are the components of a DBMS?
5. Who are database users?
6. Define database architecture.
7. What is data abstraction?
8. Differentiate between physical and logical data independence.
9. What is a database system?
10. Name two types of database users.
11. Define data independence.
12. What are the main functions of a DBMS?
13. Describe the concept of data management.
14. Explain the term "information system."
15. What is a database schema?
16. Define the term "metadata."
17. What is the role of a database administrator (DBA)?
18. Explain the concept of database design.
19. What is a data model?
20. Differentiate between data and database.
27 | P a g e
Chapter 1 Introduction to DBMS
REVIEW QUESTIONS
Essay Questions
28 | P a g e