RDBMS-Unit 1-Dr - Sandeep
RDBMS-Unit 1-Dr - Sandeep
Unit I:
Introduction to database
D r. S a n d e e p G a i k w a d
• Unit test
• Assignments
• Case study
• Quiz
• Attendance
• Class Behavior
Data can be defined as a representation of facts, concepts, or instructions in a formalized manner, which
should be suitable for communication, interpretation, or processing by human or electronic machine.
Computers represent data, including video, images, sounds and text, as binary
values using patterns of just two numbers: 1 and 0.
VS
Computer systems work with different types of digital data. In the early days of computing,
data consisted primarily of text and numbers, but in modern-day computing, there are
lots of different multimedia data types, such as audio, images, graphics and video.
There are two general ways to represent data: analog and digital. Analog data are continuous.
They are 'analogous' to the actual facts they represent.
Digital data are discrete, broken up into a limited number of elements.
Nature is analog, while computers are digital.
Many aspects of our natural world are continuous in nature. For example, think of the
spectrum of colors. This is a continuous rainbow of an infinite number of shades.
All data are stored in binary digits, and there is a limit to how much data we can represent. For example, a color
image on a computer has a limited number of colors - the number might be very large, but it is still finite.
One of the most basic data types is plain text. In database terminology, this is referred to as a character string, or
simply a string. A string represents alphanumeric data. This means that a string can contain many different
characters, but that they are all considered as if they were text and not put into calculations, even if the characters
are numbers.
The second most important data type is numeric data. As a general rule, you store numbers only as a numeric data
type if they represent a count or measurement of some kind and if it makes sense to perform calculations with
them. A ZIP code is a number assigned to a geographic area by the postal service. It would not make much sense
to determine the average value for multiple ZIP codes.
There are several different types of numeric data. An integer is a numeric value without a decimal. Integers are
whole numbers and can be positive or negative
A number with a decimal is referred to as a decimal, a float or a double. The terminology varies somewhat with
the software being used.
The Boolean data type can only represent two values: true or false. Typically, a zero is used to represent false and
a one is used to represent true. In the example table of students, the field Financial Aid is stored as a Boolean,
since a student is classified as having financial aid or not.
Dates and times have their own data type to distinguish them from numeric data. There are many different ways
to format dates and times. Some common ways to format dates are:
Day of the week, month and day, year - for example Sunday, April 1, 2012
An image, digital image, or still image is a binary representation of visual information, such as drawings,
pictures, graphs, logos, or individual video frames. Digital images can be saved electronically on any storage
device. The following example image is of an Intel Pentium CPU. In this example, a digital camera was used to
take the photograph, and then the text was added using photo editing software to create an image
Images are stored on computers in bitmap or vector form.
As can be seen in the above image, Pixel(A) has value (255, 0, 255) and is determined by the combination of
intensities stored in the red colour plane, green colour plane and blue colour plane respectively.
Similarly, pixel(B) has value (127, 255, 0) and is determined in the same manner as pixel(A).
an RGB image is basically a M*N*3 array of colour pixel, where each colour pixel is associated with three values
which correspond to red, blue and green colour component of RGB image at a specified spatial location.
So, the colour of any pixel is determined by the combination of the red, green, and blue intensities stored in each
colour plane at the pixel’s location.
Here each colour plane is a M*N array.
The database is a collection of inter-related data which is used to retrieve, insert and delete
the data efficiently. It is also used to organize the data in the form of a table, schema, views,
and reports, etc.
The database is a collection of inter-related data which is used to retrieve, insert and delete the data
efficiently. It is also used to organize the data in the form of a table, schema, views, and reports, etc.
1976
• P-Chen introduces a new database model known as ER Entity-Relationship.
• This design is basically made to focus on data applications.
• It gave another important insight into the conceptual data model.
• Such a higher level molding allows the designers to focus on the data instead of a logical structure.
Early 1980’s
• SQL(Structured Query Language) became standard. IBM launched DB2.
• The development of the IBM PC gave rise to many DB companies and products such as RIM
• , OS/2, PARADOX, RBASE, Database Management System also introduce in this period.
• The relational database system became a success as rapid increase in computer sales.
• This increased the popularity of the network and hierarchical database models.
Early 1990’s
In early 1990s much work was done on client tools for application as development such as PowerBuilder. VB (Microsoft),
Oracle Developer. The client-server model for computing became the norm for future business decisions.
The work on the object database system also started in this year. The more complex database system was introduced at high
prices.in this year new client tools for application development were introduced. A number of tools for personal productivity
such as ODBC and Excel/Access were also developed.
Mid 1990’s
The usable internet /WWW was introduced in the middle of this year.it allows remote access to the computer systems. The
concept of Web/DB started to grow. The advancement in the database system was started. Average desktops users began to
use the client-server database.
Late 1990’s
In the late 1990s, the online business was increased resulting in a rise in demand for internet database connectors
like Active server page, Jave Servlets, FrontPage, Dream Weaver. Enterprise Java Beans, Oracle Developer 2000
etc.in this year internet companies worked for Web/Internet connectors. The open-source solutions came online
with online analytic processing (OLAP) come of age with many merchants using point-of-sale (POS) technology
on a daily basis.
Today
Today’s database is used anywhere in our day to day life. In the present day, there are many new players in the non-relational
database space offering particular solutions. Many powerful technologies is introduces in the database. The database is very helpfu
in every field to keep records save. The quick base is an online database on a relational database that gives users of any skill level
the ability to create custom applications using the power of the relational database.
Text is here
1. Query Processor: It interprets the requests (queries) received from end user via an
application program into instructions. It also executes the user request which is received
from the DML compiler.
Query Processor contains the following components –
DML Compiler: It processes the DML statements into low level instruction (machine
language), so that they can be executed.
DDL Interpreter: It processes the DDL statements into a set of table containing meta data
(data about data).
Embedded DML Pre-compiler: It processes DML statements embedded in an application
program into procedural calls.
Query Optimizer: It executes the instruction generated by DML Compiler.
Before the use of computer, a manual file system was used to maintain the records and files. All the data was
stored in files and it makes it easy to find any information. But this system was good only for small organizations
having small number of items.
• It was time consuming.
• Inefficient to maintain the record of big firm having large number
of items.
• It requires a lots of labor work to do.
• It becomes more complex when anyone requires changing the
• information.
• It establishes a digital repository on a server to store and manage the data in the
database.
• It provides a clear view and a logical view of the process manipulating the data.
• DBMS offers automatic backup and recovery of the data.
• DBMS maintains data in a healthy manner in case of failure due to ACID
properties.
• It has the ability to reduce the complicated relationship between the data in the
database.
• It is used for processing and manipulation of the data.
• It provides security of data.
• It can view the database from different viewpoints depending on the needs of the
users.
10 March 2023 Sandeep Gaikwad 37
Advantages and Disadvantages of DBMS
Advantages of DBMS
•Controls database redundancy: All the data is stored in one place, and that recorded in the
database and hence controls the redundancy in the database.
•Data sharing: DBMS allows users with authority to share the data in the database with
multiple users.
•Easy Maintenance: The centralized nature of the database helps in the easy maintenance of
the data.
•Reduce time: It reduces the maintenance need and development time.
•Backup: It automatically backs up data to maintain its integrity in case of failure.
•Multiple user interfaces: It offers a number of user interface to multiple users.
Disadvantages of DBMS
Cost of software and hardware: It requires a number of high powered processors and large size
memory to run DBMS.
Size: a Large amount of storage size is required to run DBMS efficiently.
Complexity: DBMS adds an additional layer of complexity to the data.
Higher impact of failure: DBMS faces a higher risk of losing the data since all the data is stored at
single location and a catastrophic failure can wipe it all.
Data definition: DBMS is used for the creation of definition, their modification, and the
removal of definitions that define the organization of data in the database.
Data updation: DBMS is used for updating the data in the database through the means of
modification, insertion, and deletion of actual data.
Data retrieval: It is used for the retrieval of data by the application to perform various
functions.
User Administration: DBMS is used for registration and monitoring of users, concurrency
control, maintaining data integrity, monitoring performance, enforcing data security, and
recovering data lost due to critical failure.
The DBMS follows certain properties to maintain consistency in the Database. These properties are usually termed ACID
Properties.
Here, atomicity means either the operation should be performed or not performed. i.e., it should complete the operation
on 0% or 100%.
Here DBMS (Database Management System) provides atomicity as a characteristic. This is the most important and
useful characteristic of the DBMS (Database Management System).
Atomicity means the transaction should either be 0% or 100% completed, and consistency means that the change in data
should be reflected everywhere in a database.
Isolation means that multiple transactions can occur independently without the interference of some other transactions.
Durability means that the chances of a successful atomic transaction, i.e., a transaction that has been 100% completed,
should reflect in the Database.
Database users in DBMS can be categorized based on their interaction with the databases.
According to the tasks performed by the database users on the databases, we can categorize them
into seven categories as follows:
1. Database Administrators (DBA)
2. Database Designers
3. System Analysts
4. Application Programmers / Back-End Developers
5. Naive Users / Parametric Users
6. Sophisticated Users
7. Casual Users / Temporary Users
Database Designers : As the name suggests, Database Designers are the users in DBMS
who design and create the structure of the database including triggers, indexes,
schemas, entity relationships, tables, constraints, etc. which complete the database.
Database designers try to gather information depending upon the requirements related to
the database like the layout, looks, database functioning, costing, technologies to be
used & implementation techniques, and finally, they design the final layout of the
database for programmers to code its logic.
System Analysts are the type of database users in DBMS who analyze the requirements of Naive /
Parametric End users. It is their responsibility to check whether all the requirements of end users are
satisfied or not.
Analyzing feasibility, economic and technical aspects are some of the major responsibilities for a
system analyst in DBMS.
Naive / Parametric End Users : Naive users also known as Parametric End users, don't have any
knowledge of DBMS but still frequently use the database applications to get the desired results. With the
help of the interface provided by the DBMS applications, Naive users mostly use the database to fill in or
retrieve the information (view level of the database).
Sophisticated Users : Sophisticated users are the type of database users in DBMS who know DBMS (DDL &
DML commands) and are familiar with the database. Sophisticated users can be business analysts,
engineers, scientists, system analysts, etc.
Application Programmers : Application Programmers also known as Back-End Developers, are computer professional
users who are responsible for developing the application programs (C, C++, Java, PHP, Python, etc.) or the user
interface so that other users can use these applications to interact with the database.
7.Casual Users / Temporary Users : Casual users also known as temporary users, are the type of database users in DBMS
who frequently or occasionally use the database services. Whenever these users try to access the database, they want all
the information sorted in place.
7. Casual/Temporary users have little knowledge about DBMS and each time they try to access the database, they
require new information.