Data Abstraction
Data Abstraction
Database systems are made-up of complex data structures. To ease the user interaction with database,
the developers hide internal irrelevant details from users. This process of hiding irrelevant details
from user is called data abstraction.
We have three levels of abstraction (view level, logical level and physical level)-The three level
architecture
DBMS – Three Level Architecture(Views)
- This level describes what data is stored in the database and the relationships among the
data.
- This level contains the logical structure of the entire database as seen by DBA - This
level must not contain any storage dependent details.
- Hides details of the physical(internal) level.
- Database constraints and security are also implemented in this level of architecture. This level
is maintained by DBA (database administrator).
- The DBMS maps data access between the conceptual to physical schemas automatically
For example, in case of student database Roll No, Name, Class, Address etc. are attributes of entity
student.
DATABASE SCHEMA
A database schema is the skeleton structure that represents the logical view of the entire database. It
defines how the data is organized and how the relations among them are associated.
It’s the database designers who design the schema to help programmers understand the database and
make it useful.
The process of creating a database schema is called data modelling.
Logical Database Schema- This schema describes what data is stored in the database
and the relationships among the data. It defines tables, views, and integrity constraints.
View schema- This schema generally describes the end user interaction with the database
system.
For example: In the following diagram, we have a schema that shows the relationship between
three tables: Course, Student and Section. The diagram only shows the design of the database; it
doesn’t show the data present in those tables. Schema is only a structural view(design) of a
database as shown in the diagram below.
DATA INDEPENDENCE
It is the property of the database which tries to ensure that if we make any change in any level of
schema of the database, the schema immediately above it would require minimal or no need of
change. It removes the need for additional amount of work needed in adopting the single change
into all the levels above.
Data independence can be classified into the following two types:
1. Physical Data Independence: This means that for any change made in the physical
schema, the need to change the logical schema is minimal. This is practically easier to
achieve.
2. Logical Data Independence: This means that for any change made in the logical schema,
the need to change the external schema is minimal; this is a little difficult to achieve.
DBMS INSTANCE
Definition of instance: The data stored in database at a particular moment of time is called instance
of database.
Database schema is a representation of a planned database and does not actually contain data while
a database instance is a snapshot of an actual database at a particular time.
For example, let’s say we have a single table student in the database, today the table has 100 records,
so today the instance of the database has 100 records. Let’s say we are going to add another 100
records in this table by tomorrow so the instance of database tomorrow will have 200 records in
table. In short, at a particular moment the data stored in database is called the instance, that changes
over time when we add or delete data from the database.
ii) Parallel network database system – This system has the advantage of improving
processing input and output speeds. Majorly used in the applications that have query to
larger database. It holds the multiple central processing units and data storage disks in
parallel.
iii) Distributed database system – In this data and the DBMS software are distributed over
several sites but connected to the single computer.
i) Online transaction processing (OLTP) DBMS – They manage the operational data.
Database server must be able to process lots of simple transactions per unit of time.
Transactions are initiated in real time, in simultaneous by lots of user and applications hence
it must have high volume of short, simple queries.
ii) Online analytical processing (OLAP) DBMS – They use the operational data for tactical
and strategical decision making. They have limited users deal with huge amount of data and
complex queries.
iii) Big data and analytics DBMS – To cope with big data new database technologies have
been introduced. One such is NoSQL (not only SQL) which abandons the well-known
relational database scheme.
iv) Multimedia DBMS – Stores data such as text, images, audio, video and 3D games which
are usually stored in binary large object
SAMPLE KNEC QUESTIONS
July 2017
1. Describe two ways of classifying database management systems. (4 marks)
2. Outline three types of end users in a database environment. (3 marks)
3. Define the term data abstraction as used in database management system (2 marks)
4. Explain two reasons that may lead to the migration from file based to database centered
approach. (4 marks)
5. Outline the roles of each of the following in a database management system
environment.
(4 marks)
i) Hardware
ii) Data
6. Explain two disadvantages of database approach. (4 marks)
7. With the aid of a diagram explain two paradigms the early models of databases were
based on. (4 marks)
July 2016
1. Distinguish between active data dictionary and passive data dictionary as used in
databases.
(4 marks)
2. An organization intends to acquire a database application to manage its data. Describe
three professionals that may be required. (4 marks)
3. State two similarities between hierarchical and network database models. (4 marks)
4. Explain two reasons for one using online database. (4 marks)