Introduction To Databases
Introduction To Databases
DATABASE
RDBMS
The father of Relational Data Model was Dr. E.F. CODD. He developed the relational data
model by taking the concept from Relational Algebra in June-1970.
Relational Data Model is nothing but 12 Rules which are named after Codd as Codd Rules.
According to Codd a package can be called as RDBMS only if it satisfies the Codd Rules.
ORACLE
Every Oracle Database Contains Logical and Physical Structures. Logical Structures are
tablespaces, Schema objects, extents and segments. Physical Structures are Datafiles, Redo Log
Files, Control File.
A database is divided into logical storage units called tablespaces, which group related logical
structures together. Each Tablespace in turn consists of one are more datafiles.
All the tables and other objects are stored in tablespace logically, but physically they are stored
in the datafiles associated with the tablespace.
Every Oracle database has a set of two or more redo log files. The set of redo log files for a
database is collectively known as the database's redo log. A redo log is made up of redo entries
(also called redo records).
The primary function of the redo log is to record all changes made to data. If a failure prevents
modified data from being permanently written to the datafiles, the changes can be obtained from
the redo log so work is never lost.
Every Oracle database has a control file. A control file contains the database name and locations
of all datafiles and redo log files.
Every Oracle database also has a Parameter File. Parameter file contains the name of the
Database, Memory Settings and Location of Control file.
CODD’S RULES
1 Information Rule: All information in a relational database including table names,
column names are represented by values in tables. This simple view of data speeds
design and learning. User productivity is improved since knowledge of only one language
is necessary to access all data such as description of the table and attribute definitions,
integrity constraints. Action can be taken when the constraints are violated. Access to
data can be restricted. All these information are also stored in tables.
2 Guaranteed Access Rule: Every piece of data in a relational database, can be
accessed by using combination of a table name, a primary key value that identifies the
row and column name which identified a cell. User productivity is improved since there
is no need to resort to using physical pointers addresses. Provides data independence.
Possible to retrieve each individual piece of data stored in a relational database by
specifying the name of the table in which it is stored, the column and primary key which
identified the cell in which it is stored.
3 Systematic Treatment of Nulls Rule: The RDBMS handles records that have unknown
or inapplicable values in a pre-defined fashion. Also, the RDBMS distinguishes between
zeros, blanks and nulls in the records hand handles such values in a consistent manner
that produces correct answers, comparisons and calculations. Through the set of rules for
handling nulls, users can distinguish results of the queries that involve nulls, zeros and
blanks. Even though the rule doesn’t specify what should be done in the case of nulls it
specifies that there should be a consistent policy in the treatment of nulls.
4 Active On-line catalog based on the relational model: The description of a database
and in its contents are database tables and therefore can be queried on-line via the data
manipulation language. The database administrator’s productivity is improved since the
changes and additions to the catalog can be done with the same commands that are used
to access any other table. All queries and reports can also be done as any other table.
5 Comprehensive Data Sub-language Rule: A RDBMS may support several languages.
But at least one of them should allow user to do all of the following: define tables and
views, query and update the data, set integrity constraints, set authorizations and define
transactions. User productivity is improved since there is just one approach that can be
used for all database operations. In a multi-user environment the user does not have to
worry about the data integrity an such things, which will be taken care by the system.
Also, only users with proper authorization will be able to access data.
6 View Updating Rule: Any view that is theoretically updateable can be updated using
the RDBMS. Data consistency is ensured since the changes made in the view are
transmitted to the base table and vice-versa.
7 High-Level Insert, Update and Delete: The RDBMS supports insertions, updation and
deletion at a table level. The performance is improved since the commands act on a set of
records rather than one record at a time.
8 Physical Data Independence: The execution of adhoc requests and application
programs is not affected by changes in the physical data access and storage methods.
Database administrators can make changes to the physical access and storage method
which improve performance and do not require changes in the application programs or
requests. Here the user specified what he wants an need not worry about how the data is
obtained.
9 Logical Data Independence: Logical changes in tables and views such adding/deleting
columns or changing fields lengths need not necessitate modifications in the programs or
in the format of adhoc requests. The database can change and grow to reflect changes in
reality without requiring the user intervention or changes in the applications. For
example, adding attribute or column to the base table should not disrupt the programs or
the interactive command that have no use for the new attribute.
10 Integrity Independence: Like table/view definition, integrity constraints are stored in
the on-line catalog and can therefore be changed without necessitating changes in the
application programs. Integrity constraints specific to a particular RDB must be
definable in the relational data sub-language and storable in the catalog. At least the
Entity integrity and referential integrity must be supported.
11 Distribution Independence: Application programs and adhoc requests are not affected
by change in the distribution of physical data. Improved systems reliability since
application programs will work even if the programs and data are moved in different
sites.
12 No subversion Rule: If the RDBMS has a language that accesses the information of a
record at a time, this language should not be used to bypass the integrity constraints.
This is necessary for data integrity.
According to Dr. Edgar. F. Codd, a relational database management system must be able to
manage the database entirely through its relational capabilities.
A table is the data structure that holds data in a relational database. A table is composed of rows
and columns.
A table in Oracle Ver. 7.3 can have maximum 255 Columns and in Oracle Ver. 8 and above a
table can have maximum 1000 columns. Number of rows in a table is unlimited in all the
versions.
A table can represent a single entity that you want to track within your system. This type of a
table could represent a list of the employees within your organization, or the orders placed for
your company's products.
A table can also represent a relationship between two entities. This type of a table could portray
the association between employees and their job skills, or the relationship of products to orders.
Within the tables, foreign keys are used to represent relationships.
Although some well designed tables could represent both an entity and describe the relationship
between that entity and another entity, most tables should represent either an entity or a
relationship.
The following sessions explain how to create, alter, and drop tables. Some simple guidelines to
follow when managing tables in your database are included.
Designing Tables
Be consistent in abbreviations and in the use of singular and plural forms of table names and
columns.
Document the meaning of each table and its columns with the COMMENT command.
Cluster tables whenever appropriate, to conserve storage space and optimize performance of SQL
statements.
Before creating a table, you should also determine whether to use integrity constraints. Integrity
constraints can be defined on the columns of a table to enforce the business rules of your
database automatically.
Before creating a Table you have to decide what type of data each column can contain. This is
known as datatype. Lets Discuss what datatypes are available in Oracle.
Datatypes
A datatype associates a fixed set of properties with the values that can be used in a column of a
table or in an argument of a procedure or function. These properties cause Oracle to treat values
of one datatype differently from values of another datatype. For example, Oracle can add values
of NUMBER datatype, but not values of RAW datatype.
Character datatypes
oCHAR
oNCHAR
oVARCHAR2 and VARCHAR
oNVARCHAR2
oCLOB
oNCLOB
oLONG
NUMBER datatype
Binary datatypes
oBLOB
oBFILE
oRAW
oLONG RAW
Another datatype, ROWID, is used for values in the ROWID pseudocolumn, which represents the
unique address of each row in a table.
The following table summarizes the information about each Oracle built-in datatype.
VARCHAR2 and NVARCHAR2 datatypes store variable-length character strings. (The VARCHAR
datatype is synonymous with the VARCHAR2 datatype.)
NCHAR and NVARCHAR2 datatypes store Unicode character data only.
CLOB and NCLOB datatypes store single-byte and multibyte character strings of up to four
gigabytes.
The LONG datatype stores variable-length character strings containing up to two gigabytes, but with
many restrictions.
This datatype is provided for backward compatibility with existing applications; in general,
new applications should use CLOB and NCLOB datatypes to store large amounts of character
data, and BLOB and BFILE to store large amounts of binary data.
When deciding which datatype to use for a column that will store alphanumeric data in a table,
consider the following points of distinction:
To store data more efficiently, use the VARCHAR2 datatype. The CHAR datatype blank-pads and
stores trailing blanks up to a fixed column length for all column values, while the VARCHAR2
datatype does not add any extra blanks.
For example if you define empname as char(20) then if you store names like “Sami” then name will
occupy 20 bytes( 4 bytes for characters “Sami” and 16 blank spaces)
And if you define empname as varchar2(20) then if you store names like “Sami” then oracle will take 4
bytes only.
Use the CHAR datatype when you require ANSI compatibility in comparison semantics (when
trailing blanks are not important in string comparisons). Use the VARCHAR2 when trailing blanks are
important in string comparisons.
The CHAR and VARCHAR2 datatypes are and will always be fully supported. At this time, the
VARCHAR datatype automatically corresponds to the VARCHAR2 datatype and is reserved for future
use.
Use the NUMBER datatype to store real numbers in a fixed-point or floating-point format. Numbers
using this datatype are guaranteed to be portable among different Oracle platforms, and offer up
to 38 decimal digits of precision. You can store positive and negative numbers of magnitude
1 x 10-130 through 9.99 x10125, as well as zero, in a NUMBER column.
You can specify that a column contains a floating-point number, for example:
distance NUMBER
Or, you can specify a precision (total number of digits) and scale (number of digits to right of
decimal point):
price NUMBER (8, 2)
Although not required, specifying precision and scale helps to identify bad input values. If a
precision is not specified, the column stores values as given. The following table shows
examples of how data different scale factors affect storage.
Use the DATE datatype to store point-in-time values (dates and times) in a table. The DATE
datatype stores the century, year, month, day, hours, minutes, and seconds.
Use the TIMESTAMP datatype to store precise values, down to fractional seconds. For example, an
application that must decide which of two events occurred first might use TIMESTAMP. An
application that needs to specify the time for a job to execute might use DATE.
Date Format
For input and output of dates, the standard Oracle default date format is DD-MON-RR. For
example:
'13-NOV-1992'
To change this default date format on an instance-wide basis, use the NLS_DATE_FORMAT
parameter. To change the format during a session, use the ALTER SESSION statement. To enter
dates that are not in the current default date format, use the TO_DATE function with a format
mask. For example:
Be careful using a date format like DD-MON-YY. The YY indicates the year in the current century.
For example, 31-DEC-92 is December 31, 2092, not 1992 as you might expect. If you want to
indicate years in any century other than the current one, use a different format mask, such as the
default RR.
Time Format
Time is stored in 24-hour format, HH24:MI:SS. By default, the time in a date field is 12:00:00
A.M. (midnight) if no time portion is entered. In a time-only entry, the date portion defaults to
the first day of the current month. To enter the time portion of a date, use the TO_DATE function
with a format mask indicating the time portion, as in:
Once you have designed the table and decided about datatypes use the following SQL command
to create a table.
To insert rows in the table you can use SQL INSERT command.
For example the following statement creates a row in the above table.
To insert rows continuously in SQL Plus you can give the following command.
These &Empno, &name and &sal are known as substitution variables. That is SQLPlus will prompt you
for these values and then rewrites the statement with supplied values.
To see the rows you have inserted give the following command.
To see the structure of the table i.e. column names and their datatypes and widths. Give the following
command.
SQL>desc emp
To see how many tables are in your schema give the following command.
or