0% found this document useful (0 votes)
3 views

Database Management System

The document outlines key concepts in database management, including definitions of databases, database management systems (DBMS), and various database models such as hierarchical, network, relational, and object-oriented. It also discusses the advantages of using a DBMS, such as improved data integrity and security, and introduces data mining as a process for analyzing large data sets to extract meaningful insights. The document emphasizes the importance of data preparation and cleansing in the data mining process.

Uploaded by

hassan.javed
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

Database Management System

The document outlines key concepts in database management, including definitions of databases, database management systems (DBMS), and various database models such as hierarchical, network, relational, and object-oriented. It also discusses the advantages of using a DBMS, such as improved data integrity and security, and introduces data mining as a process for analyzing large data sets to extract meaningful insights. The document emphasizes the importance of data preparation and cleansing in the data mining process.

Uploaded by

hassan.javed
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 18

Database Management

System
Outline
• Managing Files: Basic Concepts
• Database Definition
• Database Management Systems
• Database Models
• Data Mining

2
Managing Files: Basic Concepts
• Data storage
hierarchy - levels of
data stored in a
computer:
– Database
– Files
– Records
– Fields
– Characters
(bytes)
– Bits
3
Managing Files: Basic Concepts
• Database – an organized collection of integrated files.
• File – a collection of related records.
• Record – a collection of related fields. Often called a
row.
• Field – a unit (individual piece) of data consisting of
one or more characters (bytes). Often called a
column.
• Character (byte) – a letter, number, or special
character.
• Key field – a field that is chosen to uniquely identify a
record so that it can be easily retrieved and
processed.
4
Managing Files: Basic Concepts

Field Name

Field

Record

5
Database Definition
• Structured set of data held in a computer.
(Pocket Oxford Dictionary)

• An organized collection of related (integrated)


files. (Williams and Sawyer)

• A database is a collection of related data or


facts. (Peter Norton)
6
Database Management Systems
• Database management system (DBMS) – programs
that control the structure of a database and access to
the data. (Williams and Sawyer)
• DBMS is a collection of programs that control the
database. (Peter Norton)
• Advantages of DBMSes:
– File sharing
– Reduced data redundancy
• Data redundancy – situation in which the same data fields
appear in many different files and often in different
formats.
– Improved data integrity
• Data integrity – measure of how accurate, consistent, and
up-to-date data is.
– Increased security
7
Database Models
• Just as files can be organized in different
ways, so databases can be organized in ways
to best fit their use.
• The four most common arrangements are:
– Hierarchical
– Network
– Relational
– Object-oriented

8
Database Models
• Hierarchical database – fields or records are arranged
in related groups, resembling a family tree, with child
(lower-level) records subordinate to parent (higher-
level) records.

9
Database Models
• Network database – similar to a hierarchical
database, but each child record can have more than
one parent record.

10
Database Models
• Relational database – a database which relates
(connects) data in different files through the use of a
key field, or common data element.

11
Database Models
• SQL (Structured Query Language) – the standard
language used to create, modify, maintain, and query
relational databases.

• SQL is pronounced as “sequel.”


• How did this acronym get such an unlikely pronunciation?
• The first structured query language was developed by IBM
in the 1970s; its product name was “Sequel2.”

12
Data Mining
• Data mining (DM) – the computer-assisted process of
sifting through and analyzing vast amounts of data in
order to extract meaning and discover new
knowledge.
– Searches for trends and patterns
– Makes predictions on events
– Supplies ideas for improving business
• Data mining begins with acquiring data and preparing
it for what is known as the data warehouse by the
following steps:
– Data sources
– Data fusion and cleansing
– Data and meta-data
– Data warehouse
13
Data Mining
1. Data sources
• Data may come from a number of sources:
– Point-of-sale transactions in flat files on
mainframes;
– Databases of all kinds;
– Other, e.g., news articles, online articles, etc.;
and
– Data from data warehouses

14
Data Mining
2. Data fusion and cleansing
• Data from diverse sources must be fused\join
together, then put through a process known as data
cleansing, or scrubbing.
• The data may be of poor quality, full of errors and
inconsistencies
• Putting together the data from various sources and
then “scrubbing” the data to eliminate errors and
inconsistencies.
15
Data Mining
3. Data and meta-data
• Cleaned-up data and meta-data (data about data)
• The cleansing process yields both the cleaned-up
data and a variation of it called meta-data.
• Meta-data shows the origins of the data, the
transformations it has undergone, and summary
information about it, which makes it more useful
than the cleansed but unintegrated, unsummarized
data.
16
Data Mining
4. Data warehouse

• A special database of cleaned up data and meta-


data.
• Both the data and the meta-data are sent to the
data warehouse.

17
End of Chapter

You might also like