Database Management System
Database Management System
System
Outline
• Managing Files: Basic Concepts
• Database Definition
• Database Management Systems
• Database Models
• Data Mining
2
Managing Files: Basic Concepts
• Data storage
hierarchy - levels of
data stored in a
computer:
– Database
– Files
– Records
– Fields
– Characters
(bytes)
– Bits
3
Managing Files: Basic Concepts
• Database – an organized collection of integrated files.
• File – a collection of related records.
• Record – a collection of related fields. Often called a
row.
• Field – a unit (individual piece) of data consisting of
one or more characters (bytes). Often called a
column.
• Character (byte) – a letter, number, or special
character.
• Key field – a field that is chosen to uniquely identify a
record so that it can be easily retrieved and
processed.
4
Managing Files: Basic Concepts
Field Name
Field
Record
5
Database Definition
• Structured set of data held in a computer.
(Pocket Oxford Dictionary)
8
Database Models
• Hierarchical database – fields or records are arranged
in related groups, resembling a family tree, with child
(lower-level) records subordinate to parent (higher-
level) records.
9
Database Models
• Network database – similar to a hierarchical
database, but each child record can have more than
one parent record.
10
Database Models
• Relational database – a database which relates
(connects) data in different files through the use of a
key field, or common data element.
11
Database Models
• SQL (Structured Query Language) – the standard
language used to create, modify, maintain, and query
relational databases.
12
Data Mining
• Data mining (DM) – the computer-assisted process of
sifting through and analyzing vast amounts of data in
order to extract meaning and discover new
knowledge.
– Searches for trends and patterns
– Makes predictions on events
– Supplies ideas for improving business
• Data mining begins with acquiring data and preparing
it for what is known as the data warehouse by the
following steps:
– Data sources
– Data fusion and cleansing
– Data and meta-data
– Data warehouse
13
Data Mining
1. Data sources
• Data may come from a number of sources:
– Point-of-sale transactions in flat files on
mainframes;
– Databases of all kinds;
– Other, e.g., news articles, online articles, etc.;
and
– Data from data warehouses
14
Data Mining
2. Data fusion and cleansing
• Data from diverse sources must be fused\join
together, then put through a process known as data
cleansing, or scrubbing.
• The data may be of poor quality, full of errors and
inconsistencies
• Putting together the data from various sources and
then “scrubbing” the data to eliminate errors and
inconsistencies.
15
Data Mining
3. Data and meta-data
• Cleaned-up data and meta-data (data about data)
• The cleansing process yields both the cleaned-up
data and a variation of it called meta-data.
• Meta-data shows the origins of the data, the
transformations it has undergone, and summary
information about it, which makes it more useful
than the cleansed but unintegrated, unsummarized
data.
16
Data Mining
4. Data warehouse
17
End of Chapter