Database Chap1
Database Chap1
Fundamentals of Design,
Implementation, and
Management
Ninth Edition
Carlos Coronel, Steven Morris, and Peter Rob
Chapter 1
Introduction to Database Systems
In this chapter, you will learn:
• The difference between data and information
• What a database is, the various types of databases, and why
they are valuable assets for decision making
• The importance of database design
• How modern databases evolved from file systems
• The main components of the database system
• The main functions of a database management system
(DBMS)
Objectives 2
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Good decisions require good information derived from
raw facts
• Data is managed most efficiently when stored in a
database
• Databases evolved from computer file systems
• Understanding file system characteristics is important
Introduction 3
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Databases solve many of the problems encountered in
data management
• Used in almost all modern settings involving data
management:
• Business
• Research
• Administration
• Important to understand how databases work and interact
with other applications
Why Databases? 4
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
5
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Data are raw facts
• Raw data must be formatted for storage, processing, and
presentation
• Information is the result of processing raw data to reveal
meaning
• Information requires context to reveal meaning
• Data are the foundation of information, which is the
bedrock of knowledge
Data vs.
Information 6
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Data: building blocks of information
• Information produced by processing data
• Information used to reveal meaning in data
• Accurate, relevant, timely information is the key to
good decision making
• Good decision making is the key to organizational
survival
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
8
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Database: shared, integrated computer structure that stores a
collection of:
• End-user data: raw facts of interest to end user
• Metadata: data about data
• Provides description of data characteristics and relationships in data
• Complements and expands value of data
• Database management system (DBMS): collection of
programs
Introducing
• Manages structure and controlsthe
access to data
Database 9
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
Database
Terminology 10
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• DBMS is the intermediary between the
user and the database
• Database structure stored as file collection
• Can only access files through the DBMS
• DBMS enables data to be shared
• DBMS integrates many users’ views of the data
Role and
Advantages of the
DBMS 11
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
12
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Advantages of a DBMS:
• Improved data sharing
• Improved data security
• Better data integration
• Minimized data inconsistency
• Improved data access
• Improved decision making
Advantages of the
DBMS (cont’d.) 13
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
14
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Databases can be classified according to:
• Number of users
• Database location(s)
• Expected type and extent of use
• Single-user database supports only one user at a time
• Desktop database: single-user; runs on PC
• Multiuser database supports multiple users at the same time
Types of
• Workgroup and enterprise databases
Databases 15
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
16
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Centralized database: data located at a single site
• Distributed database: data distributed across several different sites
• Operational database: supports a company’s day-to-day operations
• Transactional or production database
• Data warehouse: stores data used for tactical or strategic decisions
Types of Databases
(cont’d.) 17
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Unstructured data exist in their original state
• Structured data result from formatting
• Structure applied based on type of processing to be performed
• Semi-structured data have been processed to some extent
• Extensible Markup Language (XML) represents data elements in
textual format
• XML database supports semi-structured XML data
Types of Data
(cont'd.) 18
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
19
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Database design focuses on design of database structure
used for end-user data
• Designer must identify database’s expected use
• Well-designed database:
• Facilitates data management
• Generates accurate and valuable information
• Poorly designed database:
• Causes difficult-to-trace errors
Why Database
Design Is Important 20
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
File System Data
Processing 21
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
22
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• File systems were an improvement over manual system
• File systems used for more than two decades
• Understanding the shortcomings of file systems aids in
development of modern databases
• Many problems not unique to file systems
• Even simple file system retrieval task required extensive
programming
• Ad hoc queries impossible
• Changing existing structure difficult
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Security features difficult to program
• Often omitted in file system environments
• Summary of file system limitations:
• Requires extensive programming
• Cannot perform ad hoc queries
• System administration is complex and difficult
• Difficult to make changes to existing structures
• Security features are likely to be inadequate
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Structural dependence: access to a file is dependent on its
own structure
• All file system programs must be modified to conform to a new
file structure
• Structural independence: change file structure without
affecting data access
• Data dependence: data access changes when data storage
characteristics change
• Data independence: data storage characteristics do not affect
Problems with File System:
data access
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Practical significance of data dependence is difference
between logical and physical format
• Logical data format: how human views the data
• Physical data format: how computer must work with data
• Each program must contain:
• Lines specifying opening of specific file type
• Record specification
• Field definitions
Problems with File System:
Structural and Data
Dependence 26
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• File system structure makes it difficult to combine data from
multiple sources
• Vulnerable to security breaches
• Organizational structure promotes storage of same data in
different locations
• Islands of information
• Data stored in different locations is unlikely to be updated
consistently
• Data redundancy: same data stored unnecessarily in
different places
Problems with File System: 27
Data Redundancy
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Data inconsistency: different and conflicting versions of
same data occur at different places
• Data anomalies: abnormalities when all changes in
redundant data are not made correctly
• Update anomalies
• Insertion anomalies
System : Data
Inconsistency (cont'd.) 28
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Most users lack the skill to properly design databases,
despite multiple personal productivity tools being
available
• Data-modeling skills are vital in the data design process
• Good data modeling facilitates communication between
the designer, user, and the developer
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Database system consists of logically related data stored in a
single logical data repository
• May be physically distributed among multiple storage facilities
• DBMS eliminates most of file system’s problems
• Current generation stores data structures, relationships
between structures, and access paths
• Also defines, stores, and manages all access paths and components
Database Systems 30
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
31
The Database
System Environment 32
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Hardware: all the system’s physical devices
• Software: three types of software required:
• Operating system software
• DBMS software
• Application programs and utility software
The Database
System Environment
(cont'd.) 35
The Database
System Environment
(cont'd.) 37
DBMS Functions 38
DBMS Functions
(cont'd.) 42
DBMS Functions
(cont'd.) 43
DBMS Functions
(cont'd.) 44
DBMS Functions
(cont'd.) 45
(cont'd.) 46
© 2010 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the
U.S. Edition. May not be scanned, copied, duplicated, or posted to a publicly accessible website, in whole or in part.
• Database system provides a framework in which strict
procedures and standards enforced
• Role of human changes from programming to managing
organization’s resources
• Database system enables more sophisticated use of the
data
• Data structures created within the database and their
Managing the
relationships determine effectiveness
Database System: A
Shift in Focus 48
Database System: A
Shift in Focus
(cont'd.) 49
Summary 50
Summary (cont'd.) 51
Summary (cont'd.) 52
Summary (cont'd.) 53