Chapter 01
Chapter 01
Chapter 1
Database Systems
1
Learning Objectives
●
Define the difference between data and information
●
Describe what a database is, various types, and why they are valuable
assets for decision making
●
Explain the importance of database design
●
See how modern databases evolved from file systems
●
Understand flaws in file system data management
●
Outline the main components of the database system
●
Describe the main functions of a database management system (DBMS)
2
Why Databases?
●
Data are generated every day
– Need a way to store
– Need a way to access
●
Databases save data and allow access
3
Key Definitions
●
Data ●
Knowledge
– Raw facts – Information and facts about a
– Unprocessed subject
– Building blocks of information – “Familiarity, awareness and
understanding of information”
●
Information as it applies in the context
– Data, processed and put into – New knowledge can build
context upon old knowledge
– Basis of knowledge
4
Valuable Information
●
Valuable information is necessary for making good
decisions
●
Attributes of valuable information
– Timely
– Accurate
– Relevant
●
Databases support all three
5
Ethical Issues
●
Many issues with the amount of data collected
●
Everything is tracked
– Data are being stored in private hands
●
Is this good?
6
Some More Definitions
●
Data Management
– The process of data generation, storage, and retrieval
– Includes: addition, deletion, modification and listing of data
●
Metadata
– Data about data
– Includes characteristics and relationships
7
And Some More
●
Database
– A system that stores data and metadata
●
Database Management System (DBMS)
– Usually computerized
– “The collection of programs that manages the database
structure and controls access to the data stored in the
database.”
8
DBMS
9
Advantages of a DBMS
●
Improved data sharing
●
Improved data security
●
Better data integration
●
Minimized data inconsistency
●
Improved data access
●
Improved decision making
●
Increased end-user productivity
10
Types of Databases
●
See section 1-3b in text
●
Users: single, mutliuser, workgroup, enterprise
●
Location: centralized, distributed, cloud
●
Data stored: general purpose, discipline specific, operational
●
Analytic: data warehouse, online analytical processing (OLAP),
business intelligence
●
Structure: structured, semistructured, unstructured, extensible
markup language (XML)
11
Database Design
●
Designing the database structure
●
Well-designed database
– Facilitates data management and generates accurate and
valuable information
●
Poorly designed database
– Causes difficult-to-trace errors that may lead to poor
decision making
12
Design Issues
13
Improved Design
14
In the Beginning...
●
Manual file systems
– Paper and pencil
– Files, folders, filing cabinets
●
Early computerized systems
– Formatted files, many formats
– Typical file, text with “columns”
– See Customer.txt example in D2L
●
Happening again today with PCs and spreadsheets
15
Some Definitions
16
File System in Use
17
Issues with File Systems
●
Many files with related and overlapping data
●
No consistent, centralized way to control data
●
Files owned by user/department
●
Lengthy development times
●
Difficulty getting quick answers
– Need to write programs to for ad hoc queries
18
More Issues
●
Complex system administration
●
Lack of security
●
Limited data sharing
●
Extensive programming
– Change a file, EVERY program that uses it has to be
changed
19
Even More Issues
●
Structural Dependence
●
Data Dependence
●
Data Redundancy
●
Data Anomalies
20
Structural Dependence
●
Structural dependence
– Access to a file is dependent on its own structure
– All file system programs are modified to conform to a
new file structure
●
Structural independence
– File structure is changed without affecting the
application’s ability to access the data
21
Data Dependence
●
Data dependence
– Data access changes when data storage
characteristics change
●
Data independence
– Data storage characteristics are changed without
affecting the program’s ability to access the data
22
Data Redundancy
●
Unnecessarily storing the ●
Poor data security
same data at different ●
Data inconsistency
places
●
Islands of information (i.e.,
●
Data-entry errors
scattered data locations) ●
Data integrity problems
●
Increases the probability of
having different versions of
the same data
23
Data Anomalies
●
Develop when not all of the required changes in
the redundant data are made successfully
●
Update anomalies
●
Insertion anomalies
●
Deletion anomalies
24
DBMS vs. File Systems
25
Database System Environment
26
DBMS Functions
●
Data dictionary management
– Data dictionary: stores definitions of data elements and their relationships
●
Data storage management
– Performance tuning ensures efficient performance
●
Data transformation and presentation
– Data is formatted to conform to logical expectations
●
Security management
– Enforces user security and data privacy
27
DMBS Functions
●
Multiuser access control
– Sophisticated algorithms ensure that multiple users can access
the database concurrently without compromising its integrity
●
Backup and recovery management
– Enables recovery of the database after a failure
●
Data integrity management
– Minimizes redundancy and maximizes consistency
28
DBMS Functions
●
Database access languages and application programming
interfaces
– Query language: lets the user specify what must be done without
having to specify how
– Structured Query Language (SQL): de facto query language and data
access standard supported by the majority of DBMS vendors
●
Database communication interfaces
– Accept end-user requests via multiple, different network environments
29
Advantages and Disadvantages
●
Lot’s of advantages
– Fixes disadvantages from file systems
●
Disadvantages
– Increased costs
– Management complexity
– Maintaining currency
– Vendor dependence
– Frequent upgrade/replacement cycles
30
Database Jobs
31