0% found this document useful (0 votes)
41 views

01introduction ADVANCE DATABASE SYSTEM

This document provides an introduction to a lecture on database systems. It discusses how technological advances like increased storage, faster processors, better networking, new sensors, and machine learning are resulting in more ubiquitous data. It defines key database concepts like data, databases, database management systems, and database systems. It also outlines typical database management system functionalities like defining and constructing databases, querying, modifying, and sharing data.

Uploaded by

Efrem Girma
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
41 views

01introduction ADVANCE DATABASE SYSTEM

This document provides an introduction to a lecture on database systems. It discusses how technological advances like increased storage, faster processors, better networking, new sensors, and machine learning are resulting in more ubiquitous data. It defines key database concepts like data, databases, database management systems, and database systems. It also outlines typical database management system functionalities like defining and constructing databases, querying, modifying, and sharing data.

Uploaded by

Efrem Girma
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 27

CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Outline
 Data is ubiquitous
 Basic Definitions
 Types of Databases and Database Applications
 Typical DBMS Functionalities
 Example of a Database (UNIVERSITY)
 Main Characteristics of the Database Approach
 Database Users
 Advantages of Using the Database Approach
 History of Database Systems
 Extending Database Capabilities
 When Not to Use Databases

CSIE30600/CSIEB0290 Database Systems Introduction 2

Note 1
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Data & Technological Advances


 Five classes of technological advances are changing our
relationship with data:
 More storage space
 allows us to keep more data
 Faster processor (and memory) speeds
 allows us to access and process more data
 Better networking
 allows us to share data more efficiently
 Different “sensors”
 allows us to access new kinds of data
 Better processing methods (AI & machine learning)
 allows us to process data more intelligently
CSIE30600/CSIEB0290 Database Systems Introduction 3

HDD Areal Density Growth

CSIE30600/CSIEB0290 Database Systems Introduction 4

Note 2
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

HDD Cost over Time

CSIE30600/CSIEB0290 Database Systems Introduction 5

HDD Cost Trends

CSIE30600/CSIEB0290 Database Systems Introduction 6

Note 3
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Processor Technology

http://i1.wp.com/thetechguysblog.com/wp-content/uploads/2015/01/Chart-III-8-Moores-Law-Over-199-Years-And-Going-Strong.png

CSIE30600/CSIEB0290 Database Systems Introduction 7

Network Capacity Growth

CSIE30600/CSIEB0290 Database Systems Introduction 8

Note 4
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

New Data from Sensor Web

CSIE30600/CSIEB0290 Database Systems Introduction 9

Data Everywhere
 Airline flight management system
 Financial data
 Commercial store (eg, WalMart) data
 Department of Motor Vehicles
 Surveillance video
 University student records
 Baseball results
 Web sites
 Medical records
 ...

CSIE30600/CSIEB0290 Database Systems Introduction 10

Note 5
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Need Effective Data Management


 Effective management can make an
organization’s data a valuable asset(資產).
 Ineffective policies can make an
organization’s data a liability(負債).
 Big data analytics is becoming the gold
mine of the 21st century.
 The paradigm has been extended from
database systems to data science.

CSIE30600/CSIEB0290 Database Systems Introduction 11

Basic Concepts
 Data: Known facts that can be recorded and have an implicit
meaning.
 Database: Collection of interrelated data
 Mini-World or Universe of Discourse (UoD): Some part of the
real world about which data is stored in a database.
 Database Management System (DBMS): A collection of
programs to facilitate the creation and maintenance of a
database.
 Database System = DBMS + Database
 A database system contains information about a particular
enterprise.
 A database system provides an environment that is both
convenient and efficient to use.

CSIE30600/CSIEB0290 Database Systems Introduction 12

Note 6
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Database Management System (DBMS)


 DBMS is:
 A collection of software programs
 General purpose
 DBMS enables users to:
 Define DB
 Construct DB
 Change (or update) DB
 Query the data in DB
 Share DB
 DBMS maintains the integrity of DB

CSIE30600/CSIEB0290 Database Systems Introduction 13

DBMS Popularity Ranking

http://db-engines.com/en/ranking/relational+dbms
CSIE30600/CSIEB0290 Database Systems Introduction 14

Note 7
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

DBMS Ranking Trend

http://db-engines.com/en/ranking_trend/relational+dbms
CSIE30600/CSIEB0290 Database Systems Introduction 15

Growing Popularity of NoSQL

CSIE30600/CSIEB0290 Database Systems Introduction 16

Note 8
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Main Goals of DB Course


 To understand how to use a DBMS
 How to design and create DB, data models, SQL, ...

 To understand how a DBMS works


 Physical properties of disks and files, software to
manage reading and writing to disk,
implementation of algorithms to answer user
queries, ...

CSIE30600/CSIEB0290 Database Systems Introduction 17

Types of Databases and Applications


 Traditional Applications:
 Numeric and Textual Databases
 More Recent Applications:
 Multimedia Databases (images, audio, video, …)
 Geographic Information Systems (GIS)
 Data Warehouses
 Real-time and Active Databases
 Many other applications
 New Trends: big data analytics, IoT
 First part: focuses on traditional applications
 A number of recent applications are described later in the
class and book.
CSIE30600/CSIEB0290 Database Systems Introduction 18

Note 9
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Database System Environment

catalog
CSIE30600/CSIEB0290 Database Systems Introduction 19

Typical DBMS Functionality


 Define a particular database in terms of its data types,
structures, and constraints
 Construct or Load the initial database contents on a
secondary storage medium
 Manipulating the database:
 Retrieval: Querying, generating reports
 Modification: Insertions, deletions and updates to its
content
 Accessing the database through Web applications
 Processing and Sharing by a set of concurrent users and
application programs – yet, keeping all data valid and
consistent

CSIE30600/CSIEB0290 Database Systems Introduction 20

Note 10
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Typical DBMS Functionality


 Other features:
 Protection or Security measures to prevent
unauthorized access
 “Actively” take internal actions on data
 Presentation and Visualization of data
 Maintaining the database and associated programs
over the lifetime of the database application
 Called database, software, and system maintenance

CSIE30600/CSIEB0290 Database Systems Introduction 21

Application Activities Against a DB


 Applications interact with a database by generating
 Queries: that access different parts of data and formulate
the result of a request
 Transactions: that may read some data and “update” certain
values or generate new data and store that in the database
 Applications must not allow unauthorized users to access
data
 Applications must keep up with changing user
requirements against the database

CSIE30600/CSIEB0290 Database Systems Introduction 22

Note 11
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Database Applications
 Banking: all transactions
 Airlines: reservations, schedules
 Universities: registration, grades
 Sales: customers, products, purchases
 Online retailers: order tracking, customized
recommendations
 Manufacturing: production, inventory, orders, supply
chain
 Human resources: employee records, salaries, tax
deductions
 Databases touch all aspects of our lives

CSIE30600/CSIEB0290 Database Systems Introduction 23

Purpose of Database Systems


 In the early days, database applications were built directly
on top of file systems
 Drawbacks of using file systems :
 Data redundancy and inconsistency
 Multiple file formats, duplication in different files
 Difficulty in accessing data
 Need to write a new program for each new task
 Data isolation — multiple files and formats
 Integrity problems
 Integrity constraints (e.g. account balance > 0) become “buried”
in program code rather than being stated explicitly
 Hard to add new constraints or change existing ones

CSIE30600/CSIEB0290 Database Systems Introduction 24

Note 12
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Purpose of Database Systems


 Drawbacks of using file systems (cont.)
 Atomicity of updates
 Failures may leave database in an inconsistent state
 Example: Transfer of funds from one account to another
 Concurrent access by multiple users
 Concurrent accessed needed for performance
 Uncontrolled concurrent accesses can lead to inconsistencies
 Example: Two people reading and updating at the same time
 Security problems
 Hard to provide user access to some, but not all, data
 Database systems offer solutions to ALL the above
problems

CSIE30600/CSIEB0290 Database Systems Introduction 25

Example of a Database
 Mini-world for the example:
 Part of a UNIVERSITY environment.
 Some mini-world entities:
 STUDENTs
 COURSEs
 SECTIONs (of COURSEs)
 DEPARTMENTs
 INSTRUCTORs

CSIE30600/CSIEB0290 Database Systems Introduction 26

Note 13
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Example of a Database
 Some mini-world relationships:
 SECTIONs are of specific COURSEs
 STUDENTs take SECTIONs
 COURSEs have prerequisite COURSEs
 INSTRUCTORs teach SECTIONs
 COURSEs are offered by DEPARTMENTs
 STUDENTs major in DEPARTMENTs

 Note: The above entities and relationships are typically


expressed in a conceptual data model, such as the ENTITY-
RELATIONSHIP data model (to be discussed later in class)

CSIE30600/CSIEB0290 Database Systems Introduction 27

Example of a Database

CSIE30600/CSIEB0290 Database Systems Introduction 28

Note 14
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Main Characteristics
 Self-describing: A DBMS catalog (meta-data) stores
the description of the database. (next slide)
 Program-data Independence: Allows changing storage
structures w/o changing DBMS access programs.
 Data abstraction: Data models hide storage details
and present the users with a conceptual view of the
DB.
 Multiple views: Each user may see a different view of
the database.
 Data sharing: among multiple users
 Transactions, concurrent access , recovery , OLTP

CSIE30600/CSIEB0290 Database Systems Introduction 29

A Simplified Database Catalog

CSIE30600/CSIEB0290 Database Systems Introduction 30

Note 15
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Database Users
 Users may be divided into
 Those who actually use and control the database
content, and those who design, develop and
maintain database applications (called “Actors on
the Scene”), and
 Those who design and develop the DBMS software
and related tools, and the computer systems
operators (called “Workers Behind the Scene”).

CSIE30600/CSIEB0290 Database Systems Introduction 31

Database Users
 Actors on the scene
 Database administrators:
 Responsible for authorizing access to the database, for
coordinating and monitoring its use, acquiring software
and hardware resources, controlling its use and
monitoring efficiency of operations.
 Database Designers:
 Responsible to define the content, the structure, the
constraints, and functions or transactions against the
database. They must communicate with the end-users
and understand their needs.

CSIE30600/CSIEB0290 Database Systems Introduction 32

Note 16
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Database Administrator
 Coordinates all the activities of the database system; must
have a good understanding of the enterprise’s information
resources and needs.
 Database administrator's duties:
 Schema definition
 Storage structure and access method definition
 Schema and physical organization modification
 Granting user authority to access the database
 Specifying integrity constraints
 Acting as liaison with users
 Monitoring performance and responding to changes in
requirements

CSIE30600/CSIEB0290 Database Systems Introduction 33

End-users
 Actors on the scene (continued)
 End-users: use the data for queries, reports and
some of them update the database content.
 End-users can be categorized into:
 Casual: access db occasionally when needed
 Naïve or Parametric: they make up a large section of
the end-user population.
 They use previously well-defined functions in the form of
“canned transactions” against the database.
 Examples are bank-tellers or reservation clerks who do
this activity for an entire shift of operations.

CSIE30600/CSIEB0290 Database Systems Introduction 34

Note 17
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

End-users (cont.)
 Sophisticated:
 Business analysts, scientists, engineers, others thoroughly
familiar with the system capabilities.
 Many use tools in the form of software packages that work
closely with the stored database.
 Stand-alone:
 Mostly maintain personal databases using ready-to-use
packaged applications.
 An example is a tax program user that creates its own
internal database.
 Another example is a user that maintains an address book

CSIE30600/CSIEB0290 Database Systems Introduction 35

System Analysts and Application


Programmers
 System analysts: Analyze problem, determine the
requirements of the users, develop specifications.
 Application programmers: Design and implement
specification, testing, debugging, maintaining
softwares. Also known as software developers or
software engineers.
 Business analysts: There is an increasing need for
people who can analyze vast amounts of business data
and real-time data (“Big Data”) for better decision
making, planning, advertising, marketing etc.

CSIE30600/CSIEB0290 Database Systems Introduction 36

Note 18
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Users behind the Scene


 DB designers – design the database systems for end
users
 DBMS designers – design database management
systems and tools for building databases
 Tool designers – Design and implement tools that
facilitate building of applications and allow using
database effectively (eg. modeling and designing
databases, performance monitoring, prototyping, test
data generation, user interface creation, simulation
etc.)
 Operators and maintenance personnel.

CSIE30600/CSIEB0290 Database Systems Introduction 37

Advantages of Using the Database


Approach
 Controlling redundancy in data storage and in
development and maintenance efforts.
 Sharing of data among multiple users.
 Restricting unauthorized access to data.
 Providing persistent storage for program objects
 In object-oriented DBMS
 Providing storage structures (e.g. indexes) efficient
data access
 Provide optimization of queries for efficient
processing
CSIE30600/CSIEB0290 Database Systems Introduction 38

Note 19
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Advantages of Using the Database


Approach (cont.)
 Providing backup and recovery services.
 Providing multiple interfaces to different classes of
users.
 Representing complex relationships among data.
 Enforcing integrity constraints on the database.
 Drawing inferences and actions from the stored data
using deductive and active rules

CSIE30600/CSIEB0290 Database Systems Introduction 39

Additional Implications
 Potential for enforcing standards:
 This is very crucial for the success of database
applications in large organizations. Standards
refer to data item names, display formats, screens,
report structures, meta-data (description of data),
Web page layouts, etc.
 Reduced application development time:
 Incremental time to add each new application is
reduced.

CSIE30600/CSIEB0290 Database Systems Introduction 40

Note 20
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Additional Implications (cont.)


 Flexibility to change data structures:
 Database structure may evolve as new requirements
are defined.
 Availability of current information:
 Extremely important for on-line transaction
systems such as airline, hotel, car reservations.
 Economies of scale:
 Wasteful overlap of resources and personnel can be
avoided by consolidating data and applications
across departments.

CSIE30600/CSIEB0290 Database Systems Introduction 41

History of Database Systems


 Pre-1960s
 File processing systems
 Redundancy and inconsistency between files
 Incompatibility between access programs
 Data isolation
 Concurrent access anomalies
 Security and integrity problems

CSIE30600/CSIEB0290 Database Systems Introduction 42

Note 21
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

DB History (cont.)
 The '60s
 Carles Bachman designed the 1st DBMS
Integrated Data Store (and received the 1st
Turing Award in 1973)
 Three-level architecture (more about this in
next lecture)
 CODASYL, DBTG, and the network model
 Hierarchical model and the IMS system

CSIE30600/CSIEB0290 Database Systems Introduction 43

DB History (cont.)
 The '70s
 Edgar Codd (1970): The Relational model (Codd
won the 1981 Turing Award)
 Provide a sound theoretical base.
 1975, 1st ACM SIGMOD international conference
 1975, 1st VLDB international conference
 Peter Chen 陳品山 (1976): The Entity-relationship
model
 System R (IBM), INGRES (UC-Berkely), System
2000 (UT-Austin)
 SQL, QUEL
CSIE30600/CSIEB0290 Database Systems Introduction 44

Note 22
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

DB History (cont.)
 The '80s
 Commertcial relational DBMS
(DB2, ORACLE, SYBASE, INFORMIX, ...)
 DBMS on PC's
(DBASE, PARADOX, ...)
 Transaction management (James Gray won
the 1999 Turing Award)
 Standards (SQL standardized in the late
1980s)
CSIE30600/CSIEB0290 Database Systems Introduction 45

DB History (cont.)
 The '90s
 New applications (Web, CAD/CAM, CASE, office
automation, science and engineering, VLSI, ...)
 Demand for new DBMS technologies
 Object-oriented DBs, Parallel/Distributed DBs,
Active/Deductive DBs, Multimedia DBs, Mobile DBs,
Temporal/Real-time DBs, Spatial DBs(such as GIS), …
 The emergence of ERP (Enterprise Resource Planning)
and MRP (Material Requirements Planning) packages
 Data Warehousing and data mining
 DBMS in the Internet/Web and E-commerce
applications

CSIE30600/CSIEB0290 Database Systems Introduction 46

Note 23
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

DB History (cont.)
 The 2000s and beyond
 XML, XQuery and the Semantic Web
 Data Stream Management Systems (DSMS)
 Sensor databases
 Network traffic analysis
 RFID data management
 …
 Mobile Data Management (MDM)

CSIE30600/CSIEB0290 Database Systems Introduction 47

New Trend
 First decade of the 21st century has seen tremendous
growth in user generated data and automatically collected
data from applications and search engines.
 Social Media platforms such as Facebook and Twitter are
generating millions of transactions a day and businesses
are interested to tap into this data to “understand” the
users.
 Cloud Storage and Backup is making unlimited amount of
storage available to users and applications

CSIE30600/CSIEB0290 Database Systems Introduction 48

Note 24
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Emergence of Big Data & NoSQL


 New data storage, management and analysis technology
was necessary to deal with the huge volumn of data in
petabytes a day (1015 bytes or 1000 terabytes) in some
applications – “Big Data”.
 Hadoop and Mapreduce programming approach to
distributed data as well as the Google File System have
given rise to Big Data technologies. Further
enhancements are taking place in the form of Spark based
technology.
 NoSQL (Not only SQL) systems have been designed for
rapid search and retrieval from documents, processing of
huge graphs, and other forms of unstructured data with
flexible models of transaction processing
CSIE30600/CSIEB0290 Database Systems Introduction 49

When NOT to Use a DBMS


 Main inhibitors (costs) of using a DBMS:
 High initial investment and possible need for additional
hardware. (No longer the case with cloud)
 The generality that a DBMS provides for defining and
processing data
 Overhead for providing security, concurrency control,
recovery, and integrity functions.
 When a DBMS may be unnecessary:
 If the database and applications are simple, well defined,
and not expected to change.
 If access to data by multiple users is not required.

CSIE30600/CSIEB0290 Database Systems Introduction 50

Note 25
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

When NOT to use a DBMS


 When a DBMS may be infeasible:
 In embedded systems where a general purpose
DBMS may not fit in available storage
 When no DBMS may suffice:
 If there are stringent real-time requirements that
may not be met because of DBMS overhead
 If the database system is not able to handle the
complexity of data because of modeling limitations
 If the database users need special operations not
supported by the DBMS (e.g. GIS and location
based services)
CSIE30600/CSIEB0290 Database Systems Introduction 51

Related Systems
Knowledge Management Systems

Expert Systems

Knowledge
Decision Support Systems

Information Management Info Systems

Database Systems
Data
DBMS

CSIE30600/CSIEB0290 Database Systems Introduction 52

Note 26
CSIE30600/CSIEB0290 Database Systems Lecture 1 Introduction

Summary
 Types of Databases and Database Applications
 Basic Definitions
 Typical DBMS Functionality
 Example of a Database (UNIVERSITY)
 Main Characteristics of the Database Approach
 Database Users
 Advantages of Using the Database Approach
 Database History and New Trend
 When NOT to Use Databases
 Related Systems
CSIE30600/CSIEB0290 Database Systems Introduction 53

Note 27

You might also like