178chapter 5 - Data Resource Management
178chapter 5 - Data Resource Management
Types of Databases
• Operational Databases
– Stores detailed data needed to support business processes
and operations
• Also called subject area databases (SADB), transaction
databases, and production databases
• Database examples: customer, human resource,
inventory
Data Resource Management
• Distributed Databases
– Distributed databases are copies or parts of databases
stored on servers at multiple locations
– a database in which portions of the database are stored in
multiple physical locations and processing is distributed
among multiple database nodes
• Improves database performance at worksites
– Advantages
• Protection of valuable data
• Data can be distributed into smaller databases
• Each location has control of its local data
• All locations can access any data, any where
Data Resource Management
• Distributed Databases
– Disadvantages
• Maintaining data accuracy
• Requires extra computing power
• Network necessary to access multiple databases
– Replication
• Look at each distributed database and find changes
• Apply changes to each distributed database
• Very complex and time consuming
– Duplication
• One database is master
• Duplicate the master after hours, in all locations
• Easier to accomplish
Data Resource Management
• External Databases
– Databases available for a fee from commercial online
services, or free from the Web
• Example: hypermedia databases, statistical databases,
bibliographic and full text databases
• Search engines like Google or Yahoo are external
databases
• Hypermedia Databases
– A hypermedia database contains
• Hyperlinked pages of multimedia
• Interrelated hypermedia page elements, rather than
interrelated data records
Components of Web-Based System
Data Warehouse
• Stores static data that has been extracted from other
databases in an organization
– Central source of data that has been cleaned,
transformed, and cataloged
– Data is used for data mining, analytical processing,
business analysis, research, decision support
• Data warehouses may be divided into data marts
– Subsets of data that focus on specific aspects
of a company (department or business process)
Data Warehouse Components
R
Applications and Data Marts
Data Mining
• Data in data warehouses are analyzed to reveal hidden
patterns and trends in historical data
• Data mining analysis help business to make strategic decisions
to gain competitive advantage
• Data mining can discover new or unknown correlations,
patterns, and trends in vast amount of data stored in data
warehouse.
• Data Mining software uses various pattern recognition
algorithms, and mathematical and statistical techniques
– Market-basket analysis to identify new product bundles
– Find root cause of quality or manufacturing problems
– Prevent customer attrition
– Acquire new customers
– Cross-sell to existing customers
– Profile customers with more accuracy
Traditional File Processing
• In traditional file processing system, data are
organized, stored, and processed in independent files
– Each business application designed to use specialized
data files containing specific types of data records
– Cumbersome, costly and inflexible to supply
information needed to manage modern business
• Problems
– Data redundancy
– Lack of data integration
– Data dependence (files, storage devices, software)
– Lack of data integrity or standardization
Problems of File Processing
• Data redundancy
– Duplicate data
– Caused problem when data has to be updated
– Separate file maintenance programs
– Inconsistency among data stored in separate files
• Lack of data integrity or standardization
– Different files and programs define different definition
for data elements
– Lack of accuracy and completeness
– Lack of control over its access and use
Problems of File Processing
• Data dependence
– Files, storage locations, software are interdependent
– Changes in the format and structure of data/records
in a file required changes to be made in all the
programs that access the file
– Program maintenance was a major burden and
resulted into inconsistency
• Lack of data integration
– Separate individual files couldn’t be used for
connecting them for ad-hoc queries
– Resulted into various reports than one consolidated
view
Traditional File Processing
Database Management
• The foundation of modern methods of managing
organizational data
– Consolidates data records formerly in separate
files into databases (ETL)
– Data can be accessed by many different
application programs
– A database management system (DBMS) is the
software interface between users and databases
– Involves creation, maintaining and using data
Database Management
Database Management System
• In mainframe and server computer systems, a software
package that is used to…
– Create new databases and database applications
– Maintain the quality of the data in an organization’s
databases
– Use the databases of an organization to provide the
information needed by end users
• Database Development
– Defining and organizing the content, relationships, and structure of
the data needed to build a database
• Database Application Development
– Using DBMS to create prototypes of queries, forms, reports, Web
pages
• Database Maintenance
– Using transaction processing systems and other tools to add, delete,
update, and correct data