Lecture 11
Lecture 11
DATA RESOURCE
MANAGEMENT
Distributed Databases
• Distributed databases are copies or parts of databases stored on servers at
multiple locations
• Improves database performance at worksites
• Advantages
• Protection of valuable data
• Data can be distributed into smaller databases
• Each location has control of its local data
• All locations can access any data, any where
• Disadvantages
• Maintaining data accuracy
• Replication
• Look at each distributed database and find changes
• Apply changes to each distributed database
• Very complex
• Duplication
• One database is master
• Duplicate the master after hours, in all locations
• Easier to accomplish
Chapter 5 Data Resource ManagementChapter 5 5
External Databases
• Databases available for a fee from commercial
online services, or free from the Web
• Example: hypermedia databases, statistical
databases, bibliographic and full text databases
• Search engines like Google or Yahoo are
external databases
Chapter 5 Data Resource ManagementChapter 5 7
Data Warehouses
• Organizations are required to keep all relevant data for several years for analyzing
and comparing the current year data with that of the past years which also makes
it easier to determine key trends taking place. All this data from previous years are
stored in a database warehouse.
• Since the data stored has gone through all kinds of screening, editing and
integration, it does not need any further editing or alteration.
• Central source of data that has been cleaned, transformed, and cataloged.
• Data is used for data mining, analytical processing, analysis, research, decision
support
Data Marts
• A data mart is a simple form of a data warehouse that is
focused on a single subject (or functional area), such as
Sales or Finance or Marketing.
Data Mining
• Data mining is the process of finding anomalies, patterns
and correlations within large data sets to predict
outcomes. Using a broad range of techniques, we can use
this information to increase revenues, cut costs, improve
customer relationships, reduce risks and more.
Database Interrogation
• Database Interrogation: Capability of a DBMS to report information from
the database in response to End-Users’ request.
• Query Language: Allows easy, immediate access to ad hoc requests.
• SQL Queries
• Query form is SELECT…FROM…WHERE…
Chapter 5 Data Resource ManagementChapter 5 17
Questions
• Define Data Resource Management with its activities.3
• What is Data stewards and what capabilities a Data Stewards should
have? 3
• Define Distributed Databases with advantages and disadvantages.4
• Explain Data warehouse, Data mining and Mata Marts with example.
3
• How do you differentiate between Data Warehouse and Data Mart? 3
• Make a list of what Data Mining can do? 4
• Graphically represent the relation between Data Marts and their
applications. 3