Lecture-7
Operational Infrastructure
• Operational infrastructure to support each architectural component consists of
• People
• Procedures
• Training
• Management software
• These are not the people and procedures needed for developing the data
warehouse.
• These are the people and procedures needed to keep the data warehouse
going. They are as essential as the hardware and software that keep the data
warehouse running: they support the management of the data warehouse and
maintain its efficiency.
Infrastructure Classification (Cont.)
Physical Infrastructure
• The platform consists of the basic hardware components, the operating
system with its utility software, the network, and the network software. Along
with the overall platform is the set of tools that run on the selected platform to
perform the various functions and services of individual architectural
components.
HARDWARE AND OPERATING SYSTEM
• Hardware and operating systems make up the computing environment for
your data warehouse.
• All the data extraction, transformation, integration, and staging jobs run on the
selected hardware under the chosen operating system.
• When you transport the consolidated and integrated data from the staging
area to your data warehouse repository, you make use of the server hardware
and the operating system software.
• When the queries are initiated from the client workstations, the server
hardware, in conjunction with the database software, executes the queries and
produces the results.
Guidelines for Hardware Selection
Here are some general guidelines for hardware selection; they are not entirely
specific to data warehouse hardware.
• Support: Vendor support is crucial for hardware maintenance. Make sure that
the support from the hardware vendor is at the highest possible level.
• Vendor Stability: Check on the stability and staying power of the vendor.
Guidelines for OS Selection
Let us quickly consider a few general criteria for the selection of the operating
system. First of all, the operating system must be compatible with the hardware.
A list of criteria follows:
• Scalability: Along with the hardware and database software, the operating
system must be able to support the increase in the number of users and
applications.
• Security: The operating system must provide each client with a secure
environment.
Platform Options
• We will now discuss platform options in terms of the functions in the three
areas of data acquisition, data storage, and information delivery.
Platform Options
1. Single Platform Option
• This is the most straightforward and simplest option for implementing the data
warehouse architecture.
• In this option, all functions from the backend data extraction to the front-end
query processing are performed on a single computing platform.
• This was perhaps the earliest approach, when developers were implementing
data warehouses on existing mainframes, minicomputers, or a single UNIX-
based server.
• Because all operations in the data acquisition, data storage, and information
delivery areas take place on the same platform, this option hardly ever
encounters any compatibility or interface problems.
• The data flows smoothly from beginning to end without any platform-to-
platform conversions. No middleware is needed.
• All tools work in a single computing environment.
Platform Options – Hybrid Option
• If your company falls in the category where the legacy platform will
accommodate the data warehouse, then, by all means, take the approach of
a single-platform solution. Again, the single-platform solution, if feasible, is
the easier solution.
• For the rest of us who are not that fortunate, we have to consider other
options. Let us begin with data extraction, the first major operation, and follow
the flow of data until it is consolidated into load images and waiting in the
staging area.
• We will now step through the data flow and examine the platform options.
i. Data Extraction: Perform data extraction from each source system on its
own computing platform, using the extraction facilities available there.
ii. Initial Reformatting and Merging: After creating the raw data extracts from
the various sources, the extracted files from each source are reformatted
and merged into a smaller number of extract files. Just like the extraction
step, it is best to do this step of initial merging of each set of source extracts
on the source platform itself.
iii. Preliminary Data Cleansing: In this step, you verify the extracted data from
each data source for any missing values in individual fields, supply default
values, and perform basic edits. This is another step for the computing
platform of the source system itself.
iv. Transformation and Consolidation: This step comprises all the major data
transformation and integration functions. Usually, you will use transformation
software tools for this purpose. Where is the best place to perform this step?
Obviously, not in any individual legacy platform. You perform this step on the
platform where your staging area resides.
Platform Options – Hybrid Option (Cont.)
v. Validation and Final Quality Check: This step of final validation and
quality check is a strong candidate for the staging area. You will arrange for
this step to happen on that platform.
vi. Creation of Load Images: This step creates load images for individual
database files of the data warehouse repository. This step almost always
occurs in the staging area and, therefore, on the platform where the staging
area resides.
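The staging-area flow in steps ii through vi can be sketched as a small pipeline. This is a minimal illustration only, assuming hypothetical field names, default rules, and validation checks; in practice these steps are performed by ETL tools on the staging platform.

```python
# Minimal sketch of steps ii-vi: merge the per-source extracts, cleanse with
# default values, transform/consolidate, validate, and emit load images.
# All field names and rules are hypothetical.

def merge_extracts(extract_files):
    """Step ii: merge several per-source extract files into one batch."""
    merged = []
    for records in extract_files:
        merged.extend(records)
    return merged

def cleanse(record):
    """Step iii: supply default values for missing fields."""
    record = dict(record)
    if not record.get("region"):
        record["region"] = "UNKNOWN"
    return record

def transform(record):
    """Step iv: consolidate into the warehouse record format."""
    return {"cust_key": record["id"],
            "region": record["region"],
            "sales_amt": float(record["amount"])}

def validate(record):
    """Step v: final quality check before creating load images."""
    return record["sales_amt"] >= 0

def build_load_images(extract_files):
    """Step vi: produce the load images waiting in the staging area."""
    records = [transform(cleanse(r)) for r in merge_extracts(extract_files)]
    return [r for r in records if validate(r)]

images = build_load_images([[{"id": 1, "amount": "10.5", "region": ""}],
                            [{"id": 2, "amount": "-3", "region": "EAST"}]])
# The first record passes with its region defaulted; the negative amount fails.
```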
DATABASE SOFTWARE
• Data warehouse-related add-ons are becoming part of the database offerings.
• The database software that started out for use in operational OLTP systems is
being enhanced to cater to decision support systems.
• Some RDBMS products now include support for the data acquisition area of
the data warehouse.
• Mass loading and retrieval of data from other database systems have become
easier.
• Some vendors have paid special attention to the data transformation function.
• Replication features have been reinforced to assist in bulk refreshes and
incremental loading of the data warehouse.
• Apart from these enhancements, the more important ones relate to load
balancing and query performance. These two features are critical in a data
warehouse. Your data warehouse is query-centric.
• Everything that can be done to improve query performance is most desirable.
• The DBMS vendors are providing parallel processing features to improve
query performance. Let us briefly review the parallel processing options within
the DBMS that can take full advantage of parallel server hardware.
Parallel Processing Options
• Parallel processing options in database software are intended only for
machines with multiple processors.
• Most of the current database software can parallelize a large number of
operations. These operations include the following: mass loading of data, full
table scans, queries with exclusion conditions, queries with grouping, selection
with distinct values, aggregation, sorting, creation of tables using subqueries,
creating and rebuilding indexes, inserting rows into a table from other tables,
enabling constraints, star transformation (an optimization technique when
processing queries against a STAR schema), and so on.
• Let us now examine what happens when a user initiates a query at the
workstation. Each session accesses the database through a server process.
The query is sent to the DBMS and data retrieval takes place from the
database. Data is retrieved and the results are sent back, all under the control
of the dedicated server process. The query dispatcher software is responsible
for splitting the work, distributing the units to be performed among the pool of
available query server processes, and balancing the load. Finally, the results
of the query processes are assembled and returned as a single, consolidated
result set.
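The dispatcher's job of splitting the work, distributing the units, and assembling a consolidated result can be sketched as follows. This is only an illustration of the idea; threads stand in for the pool of query server processes, and the partitioned scan is hypothetical.

```python
# Sketch of a query dispatcher: split the work across a pool of query server
# processes (threads here), then merge the partial results into one set.
from concurrent.futures import ThreadPoolExecutor

def scan_partition(partition, predicate):
    """One unit of work: scan a data partition and apply the query filter."""
    return [row for row in partition if predicate(row)]

def dispatch_query(partitions, predicate, workers=4):
    """Fan the scan out across the pool, then assemble the result set."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = pool.map(scan_partition, partitions,
                            [predicate] * len(partitions))
    return [row for partial in partials for row in partial]

data = [[1, 8, 3], [9, 2], [7, 4]]          # three data partitions
result = dispatch_query(data, lambda x: x > 5)
# One qualifying row per partition, returned in partition order: [8, 9, 7]
```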
Parallel Processing Options (Cont.)
Inter-query Parallelization
• In this method, several server processes handle multiple requests
simultaneously. Multiple queries may be serviced based on your server
configuration and the number of available processors.
• However, inter-query parallelism is limited. Multiple queries are processed
concurrently, but each query is still being processed serially by a single server
process. Suppose a query consists of index read, data read, sort, and join
operations; these operations are carried out in this order. Each operation must
finish before the next one can begin. Parts of the same query do not execute
in parallel. To overcome this limitation, many DBMS vendors have come up
with versions of their products to provide intra-query parallelization.
Intra-query Parallelization
• Using the intra-query parallelization technique, the DBMS splits the query into
the lower-level operations of index read, data read, data join, and data sort.
Each of these basic operations is then executed in parallel, each on its own
processor. The final result set is the consolidation of the intermediary results.
Parallel Processing Options (Cont.)
Three ways a DBMS can provide intra-query parallelization are horizontal
parallelism, vertical parallelism, and a hybrid of the two.
i. Horizontal Parallelism.
• The data is partitioned across multiple disks. Parallel processing occurs within
each single task in the query.
Parallel Processing Options (Cont.)
ii. Vertical Parallelism.
• This kind of parallelism occurs among different tasks, not just a single task in a
query as in the case of horizontal parallelism. All component query operations
are executed in parallel, but in a pipelined manner. This assumes that the
RDBMS has the capability to decompose the query into subtasks; each
subtask has all the operations of index read, data read, join, and sort. Then
each subtask executes on the data in serial fashion.
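The pipelined execution described above can be sketched with two concurrent stages, the first feeding the second as rows are produced. This is a minimal illustration: the stages (data read and sort) and the data are hypothetical, and threads stand in for the parallel query operations.

```python
# Sketch of vertical (pipelined) parallelism: a data-read stage and a sort
# stage run concurrently, connected by a queue, instead of one finishing
# before the other begins.
import queue
import threading

SENTINEL = object()  # marks the end of the row stream

def data_read(rows, out_q):
    """Stage 1: read rows and pass each one downstream as it is produced."""
    for row in rows:
        out_q.put(row)
    out_q.put(SENTINEL)

def sort_stage(in_q, results):
    """Stage 2: collect rows as they arrive, then sort the batch."""
    batch = []
    while (row := in_q.get()) is not SENTINEL:
        batch.append(row)
    results.extend(sorted(batch))

q = queue.Queue()
results = []
reader = threading.Thread(target=data_read, args=([3, 1, 2], q))
sorter = threading.Thread(target=sort_stage, args=(q, results))
reader.start(); sorter.start()
reader.join(); sorter.join()
# results == [1, 2, 3]
```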
• A guiding principle of data warehouse construction is: architecture first, and
then tools. Why is this principle sacred? Why is it not advisable to just buy the
set of tools and then use the tools to build and to deploy your data warehouse?
• The reason is that the tools may not meet the requirements as reflected in the
architecture.
Collection of Tools
• Software tools are available for every architectural component of the data
warehouse.
• Software tools are extremely important in a data warehouse; they cover all the
major functions.
Types of Software Tools
Data Modeling
• Enable developers to create and maintain data models for the source systems
and warehouse target databases. If necessary, data models may be created
for the staging area.
• Provide forward engineering capabilities to generate the database schema.
• Provide reverse engineering capabilities to generate the data model from the
data dictionary entries of existing source databases.
• Provide dimensional modeling capabilities to data designers for creating STAR
schemas.
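The forward engineering capability mentioned above (generating the database schema from a data model) can be sketched as follows. This is a toy illustration, not the behavior of any particular modeling tool; the table and column names are hypothetical.

```python
# Sketch of forward engineering: generate DDL for a hypothetical STAR-schema
# fact table from a simple in-memory data model.

model = {
    "table": "sales_fact",
    "columns": [("date_key", "INTEGER"),
                ("product_key", "INTEGER"),
                ("sales_amt", "DECIMAL(12,2)")],
}

def generate_ddl(model):
    """Emit a CREATE TABLE statement from the model definition."""
    cols = ",\n  ".join(f"{name} {dtype}" for name, dtype in model["columns"])
    return f"CREATE TABLE {model['table']} (\n  {cols}\n);"

ddl = generate_ddl(model)
print(ddl)
```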
Data Extraction
• Two primary extraction methods are available: bulk extraction for full refreshes
and change-based replication for incremental loads.
• Tool choices depend on the following factors: source system platforms and
databases, and available built-in extraction and replication facilities in the
source systems.
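The contrast between the two extraction methods can be sketched as follows. The record layout and the `updated_at` change-tracking field are hypothetical conventions, assumed only for illustration.

```python
# Sketch of the two primary extraction methods: bulk extraction for a full
# refresh versus change-based extraction for an incremental load.

def bulk_extract(source_rows):
    """Full refresh: extract every row from the source."""
    return list(source_rows)

def incremental_extract(source_rows, last_run_ts):
    """Incremental load: extract only rows changed since the last run."""
    return [r for r in source_rows if r["updated_at"] > last_run_ts]

rows = [{"id": 1, "updated_at": 100}, {"id": 2, "updated_at": 205}]
full = bulk_extract(rows)                 # both rows, for a full refresh
delta = incremental_extract(rows, 200)    # only the row changed after 200
```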
Types of Software Tools (Cont.)
Data Transformation
• Transform extracted data into appropriate formats and data structures.
• Provide default values as specified.
• Major features include field splitting, consolidation, standardization, and de-
duplication.
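The transformation features named above can be sketched in a few lines. The record layout and the rules (splitting a combined name, upper-casing a state code, de-duplicating on the name) are hypothetical examples of what transformation tools do.

```python
# Sketch of three transformation features: field splitting, standardization,
# and de-duplication. All field names and rules are hypothetical.

def split_name(record):
    """Field splitting: break a combined name into first and last fields."""
    first, _, last = record["full_name"].partition(" ")
    return {**record, "first_name": first, "last_name": last}

def standardize(record):
    """Standardization: enforce one agreed representation for codes."""
    return {**record, "state": record["state"].strip().upper()}

def deduplicate(records, key):
    """De-duplication: keep the first record seen for each key value."""
    seen, unique = set(), []
    for r in records:
        if r[key] not in seen:
            seen.add(r[key])
            unique.append(r)
    return unique

raw = [{"full_name": "Ann Lee", "state": " ny "},
       {"full_name": "Ann Lee", "state": "NY"}]
clean = deduplicate([standardize(split_name(r)) for r in raw], "full_name")
# clean holds a single record with first_name "Ann" and state "NY"
```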
Data Loading
• Load transformed and consolidated data in the form of load images into the
data warehouse repository.
• Some loaders generate primary keys for the tables being loaded.
• For load images available on the same RDBMS engine as the data
warehouse, pre-coded procedures stored on the database itself may be used
for loading.
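The load step, including loader-generated primary keys, can be sketched against an embedded SQLite engine. The table layout and load images are hypothetical; a production warehouse would use the RDBMS's bulk loader or stored procedures as noted above.

```python
# Sketch of the load step: apply load images to a warehouse table, with a
# surrogate primary key generated by the engine rather than the source data.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""CREATE TABLE sales_fact (
    sale_key INTEGER PRIMARY KEY AUTOINCREMENT,  -- loader-generated key
    product  TEXT,
    amount   REAL)""")

load_images = [("widget", 10.5), ("gadget", 3.0)]
conn.executemany("INSERT INTO sales_fact (product, amount) VALUES (?, ?)",
                 load_images)
conn.commit()

count = conn.execute("SELECT COUNT(*) FROM sales_fact").fetchone()[0]
keys = [k for (k,) in conn.execute("SELECT sale_key FROM sales_fact")]
# Two rows loaded; sale_key values 1 and 2 were generated by the engine.
```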
Types of Software Tools (Cont.)
Data Quality
• Assist in locating and correcting data errors.
• May be used on the data in the staging area or on the source systems directly.
• Help resolve data inconsistencies in load images.
Alert Systems
• Highlight conditions and get the user’s attention based on defined exceptions.
• Provide alerts from the data warehouse database to support strategic
decisions.
• Three basic alert types are: from individual source systems, from integrated
enterprise-wide data warehouses, and from individual data marts.
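A minimal alert check can be sketched as follows. The measures and thresholds are hypothetical; a real alert system would evaluate its defined exceptions against the warehouse database or data marts.

```python
# Sketch of an alert system: flag any measure that crosses a defined
# exception threshold. Measure names and thresholds are hypothetical.

EXCEPTIONS = {"daily_returns": lambda v: v > 100,
              "inventory_level": lambda v: v < 20}

def check_alerts(measures):
    """Return the names of measures that triggered a defined exception."""
    return [name for name, value in measures.items()
            if name in EXCEPTIONS and EXCEPTIONS[name](value)]

alerts = check_alerts({"daily_returns": 150, "inventory_level": 35})
# Only daily_returns exceeds its threshold here.
```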
Types of Software Tools (Cont.)
Middleware and Connectivity
• Transparent access to source systems in heterogeneous environments.
• Transparent access to databases of different types on multiple platforms.
• Tools are moderately expensive but prove to be invaluable for providing
interoperability among the various data warehouse components.