Data Warehouse: in It's Simplest Form, A Data Ware House Is A Collection of Key
Data Warehouse: in It's Simplest Form, A Data Ware House Is A Collection of Key
Before looking into the details of each of the managers we could get a broad idea about their functionality by mapping the processes that we studied in the
previous chapter to the managers. The extracting and loading processes are taken care of by the load manager. The processes of cleanup and transformation of
data as also of back up and archiving are the duties of the ware house manage, while the query manager, as the name implies is to take case of query
management.
i). The processes to populate the ware house have to be able to extract the data, clean it up, and make it available to the analysis
systems. This is done on a daily / weekly basis depending on the quantum of the data population to be incorporated.
ii). The day to day management of data ware house is not to be confused with maintenance and management of hardware and
software. When large amounts of data are stored and new data are being continually added at regular intervals, maintaince of the
quality of data becomes an important element.
iii). Ability to accommodate changes implies the system is structured in such a way as to be able to cope with future changes without
the entire system being remodeled. Based on these, we can view the processes that a typical data ware house scheme should support as
follows.
25. What are the different system management tools used for
data warehouse?
The different system management tools used for data warehouse :-
i). Configuration managers
ii). schedule managers
iii). event managers
iv). database mangers
v). back up recovery managers
vi). resource and performance a monitors.
Q No. 4 [5]
Creation of a DW leads to a direct increase in quality of analyses as the table structures are simpler (you keep only the needed information in simpler tables),
standardized (well-documented table structures), and often denormalized (to reduce the linkages between tables and the corresponding complexity of queries). A
DW drastically reduces the 'cost-per-analysis' and thus permits more analysis per FTE. Having a well-designed DW is the foundation successful BI/Analytics initiatives
are built upon.
If you are still running your reports off the main transaction database, answer this simple question: Would the solution still work next year with 20% more customers,
50% more business, 70% more users, and 300% more reports? What about the year after next? If you are sure that your solution will run without any changes,
great!! However, if you have already budgeted to buy new state-of-the-art hardware and 25 new Oracle licenses with those partition-options and the 33 other cool-
sounding features, good luck to you. (You can probably send me a ticket to Hawaii, since it's gonna cost you just a minute fraction of your budget)
After all, both are databases, and both have some tables containing data. If you look deeper, you'd find that both have indexes, keys, views, and the regular jing-
bang. So is that 'Data warehouse' really different from the tables in your application? And if the two aren't really different, maybe you can just run your queries and
reports directly from your application databases!
Well, to be fair, that may be just what you are doing right now, running some EOD (end-of-day) reports as complex SQL queries and shipping them off to those who
need them. And this scheme might just be serving you fine right now. Nothing wrong with that if it works for you.
But before you start patting yourself on the back for having avoided a data warehouse altogether, do spend a moment to understand the differences, and to
appreciate the pros and cons of either approach.
The primary difference between transaction database and a data warehouse is that while the former is designed (and optimized) to record , the latter has to be
designed (and optimized) to respond to analysis questions that are critical for your business.
It's probably simpler and more sensible to create a new DW exclusively for your BI needs. And if you are cash strapped, you could easily do that at extremely low
costs by using excellent open source databases like MySQL.