
Executive summary

This report discusses the design and modeling issues associated with a data warehouse for the University of
Florida, as developed by the office of the Chief Information Officer (CIO). The data warehouse is
designed and implemented on a mainframe system using a highly de-normalized DB2 repository that holds
detailed transaction data and feeds data to heterogeneous data models owned by different
administrative units. Furthermore, this report discusses other aspects of the data warehouse,
along with a stakeholder analysis and the current and future states.
Needs
A data warehouse is needed to generate reports, feed data to Business Intelligence (BI)
tools, forecast trends, and train machine learning models. A data warehouse stores data from multiple
sources, such as APIs, databases, and cloud storage, using the ETL (Extract, Transform, Load) process.
The concept of the data warehouse has existed since the 1980s, when it was developed to help
transition data from merely powering operations to fueling decision support systems that reveal
business intelligence.
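The extract, transform, load flow described above can be sketched in miniature. The source records, field names, and cleaning rules below are hypothetical illustrations, not the actual UF feeds:

```python
import sqlite3

def extract(rows):
    """Extract: pull raw transaction records from a source (here, an in-memory
    list standing in for an API, database export, or cloud storage file)."""
    return list(rows)

def transform(rows):
    """Transform: clean and reshape records before loading
    (normalize unit names, drop incomplete transactions)."""
    cleaned = []
    for r in rows:
        if r.get("amount") is None:
            continue  # drop incomplete transactions
        cleaned.append({"unit": r["unit"].strip().upper(),
                        "amount": float(r["amount"])})
    return cleaned

def load(rows, conn):
    """Load: write the cleaned records into a warehouse table."""
    conn.execute("CREATE TABLE IF NOT EXISTS transactions (unit TEXT, amount REAL)")
    conn.executemany("INSERT INTO transactions VALUES (:unit, :amount)", rows)
    conn.commit()

source = [{"unit": " registrar ", "amount": "125.50"},
          {"unit": "housing", "amount": None}]
conn = sqlite3.connect(":memory:")
load(transform(extract(source)), conn)
print(conn.execute("SELECT unit, amount FROM transactions").fetchall())
```

In a real warehouse each stage would be far more elaborate, but the three-step shape is the same.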
Expected outcomes
The data warehouse must provide data services to numerous administrative units on campus. It is
both a parallel and serial processing environment that executes different tasks and requests
concurrently. Implementation goals include:
• Dramatic performance gains for as many categories of user queries as possible,
• A reasonable amount of extra data storage added to the warehouse,
• Complete transparency to end users and to application designers,
• Direct benefit to all users, regardless of which query tool they use,
• Impact the cost of the data system as little as possible, and
• Impact the DBA’s administrative responsibilities as little as possible.
Scope
A data warehouse is a central server system that permits the storage, analysis, and interpretation of
data to aid in decision-making. It houses structured data (database tables, Excel
sheets) as well as semi-structured data (XML files, web pages) for tracking and reporting,
making it an essential part of the organization.
In scope
The world now moves at a fast pace, so it is important for financial and
business corporations to have data warehouses: a warehouse makes the movement of data easy and
therefore saves users' time.
Out of scope
Small corporations do not need data warehouses, because they can manage with a single
system; the traffic their business generates is not high. Similarly, small financial systems
also do not need data warehouses.

Stakeholder analysis

CEO
The CEO is the main stakeholder in our case because the CEO manages the institution's overall operations. This
may include delegating and directing agendas, driving profitability, managing organizational structure
and strategy, and communicating with the board.
Department managers
Department managers are the other stakeholders because they manage the daily activities of
the team responsible for the design, implementation, maintenance, and support of data
warehouse systems and projects. They oversee data design and the creation of database architecture and
data repositories.

Assumptions
The system will not shut down at any moment, so end users can access it whenever they want.
The required site will open in one click, without any delay.
No one will be able to hack the data because of its high level of security.
Constraint
The system uses a simple set of XML-like tags to integrate HTML or XML with data from dynamic queries. It
does not require any user-written CICS programs for data access.
End users view the data through dynamic Web pages that access predefined canned queries produced
by Eagle Server Pages (ESP).
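The actual ESP tag syntax is proprietary and not reproduced in this report. As a rough analogy only, the "canned query" idea can be sketched in Python: the query text is predefined by the warehouse team, and end users trigger it by name rather than writing SQL. All table and query names here are hypothetical:

```python
# Hypothetical analogy to ESP-style canned queries: users select a predefined
# query by name; they never submit arbitrary SQL of their own.
CANNED_QUERIES = {
    "enrollment_by_college": "SELECT college, COUNT(*) FROM students GROUP BY college",
}

def render_page(query_name, conn):
    """Run a predefined query and embed its rows in a simple HTML table,
    mimicking a dynamic Web page backed by a canned query."""
    sql = CANNED_QUERIES[query_name]  # lookup, not user-supplied SQL
    rows = conn.execute(sql).fetchall()
    body = "".join(f"<tr><td>{c}</td><td>{n}</td></tr>" for c, n in rows)
    return f"<table>{body}</table>"
```

The benefit mirrors the ESP design: access is constrained to vetted queries, so no user-written data-access programs are needed.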
Dependencies
• The choice of technology,
• Checking its suitability,
• The installation of hardware and software, and
• The development of guidelines for performant use.
Current state diagram
The current state of the data warehouse is:
 Extracting data from legacy systems and other data sources,
 Cleansing, scrubbing, and preparing data for decision support,
 Maintaining consistent data in appropriate data storage,
 Ensuring and protecting information assets at minimum cost,
 Accessing and analyzing data using a variety of end user tools,
 Mining data for significant relationships, and
 Providing both summarized data as well as extremely fine-grained data.
(Source: Proceedings of the 2001 American Society for Engineering Education Annual Conference & Exposition, © 2001, American Society for Engineering Education.)

Future state of warehouse

The future state of the data warehouse will be:

• Determine the data types, primary keys, foreign keys, and how data will be passed between tables.
• Define and determine the parameters of the table storage.
• Estimate the size of the table storage and the entire data warehouse: the length of each attribute,
the number of rows for the initial prototype, the full historical load, and the incremental rows per load
when in production.
• Develop the initial indexing plan and define the indexes, reviewing the indexes and query strategies
to optimize the database in the process. Many sets of indexes are built so that queries retrieve data
more efficiently, and query analyzers are used to view query results, optimize the queries, and
improve the indexes they use.
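The sizing step above amounts to a back-of-the-envelope calculation: sum the attribute byte lengths per row, multiply by the row count, and allow extra room for indexes. The attribute widths, row counts, and index overhead below are made-up illustrative figures, not actual UF numbers:

```python
def estimate_table_bytes(attr_lengths, row_count, index_overhead=0.3):
    """Estimate table storage: bytes per row (sum of attribute lengths) times
    the number of rows, plus a rough fractional allowance for indexes."""
    row_bytes = sum(attr_lengths)
    data_bytes = row_bytes * row_count
    return int(data_bytes * (1 + index_overhead))

# Hypothetical fact table: two 4-byte keys, 8-byte amount,
# 10-byte date, 30-byte description -> 56 bytes per row.
attrs = [4, 4, 8, 10, 30]
initial = estimate_table_bytes(attrs, row_count=100_000)        # prototype load
historical = estimate_table_bytes(attrs, row_count=50_000_000)  # full history
print(initial, historical)
```

Repeating the same arithmetic with the expected incremental rows per load gives the growth rate once the warehouse is in production.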

Business requirements
Platform Functions
These features establish a baseline for the system to operate on. Interactivity refers to the
communication between human users and the software, and how easy the system is to use.
Customization and white labeling allow users to remake the software to their preferences and needs.
This has the double benefit of a seamless experience with other software systems you might use and
the assurance that your employees will actually use it.
Scalability
Scalability is one of the most vital differentiators for a data warehouse solution. A robust solution
scales rapidly to terabytes or even petabytes of data and concurrent users without downtime or
disruption. Elasticity refers to scaling up and down instantly to meet demand: scale up rapidly to
handle unexpected workloads, and scale down just as quickly to reduce resources and expenses.
Performance Requirements
At the end of the day, your data warehouse should be able to handle huge workloads efficiently,
utilize finite resources to deliver the best performance, and process multiple queries, users, and
processes in parallel, enhancing analytics and business decisions. An ideal solution lets you stream
data in real time while sustaining ACID properties for transactions. Workload separation is essential
for parallel processing; it refers to the proper balancing and prioritization of processes and users.
Increasing data load throughput enables faster ETL processing, while lower latency leads to faster querying.
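The concurrent execution described above can be sketched with a thread pool. The query names and timings are illustrative stand-ins, with sleeps simulating I/O-bound warehouse queries:

```python
from concurrent.futures import ThreadPoolExecutor
import time

def run_query(name, seconds):
    """Stand-in for a warehouse query; sleeping simulates I/O-bound work."""
    time.sleep(seconds)
    return f"{name}: done"

queries = [("enrollment report", 0.2),
           ("budget rollup", 0.2),
           ("trend forecast", 0.2)]

start = time.perf_counter()
with ThreadPoolExecutor(max_workers=3) as pool:
    results = list(pool.map(lambda q: run_query(*q), queries))
elapsed = time.perf_counter() - start

# With three workers the queries overlap, so the total elapsed time is close
# to the longest single query (about 0.2 s) rather than the sum of all three.
print(results, round(elapsed, 2))
```

Workload separation would refine this sketch by giving high-priority users their own pool so a long-running report cannot starve interactive queries.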
Data Visualization
Once data is organized in a data warehouse, it is ready to be visualized. This involves the system
discovering trends and patterns in data sets and generating graphs, charts, scattergrams and other
visual depictions. Visualization makes complex statistical relations easy to interpret for users. Did you
know that when we sit down to read a website, we only read an average of 28 percent of the words
on the page? We skim, make assumptions and extrapolate based on the words we do read to glean
information. That’s one reason visual depictions are so much more effective at delivering information
to our brains. Data visualization helps bridge that gap and offer information that sticks.
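The step from a summary table to a visual depiction can be illustrated with a toy text-based chart; a real warehouse would feed a BI charting tool instead, and the enrollment figures below are invented:

```python
def bar_chart(data, width=20):
    """Render a label -> value mapping as a horizontal text bar chart,
    scaled so the largest value fills the full width."""
    peak = max(data.values())
    lines = []
    for label, value in data.items():
        bar = "#" * round(width * value / peak)
        lines.append(f"{label:<14}{bar} {value}")
    return "\n".join(lines)

# Invented summary data of the kind a warehouse query might return.
enrollment = {"Engineering": 9200, "Liberal Arts": 13400, "Medicine": 5100}
print(bar_chart(enrollment))
```

Even this crude rendering makes the relative sizes visible at a glance, which is the point of the visualization layer.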
Integrations
While some BI tools restrict their users to proprietary architecture, more and more are offering a
range of integrations with other kinds of software systems and data sources. For example, service-
centered organizations need to be able to draw data directly from their CRM to generate reports and
visualizations on that information. Extract, transform, load (ETL) is also a crucial integration. ETL
combines three database functions into a single tool in order to transfer data from one database to
another.
