Critical Data Warehouse Trends

The document discusses several trends in data warehousing including self-service analytics, real-time data processing, machine learning, data virtualization, metadata management, columnar storage, in-memory processing, and multiplatform/multi-cloud capabilities. It provides examples and definitions for each trend.

Uploaded by

Alexander Quemada

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views30 pages

Critical Data Warehouse Trends

Uploaded by

Alexander Quemada

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 30

Critical Data

Warehouse Trends
Self-Service Analytics:
This is the ability for the user to access and utilize available resources
(e.g. storage, compute, memory) so that they can acquire, profile,
wrangle and analyze data (structured or unstructured) for some
analytical purpose on their own.
Self-service analytics is a common buzzword for many organizations
which desire to be more data driven and less dependent on IT for their
data needs. Organizations are increasingly implementing self-service
capabilities to enable and promote a data-driven culture within their
organizations.
Real-time data is the process of analyzing
data to create insights in real time. When raw
data is received, it is immediately processed
to empower near-instant decision-making.
Instead of being stored, it is made available to
promote insights as quickly as possible,
furthering organizations’ profitability,
efficiency, and business outcomes.
Some real-world applications of real-time processing
are found in banking systems, data streaming, customer
service structures, and weather radars. Without
real-time processing, these industries would not be
possible or would deeply lack accuracy.

For example, weather radar is heavily reliant on the

real-time insights provided by this system of data
processing. Due to the sheer volume of data that is
being collected by supercomputers to study weather
interactions and predictions, real-time processing is
absolutely critical to successful interpretation.
The integration of machine learning into artificial intelligence
has revolutionized how businesses function. It has helped
improve business processes' accuracy and efficiency by
providing valuable insights through data analysis.

Machine learning is a subset of artificial intelligence that

focuses on developing algorithms and models that enable
computers to learn and improve from experience without
being explicitly programmed. It allows machines to
understand and analyze patterns and make predictions and
decisions based on those patterns. This capability has been a
game-changer for businesses across various industries.
Real life examples of Machine Learning (MI):

1. Facial recognition
Facial recognition is one of the more obvious
applications of machine learning. People previously
received name suggestions for their mobile photos and
Facebook tagging, but now someone is immediately
tagged and verified by comparing and analyzing
patterns through facial contours.
Real life examples of Machine Learning (MI):

2. Product recommendations
Do you wonder how Amazon or other retailers
frequently know what you might like to purchase? Or,
have they gotten it wildly wrong and you wonder how
they came up with the recommendation? Thank
machine learning. Targeted marketing with retail uses
machine learning to group customers based on buying
habits or demographic similarities, and by extrapolating
what one person may want from someone else’s
purchases.
Real life examples of Machine Learning (MI):
3. Email automation and spam filtering
While your inbox seems relatively boring, machine learning
influences its function behind the scenes. Email automation is
a direct result of successful machine learning, and one
function that goes most unnoticed is spam filtering.
Successful spam filtering adapts and finds patterns in email
content that is undesirable. This includes data from email
domains, a sender’s physical location message text and
structure, and IP addresses. It also requires help from users
as they mark emails when they’re mistakenly filed. With each
marked email, a new data reference is added that helps with
future accuracy.
Data virtualization is an umbrella term used to describe an approach to
data management that allows an application to retrieve and manipulate
data without requiring technical details about the data. This can include
how the data is formatted or where it is physically located. The goal of
data virtualization is to create a single representation of data from
multiple, disparate sources without having to copy or move the data.

Data virtualization software aggregates structured and unstructured

data sources for virtual viewing through a dashboard or visualization
tool.
Metadata is simply data about data. It means it is a description and context of
the data. It helps to organize, find and understand data.

Metadata management refers to the organization and control of data

which describes technical, business, or operational aspects of other
data. It involves a range of processes, policies, and technologies which
describe and give meaning to your data via searchable key attributes
such as order number or customer ID.
Enterprise metadata management helps find the data needed and to
trust that that data is accurate. The company likely has a large volume
of complex data coming from many sources. And you need to be able
to find, understand and trust the right information to gain actionable
insights that improve your business.
Ultimately, managed metadata makes it easier for all types of users to
find, understand, and access the specific information assets they need.
Columnar storage (also known as column-oriented or c-store) is a
data storage technique that organizes and stores data by columns. It
is used for data warehousing and big data analytics, where fast
query performance and efficient data compression are essential.

In a columnar database, each column of a table is stored separately,

with all values from that column grouped together. This means that
individual data elements of a particular attribute, such as “Name” or
“Age,” are stored together.

This is in contrast to traditional row-oriented databases, where each

row is stored contiguously, including all attributes of that row.
In-Memory Processing
In-memory processing is the practice of taking action on data
entirely in computer memory (e.g., in RAM). This is in contrast to
other techniques of processing data which rely on reading and
writing data to and from slower media such as disk drives.
In-memory processing typically implies large-scale environments
where multiple computers are pooled together so their collective
RAM can be used as a large and fast storage medium. Since the
storage appears as one big, single allocation of RAM, large data sets
can be processed all at once, versus processing data sets that only
fit into the RAM of a single computer.
Multiplatform typically means capable of running on two or more
different hardware platforms. For example, versions of software
available for the Windows and Mac desktop environments are
multiplatform as is software that is available for iOS and Android
mobile devices. An interpreter is very often multiplatform.
Although the source code may be the same, the interpreter
runtime engines are available for two or more hardware
platforms.

Multi-cloud is the utilization of two or more public cloud

providers to serve an organization’s IT services and
infrastructure. There is no single multi-cloud vendor.

Usually the reason for the multi-cloud model is that a single

vendor is not able to perfectly meet all needs of an enterprise.
With several cloud providers, a company can also avoid data
THANK
YOU

IT Infrastructure Management
No ratings yet
IT Infrastructure Management
8 pages
Zak, Cameron - Data Mining Concepts and Techniques - Complete Guide To A Comprehensive Understanding of Data Mining (2020) - Libgen - Li
No ratings yet
Zak, Cameron - Data Mining Concepts and Techniques - Complete Guide To A Comprehensive Understanding of Data Mining (2020) - Libgen - Li
372 pages
Big Data 4 Manuscripts - Data Analytics For Beginners, Deep Learning With Keras, Analyzing Data With Power BI, Convolutional... (Williams, Anthony) (Z-Library)
No ratings yet
Big Data 4 Manuscripts - Data Analytics For Beginners, Deep Learning With Keras, Analyzing Data With Power BI, Convolutional... (Williams, Anthony) (Z-Library)
218 pages
ISM MODULE 1 Introduction To Information Storage
No ratings yet
ISM MODULE 1 Introduction To Information Storage
46 pages
Unit 1 - Introduction To Data Mining and Data Warehousing
No ratings yet
Unit 1 - Introduction To Data Mining and Data Warehousing
84 pages
1 DM Intro1
No ratings yet
1 DM Intro1
34 pages
Laudon Mis16 PPT Ch06
No ratings yet
Laudon Mis16 PPT Ch06
42 pages
1 DM Intro
No ratings yet
1 DM Intro
34 pages
Information Storage and Management-V.5
No ratings yet
Information Storage and Management-V.5
211 pages
Data Mining and Warehousing - L1 & L2
No ratings yet
Data Mining and Warehousing - L1 & L2
30 pages
Bat 334 Database Management Systems 4
No ratings yet
Bat 334 Database Management Systems 4
23 pages
Cambridge IGCSE: Computer Science 0478/12
No ratings yet
Cambridge IGCSE: Computer Science 0478/12
12 pages
02 Data Science
No ratings yet
02 Data Science
23 pages
Data Processing and Management Information System (AvtoBərpaEdilmiş)
100% (1)
Data Processing and Management Information System (AvtoBərpaEdilmiş)
6 pages
Chapter 5
No ratings yet
Chapter 5
41 pages
WA0002
No ratings yet
WA0002
22 pages
Business Data Management Week 1 - Read-Only
No ratings yet
Business Data Management Week 1 - Read-Only
49 pages
Chapter-4 3
No ratings yet
Chapter-4 3
25 pages
Business Intelligence Notes
No ratings yet
Business Intelligence Notes
27 pages
M Is Business Intelligence Big Data A Nay Tics
No ratings yet
M Is Business Intelligence Big Data A Nay Tics
7 pages
CH 16 Data and Competitive Advantage
No ratings yet
CH 16 Data and Competitive Advantage
48 pages
Csit 217 M2
No ratings yet
Csit 217 M2
50 pages
Multidimensional Data Mode:-: Characteristics of Data Warehouse
100% (1)
Multidimensional Data Mode:-: Characteristics of Data Warehouse
26 pages
Data Warehousing
No ratings yet
Data Warehousing
23 pages
Comp 20333 Reviewer (Highlighted)
No ratings yet
Comp 20333 Reviewer (Highlighted)
21 pages
Concept of Big Data
No ratings yet
Concept of Big Data
29 pages
CSD 1043: Big Data Fundamentals Week1: Big Data Landscape: Definitions
No ratings yet
CSD 1043: Big Data Fundamentals Week1: Big Data Landscape: Definitions
13 pages
TIS Chapter 3
No ratings yet
TIS Chapter 3
36 pages
Data Warehouse & Data Mining
No ratings yet
Data Warehouse & Data Mining
41 pages
DBMS, Data Warehousing and Data Mining
No ratings yet
DBMS, Data Warehousing and Data Mining
31 pages
By Bi Jay Mishra
No ratings yet
By Bi Jay Mishra
685 pages
Chapter 5 Data Resource Management
100% (1)
Chapter 5 Data Resource Management
6 pages
Ccs367-Storage Technologies-Unit - I
No ratings yet
Ccs367-Storage Technologies-Unit - I
53 pages
Week 5 Database
No ratings yet
Week 5 Database
34 pages
Data Analytics-Moutran Diane
No ratings yet
Data Analytics-Moutran Diane
3 pages
Final Mis2
No ratings yet
Final Mis2
23 pages
Pathfinder Glossary Web Final
No ratings yet
Pathfinder Glossary Web Final
4 pages
Form 1 Term 1 ICT Quiz
No ratings yet
Form 1 Term 1 ICT Quiz
10 pages
Emerging Concepts & Trends in Business Analytics
No ratings yet
Emerging Concepts & Trends in Business Analytics
15 pages
Week 8 - Graded + Practice Assignment
No ratings yet
Week 8 - Graded + Practice Assignment
12 pages
Information System Assignment
No ratings yet
Information System Assignment
51 pages
Overview of Future Skills - Unit 4
No ratings yet
Overview of Future Skills - Unit 4
1 page
Chapter 5 Data Resource Management
No ratings yet
Chapter 5 Data Resource Management
39 pages
It Reviewer
No ratings yet
It Reviewer
6 pages
Basic MCQs of Computer Sciences
67% (3)
Basic MCQs of Computer Sciences
19 pages
Storage and Information Management (8 It 3)
100% (2)
Storage and Information Management (8 It 3)
11 pages
Session 9
No ratings yet
Session 9
12 pages
Data Glossary - Michael Dillon
No ratings yet
Data Glossary - Michael Dillon
11 pages
Data Mining v3
No ratings yet
Data Mining v3
54 pages
30 Artificial Intelligence Terms You Need To Know
No ratings yet
30 Artificial Intelligence Terms You Need To Know
5 pages
DW Glossery
No ratings yet
DW Glossery
13 pages
Ba Notes
No ratings yet
Ba Notes
7 pages
Summary Bisdig
No ratings yet
Summary Bisdig
36 pages
VSAM
100% (1)
VSAM
18 pages
Eserver I5 and Db2: Business Intelligence Concepts
No ratings yet
Eserver I5 and Db2: Business Intelligence Concepts
12 pages
Enterprise Applications: A Conceptual Look at ERP, CRM, and SCM
No ratings yet
Enterprise Applications: A Conceptual Look at ERP, CRM, and SCM
13 pages
Foundation of Information Management Systems-Chapter2
No ratings yet
Foundation of Information Management Systems-Chapter2
19 pages
MC0076 Q. What Do You Understand by Information Processes Data?
No ratings yet
MC0076 Q. What Do You Understand by Information Processes Data?
10 pages
Unit 1
No ratings yet
Unit 1
61 pages
Data Mining and Data Warehouse: Qis College of Engineering & Technology Ongole
No ratings yet
Data Mining and Data Warehouse: Qis College of Engineering & Technology Ongole
10 pages
Fda 1
No ratings yet
Fda 1
5 pages
This Set of Computer Fundamentals Multiple Choice Questions & Answers (MCQS) Focuses On "The Input Unit"
No ratings yet
This Set of Computer Fundamentals Multiple Choice Questions & Answers (MCQS) Focuses On "The Input Unit"
44 pages
Upgrading The Controller Hardware On A Pair of
No ratings yet
Upgrading The Controller Hardware On A Pair of
33 pages
Oracle Testking 1z0-070 v2019-03-20 by Rodrigo 56q
No ratings yet
Oracle Testking 1z0-070 v2019-03-20 by Rodrigo 56q
43 pages
HPE MSA 2060 Storage Data Sheet-PSN1012748869IEEN
No ratings yet
HPE MSA 2060 Storage Data Sheet-PSN1012748869IEEN
3 pages
How Evolution of Database Led To Data Mining
No ratings yet
How Evolution of Database Led To Data Mining
10 pages
End of Term 1 Test g8 2024
No ratings yet
End of Term 1 Test g8 2024
3 pages
I Cste 10
No ratings yet
I Cste 10
9 pages
Course Book Computer Architecture
No ratings yet
Course Book Computer Architecture
8 pages
Samsung NVMe SSD 980 PRO Data Sheet - Rev.2.1
No ratings yet
Samsung NVMe SSD 980 PRO Data Sheet - Rev.2.1
5 pages
Fundamentals of Telecommunications Engineering (TC-101) PDF
No ratings yet
Fundamentals of Telecommunications Engineering (TC-101) PDF
72 pages
Assignment 1
No ratings yet
Assignment 1
4 pages
R Man
No ratings yet
R Man
0 pages
Physical Security Perimeters 1743253021
No ratings yet
Physical Security Perimeters 1743253021
29 pages
HPE StoreEasy 1000 Storage With The HPE StoreEasy Management Console-A00041171enw
No ratings yet
HPE StoreEasy 1000 Storage With The HPE StoreEasy Management Console-A00041171enw
46 pages
SONNET 116 Lesson
No ratings yet
SONNET 116 Lesson
3 pages
Data Mining and Data Warehouse
No ratings yet
Data Mining and Data Warehouse
11 pages
E-45A, E-85A Color Controller Parts List
No ratings yet
E-45A, E-85A Color Controller Parts List
11 pages
Get On Track: Motorola Charm
No ratings yet
Get On Track: Motorola Charm
60 pages
Question No 1:: A) Describe Von Neuman Architecture?
No ratings yet
Question No 1:: A) Describe Von Neuman Architecture?
4 pages
CS401 Notes
No ratings yet
CS401 Notes
14 pages
04chapter Information Technology in Business - Hardware Effy Oz
No ratings yet
04chapter Information Technology in Business - Hardware Effy Oz
36 pages
Lesson Plan in Budgeting
No ratings yet
Lesson Plan in Budgeting
5 pages
Part Two: Supply and Demand I: How Markets Work
No ratings yet
Part Two: Supply and Demand I: How Markets Work
42 pages
RAIDXpert2 UserGuide Enu
No ratings yet
RAIDXpert2 UserGuide Enu
140 pages
Set A - Wisdom Long Quiz
No ratings yet
Set A - Wisdom Long Quiz
22 pages
mrktg01 Week2
No ratings yet
mrktg01 Week2
15 pages
AWS Essentials
No ratings yet
AWS Essentials
15 pages
Long Quiz-2
No ratings yet
Long Quiz-2
2 pages
Cashflow Statement
No ratings yet
Cashflow Statement
8 pages
Computer System Servicing
No ratings yet
Computer System Servicing
34 pages
Amor
No ratings yet
Amor
11 pages
Introduction To Computers
No ratings yet
Introduction To Computers
38 pages
FINAL Is PAPER DaguroMarcosQuemada
No ratings yet
FINAL Is PAPER DaguroMarcosQuemada
18 pages
B U S E C O 1: Theory of Individual Behavior
No ratings yet
B U S E C O 1: Theory of Individual Behavior
29 pages
Introduction To Computers Fundamentals
No ratings yet
Introduction To Computers Fundamentals
25 pages
UBPD
No ratings yet
UBPD
4 pages
12222
No ratings yet
12222
9 pages
COSMAN1 Learning Packet 1 Introductionto Cost Accounting
No ratings yet
COSMAN1 Learning Packet 1 Introductionto Cost Accounting
2 pages
ETHICS A - SYNTHESIS Integration
No ratings yet
ETHICS A - SYNTHESIS Integration
7 pages
Division (GR No. 126334, Nov 23, 2001) Emilio Emnace V. Ca
No ratings yet
Division (GR No. 126334, Nov 23, 2001) Emilio Emnace V. Ca
3 pages
Lesson Plan Budgeting 2 2
No ratings yet
Lesson Plan Budgeting 2 2
2 pages
GN4 - Backup and Disk Management - 6213016 - 01
No ratings yet
GN4 - Backup and Disk Management - 6213016 - 01
5 pages
Franchising Activity Worksheet 3
No ratings yet
Franchising Activity Worksheet 3
4 pages
Paired
No ratings yet
Paired
5 pages
ETHICS Post Module Activity#4
No ratings yet
ETHICS Post Module Activity#4
4 pages
DS2246 Disk Shelf Installation and Setup
No ratings yet
DS2246 Disk Shelf Installation and Setup
2 pages
Litmin Presentation
No ratings yet
Litmin Presentation
2 pages
BUSINESS TAX..docx - PROBLEM SET Problem 1 VAT-Exempt Transactions Determine Which Transaction(s) Is (Are) Exempt From VAT. 1. S
No ratings yet
BUSINESS TAX..docx - PROBLEM SET Problem 1 VAT-Exempt Transactions Determine Which Transaction(s) Is (Are) Exempt From VAT. 1. S
1 page

Critical Data Warehouse Trends

Uploaded by

Critical Data Warehouse Trends

Uploaded by

Critical Data

For example, weather radar is heavily reliant on the

Machine learning is a subset of artificial intelligence that

Data virtualization software aggregates structured and unstructured

Metadata management refers to the organization and control of data

In a columnar database, each column of a table is stored separately,

This is in contrast to traditional row-oriented databases, where each

Multi-cloud is the utilization of two or more public cloud

Usually the reason for the multi-cloud model is that a single

You might also like