Data Warehousing and Data Mining
hand, data warehouses store historical data for analysis and decision making. For example, if
Subway management wants to discover hidden patterns in customer spending, it needs data
from both current and past purchases. The time-variant nature of the data helps Subway
management analyse past data and relate it to current information to forecast the future.
2.4 Non-volatile data
Data from operational systems is transformed, integrated, and stored in a data warehouse for
analysis purposes, not to run day-to-day business operations. For example, when an order
comes in from a customer, the operational system, not the data warehouse, is used to check the
current status of the stock. Data in a data warehouse is usually not updated; it is read from the
warehouse for query and analysis.
3 Dimensional Modelling
Dimensional modelling is a logical design technique that derives its name from the business
dimensions it models. The model is well suited to queries and analysis. The characteristics of
dimensional modelling are:
• It provides a way to access the data.
• It is a query-centric data model.
• It shows the interactions among dimension and fact tables.
• It is flexible for drilling down and rolling up along dimension hierarchies (see the sketch below).
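As a concrete illustration of the last point, the following Python sketch uses the pandas library to join a small fact table to a dimension table and then roll up and drill down along a region/store hierarchy. All table and column names here are invented for this example:

import pandas as pd

# Hypothetical dimension table: each store belongs to a region (a two-level hierarchy).
store_dim = pd.DataFrame({
    "store_id": [1, 2, 3],
    "store_name": ["Downtown", "Airport", "Mall"],
    "region": ["East", "East", "West"],
})

# Hypothetical fact table: one row per sale, keyed to the dimension by store_id.
sales_fact = pd.DataFrame({
    "store_id": [1, 1, 2, 3, 3],
    "amount": [120.0, 80.0, 200.0, 50.0, 75.0],
})

# Join the facts to the dimension, as a star-schema query would.
sales = sales_fact.merge(store_dim, on="store_id")

# Roll up: aggregate sales from store level up to region level.
print(sales.groupby("region")["amount"].sum())

# Drill down: break the regional totals back out by store.
print(sales.groupby(["region", "store_name"])["amount"].sum())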
One of the major problems in OLAP is that the usage of information is largely unpredictable:
users cannot define their requirements clearly, and often do not even know in advance how
they would like to use or process the information. In OLTP, on the other hand, end-users
specify precise functions.
[Figure: schema diagram with a central fact table surrounded by dimension tables, labelled Dimension 1 through Dimension 4]
Figure 2.4: Snowflake schema
3.3 Star Flake Schema
The star flake schema is a hybrid schema derived from the star and snowflake schemas. The
star flake schema contains a fact table and a set of denormalised and normalised dimension
tables.
[Figure: Example star flake schema. A Sales fact table links to dimension tables including Store (PK Store_ID; Store_Name, Location, Phone_Number, Address, Fax_Number, Post_Code), Supplier (PK Supplier_ID; Supplier_Name, Address, Phone, Email, Website, Fax), Staff (PK Staff_ID; First_Name, Last_Name, Address), Customer, and a product dimension carrying brand and FK1 Category_ID.]
Criterion          OLTP                                        OLAP
Response Time      Very fast (milliseconds to seconds)         Queries can take seconds to minutes
Users              Depends on the system (thousands)           Fewer users than OLTP (hundreds)
Database Design    Entity-relationship modelling is used;      Dimensional modelling is used;
                   data is highly normalised                   data is denormalised
Space Requirement  Less space than OLAP, because data is       Much more space than OLTP, because data
                   archived regularly (100 MB to GB)           archived from the operational systems
                                                               accumulates here (100 GB to TB)
Queries            Relatively simple; retrieve few records     Complex; often need aggregations and
                                                               multidimensional views of data
Access Frequency   Very high, as the system is used for        Low, as the system is used for queries
                   day-to-day operations                       and analysis
What the Data      Information about business processes        Multidimensional views of business
Reveals                                                        activities

Table: OLTP vs. OLAP
3.4 Data Mining
The term data mining refers to analyzing large databases to find useful patterns. Like
knowledge discovery in artificial intelligence (also called machine learning) or statistical
analysis, data mining attempts to discover rules and patterns from data. However, data mining
differs from machine learning and statistics in that it deals with large volumes of data, stored
primarily on disk. That is, data mining deals with “knowledge discovery in databases.”
Some types of knowledge discovered from a database can be represented by a set of rules.
The following is an example of a rule, stated informally: “Young women with annual incomes
greater than $50,000 are the most likely people to buy small sports cars.” Of course such rules
are not universally true, and have degrees of “support” and “confidence,” as we shall see.
Other types of knowledge are represented by equations relating different variables to each
other, or by other mechanisms for predicting outcomes when the values of some variables are
known.
3.5 Applications of Data Mining
The discovered knowledge has numerous applications. The most widely used applications are
those that require some sort of prediction. For instance, when a person applies for a credit
card, the credit-card company wants to predict if the person is a good credit risk. The
prediction is to be based on known attributes of the person, such as age, income, debts, and
past debt repayment history. Rules for making the prediction are derived from the same
attributes of past and current credit card holders, along with their observed behavior, such as
whether they defaulted on their credit card dues. Other types of prediction include predicting
which customers may switch over to a competitor (these customers may be offered special
discounts to tempt them not to switch), predicting which people are likely to respond to
promotional mail (“junk mail”), or predicting what types of phone calling card usage are
likely to be fraudulent.
Another class of applications looks for associations, for instance, books that tend to be
bought together. If a customer buys a book, an online bookstore may suggest other associated
books. If a person buys a camera, the system may suggest accessories that tend to be bought
along with cameras. A good salesperson is aware of such patterns and exploits them to make
additional sales. The challenge is to automate the process. Other types of associations may
lead to discovery of causation. For instance, discovery of unexpected associations between a
newly introduced medicine and cardiac problems led to the finding that the medicine may
cause cardiac problems in some people. The medicine was then withdrawn from the market.
Associations are an example of descriptive patterns. Clusters are another example of such
patterns. For example, over a century ago a cluster of typhoid cases was found around a well,
which led to the discovery that the water in the well was contaminated and was spreading
typhoid. Detection of clusters of disease remains important even today.
3.6 Classification
Prediction is one of the most important types of data mining. We outline what classification
is, study techniques for building one type of classifier, called decision tree classifiers, and
then study other prediction techniques.
Abstractly, the classification problem is this: Given that items belong to one of several
classes, and given past instances (called training instances) of items along with the classes to
which they belong, the problem is to predict the class to which a new item belongs. The class
of the new instance is not known, so other attributes of the instance must be used to predict
the class.
Classification can be done by finding rules that partition the given data into disjoint groups.
For instance, suppose that a credit-card company wants to decide whether or not to give a
credit card to an applicant. The company has a variety of information about the person, such
as her age, educational background, annual income, and current debts that it can use for
making a decision.
Some of this information could be relevant to the creditworthiness of the applicant, whereas
some may not be. To make the decision, the company assigns a creditworthiness level of
excellent, good, average, or bad to each of a sample set of current customers according to
each customer’s payment history. Then, the company attempts to find rules that classify its
current customers into excellent, good, average, or bad, on the basis of the information about
the person, other than the actual payment history (which is unavailable for new customers).
Let us consider just two attributes: education level (highest degree earned) and income. The
rules may be of the following form:

∀ person P, P.degree = masters and P.income > 75,000 ⇒ P.credit = excellent

∀ person P, P.degree = bachelors or (P.income ≥ 25,000 and P.income ≤ 75,000) ⇒ P.credit = good
Similar rules would also be present for the other creditworthiness levels (average and bad).
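Read as code, the two rules above amount to a small classification function. The following Python sketch is illustrative only: the function name, attribute encoding, and the fall-through default are assumptions, and a real classifier would also carry rules for the average and bad levels:

# Sketch of the two example rules; "unknown" is an assumed default,
# since the text gives no rule covering the remaining cases.
def credit_level(degree: str, income: float) -> str:
    if degree == "masters" and income > 75_000:
        return "excellent"
    if degree == "bachelors" or 25_000 <= income <= 75_000:
        return "good"
    return "unknown"

print(credit_level("masters", 90_000))    # excellent
print(credit_level("bachelors", 30_000))  # good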
The process of building a classifier starts from a sample of data, called a training set. For
each tuple in the training set, the class to which the tuple belongs is already known. For
instance, the training set for a credit-card application may be the existing customers, with
their creditworthiness determined from their payment history. The actual data, or population,
may consist of all people, including those who are not existing customers.
At each internal node of the tree, we examine an attribute of the instance to find which child
to go to. The process continues till we reach a leaf node. For example, if the degree level of a
person is masters, and the person's income is 40K, starting from the root we follow the edge
labeled “masters,” and from there the edge labeled “25K to 75K,” to reach a leaf. The class at
the leaf is “good,” so we predict that the credit risk of that person is good.
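That traversal can be sketched in a few lines of Python. The tree below is hypothetical, built to match the example in the text (a degree test at the root, then an income band test); it is not taken from any figure in this document:

# Internal nodes test an attribute; leaves carry a class label.
tree = {
    "attribute": "degree",
    "children": {
        "masters": {
            "attribute": "income_band",
            "children": {
                "25K to 75K": {"label": "good"},
                "over 75K": {"label": "excellent"},
            },
        },
        "bachelors": {"label": "good"},
    },
}

def classify(node, instance):
    # Follow the edge matching the instance's attribute value until a leaf is reached.
    while "label" not in node:
        node = node["children"][instance[node["attribute"]]]
    return node["label"]

person = {"degree": "masters", "income_band": "25K to 75K"}
print(classify(tree, person))  # good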
3.8 Association Rules

Retail shops are often interested in associations between the different items that people buy.
Associated items may be placed at opposite ends of a row so that customers pass other
products as they walk from one end of the row to the other. A shop that offers discounts on
one associated item may not offer a discount on the other, since the customer will probably
buy the other anyway.
An example of an association rule is
bread ⇒ milk
In the context of grocery-store purchases, the rule says that customers who buy bread also
tend to buy milk with a high probability. An association rule must have an associated
population: the population consists of a set of instances. In the grocery-store example, the
population may consist of all grocery store purchases; each purchase is an instance. In the
case of a bookstore, the population may consist of all people who made purchases, regardless
of when they made a purchase. Each customer is an instance.
Here, the analyst has decided that when a purchase is made is not significant, whereas for the
grocery-store example, the analyst may have decided to concentrate on single purchases,
ignoring multiple visits by the same customer. Rules have an associated support, as well as an
associated confidence. These are defined in the context of the population:
• Support is a measure of what fraction of the population satisfies both the antecedent and the
consequent of the rule.
For instance, suppose only 0.001 percent of all purchases include milk and screwdrivers. The
support for the rule
milk ⇒ screwdrivers
is low. The rule may not even be statistically significant—perhaps there was only a single
purchase that included both milk and screwdrivers. Businesses are usually not interested in
rules that have low support, since they involve few customers, and are not worth bothering
about.
On the other hand, if 50 percent of all purchases involve milk and bread, then support for
rules involving bread and milk (and no other item) is relatively high, and such rules may be
worth attention. Exactly what minimum degree of support is considered desirable depends on
the application.
• Confidence is a measure of how often the consequent is true when the antecedent is true.
For instance, the rule bread ⇒ milk has a confidence of 80 percent if 80 percent of the
purchases that include bread also include milk. A rule with a low confidence is not
meaningful. In business
applications, rules usually have confidences significantly less than 100 percent, whereas in
other domains, such as in physics, rules may have high confidences.
Note that the confidence of
bread ⇒ milk
may be very different from the confidence of milk ⇒ bread, although both have the same
support.
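Both measures are straightforward to compute over a set of transactions. The Python sketch below uses made-up baskets and computes support and confidence exactly as defined above:

# Hypothetical population: each set is one purchase (one instance).
baskets = [
    {"bread", "milk"},
    {"bread", "milk", "eggs"},
    {"bread", "butter"},
    {"milk"},
    {"bread", "milk", "butter"},
]

def support(antecedent, consequent):
    # Fraction of the whole population satisfying both sides of the rule.
    both = sum(1 for b in baskets if antecedent <= b and consequent <= b)
    return both / len(baskets)

def confidence(antecedent, consequent):
    # Among instances satisfying the antecedent, the fraction also satisfying the consequent.
    with_antecedent = [b for b in baskets if antecedent <= b]
    both = sum(1 for b in with_antecedent if consequent <= b)
    return both / len(with_antecedent)

print(support({"bread"}, {"milk"}))     # 0.6
print(confidence({"bread"}, {"milk"}))  # 0.75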
3.9 Clustering
Intuitively, clustering refers to the problem of finding clusters of points in the given data. The
problem of clustering can be formalized using distance metrics in several ways. One way is to
phrase it as the problem of grouping points into k sets (for a given k) so that the average
distance of points from the centroid of their assigned cluster is minimized. Another way is to
group points so that the average distance between every pair of points in each cluster is
minimized. There are other definitions too; see the bibliographical notes for details. But the
intuition behind all these definitions is to group similar points together in a single set.
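The first of these formalizations, minimizing the average distance of points from the centroid of their assigned cluster, is what the classical k-means algorithm approximates. The following compact Python sketch uses arbitrary points and an arbitrary k, purely for illustration:

import math
import random

def kmeans(points, k, iters=20):
    # Pick k initial centroids at random from the data.
    centroids = random.sample(points, k)
    for _ in range(iters):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: math.dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        for i, cluster in enumerate(clusters):
            if cluster:
                centroids[i] = tuple(sum(coord) / len(cluster) for coord in zip(*cluster))
    return centroids, clusters

points = [(1, 1), (1.5, 2), (0.5, 1.5), (8, 8), (9, 9)]
centroids, clusters = kmeans(points, k=2)
print(centroids)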
Another type of clustering appears in classification systems in biology. (Such classification
systems do not attempt to predict classes, rather they attempt to cluster related items
together.) For instance, leopards and humans are clustered under the class mammalia, while
crocodiles and snakes are clustered under reptilia. Both mammalia and reptilia come under
the common class chordata. The clustering of mammalia has further subclusters, such as
carnivora and primates. We thus have hierarchical clustering.
The statistics community has studied clustering extensively. Database research has provided
scalable clustering algorithms that can cluster very large data sets (that may not fit in
memory). The Birch clustering algorithm is one such algorithm. Intuitively, data points are
inserted into a multidimensional tree structure (based on R-trees, described in Section
23.3.5.3), and guided to appropriate leaf nodes based on nearness to representative points in
the internal nodes of the tree. Nearby points are thus clustered together in leaf nodes, and
summarized if there are more points than fit in memory. Some post processing after insertion
of all points gives the desired overall clustering. See the bibliographical notes for references
to the Birch algorithm, and other techniques for clustering, including algorithms for
hierarchical clustering.
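For reference, one widely available implementation of Birch is in the scikit-learn library. The minimal sketch below assumes scikit-learn is installed and uses arbitrary two-dimensional points; nothing in the text above prescribes this particular library:

from sklearn.cluster import Birch

# Arbitrary points; Birch builds its tree incrementally, so data
# can also be fed in batches via the partial_fit method.
X = [[1, 1], [1.5, 2], [0.5, 1.5], [8, 8], [9, 9]]

model = Birch(n_clusters=2, threshold=0.5)
labels = model.fit_predict(X)
print(labels)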
An interesting application of clustering is to predict what new movies (or books, or music) a
person is likely to be interested in, on the basis of the person's past preferences and the
preferences of other people with similar tastes.
One approach to this problem is as follows. To find people with similar past preferences we
create clusters of people based on their preferences for movies. The accuracy of clustering can
be improved by previously clustering movies by their similarity, so even if people have not
seen the same movies, if they have seen similar movies they would be clustered together. We
can repeat the clustering, alternately clustering people, then movies, then people, and so on till
we reach an equilibrium. Given a new user, we find a cluster of users most similar to that
user, on the basis of the user’s preferences for movies already seen. We then predict movies in
movie clusters that are popular with that user’s cluster as likely to be interesting to the new
user. In fact, this problem is an instance of collaborative filtering, where users collaborate in
the task of filtering information to find information of interest.
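A toy version of this idea can be sketched in Python. The sketch below stands in for full alternating clustering with a crude nearest-neighbour lookup over made-up preference sets, so it only hints at the approach described above:

# Hypothetical preferences: user -> set of liked movies.
likes = {
    "alice": {"MovieA", "MovieB", "MovieC"},
    "bob":   {"MovieA", "MovieB", "MovieD"},
    "carol": {"MovieX", "MovieY"},
}

def recommend(new_user_likes, likes):
    # Find the existing user with the largest overlap in liked movies,
    # a crude stand-in for finding the most similar cluster of users.
    nearest = max(likes, key=lambda u: len(likes[u] & new_user_likes))
    # Suggest that user's liked movies the new user has not seen yet.
    return likes[nearest] - new_user_likes

print(recommend({"MovieA", "MovieB"}, likes))  # {'MovieC'} (alice and bob tie; max keeps the first)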