Homework

The document describes setting up a data warehouse database with dimension and fact tables linked by foreign keys and partitioned by country. It also covers loading the data from source databases into the data warehouse using SQL Server Integration Services (SSIS) data flows and creating views for analysis. Machine learning models are built in Weka on a predictive view to compare the performance of decision tree and Bayesian classifiers.

Uploaded by

fatima alhaji

- database:

///

CREATE DATABASE [2S23SalesDWH];  -- brackets are required: an identifier cannot start with a digit
USE [2S23SalesDWH];

///

- DimProd table:

///

CREATE TABLE DimProd (
    ProdID INT IDENTITY(1,1) PRIMARY KEY,
    Size VARCHAR(50),
    Category VARCHAR(50)
);

///

- DimCust table:

///

CREATE TABLE DimCust (
    CustID INT IDENTITY(1,1) PRIMARY KEY,
    Age INT,
    Gender VARCHAR(10),
    AnnualIncome DECIMAL(15,2),
    NumChildren INT
);

///
- DimAdrs table:

///

CREATE TABLE DimAdrs (
    AdrsID INT IDENTITY(1,1) PRIMARY KEY,
    CountryRegion VARCHAR(50)
);

///

- basic fact table linked with the dimension tables, with sales amount as the measure:

///

CREATE TABLE FactSale (
    SaleID INT IDENTITY(1,1) PRIMARY KEY,
    ProdID INT,
    CustID INT,
    AdrsID INT,
    SaleAmount DECIMAL(15,2),
    FOREIGN KEY (ProdID) REFERENCES DimProd (ProdID),
    FOREIGN KEY (CustID) REFERENCES DimCust (CustID),
    FOREIGN KEY (AdrsID) REFERENCES DimAdrs (AdrsID)
);

///

You might need to alter the `FactSale` table later according to part 2 instructions. Also, adjust the field
types and sizes according to your actual data.
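To illustrate what this star schema is for, here is a small Python sketch (illustrative only, with made-up rows, not part of the assignment) of the kind of query the design supports: resolving a fact row's foreign key against a dimension and aggregating the SaleAmount measure by CountryRegion:

```python
# Tiny in-memory model of the star schema: a dimension keyed by surrogate ID,
# fact rows carrying foreign keys plus the SaleAmount measure. All data is made up.
dim_adrs = {1: 'USA', 2: 'Canada', 3: 'Mexico'}  # AdrsID -> CountryRegion

fact_sale = [  # (ProdID, CustID, AdrsID, SaleAmount)
    (10, 100, 1, 250.00),
    (11, 101, 1, 120.50),
    (10, 102, 2, 300.00),
    (12, 103, 3, 80.25),
]

# Star join: look up each fact row's AdrsID, then aggregate the measure.
totals = {}
for prod_id, cust_id, adrs_id, amount in fact_sale:
    country = dim_adrs[adrs_id]
    totals[country] = totals.get(country, 0.0) + amount

print(totals)  # {'USA': 370.5, 'Canada': 300.0, 'Mexico': 80.25}
```

In SQL this is simply `SELECT ... SUM(SaleAmount) ... GROUP BY` over a join of FactSale and DimAdrs; the sketch only makes the key-lookup mechanics explicit.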
2.

- filegroups (one ADD FILEGROUP clause is allowed per ALTER DATABASE statement):

///

ALTER DATABASE [2S23SalesDWH] ADD FILEGROUP USA;
ALTER DATABASE [2S23SalesDWH] ADD FILEGROUP Canada;
ALTER DATABASE [2S23SalesDWH] ADD FILEGROUP Mexico;

///

- add files to the filegroups. Each file specification must be parenthesized, and each statement can target only one filegroup:

///

ALTER DATABASE [2S23SalesDWH]
ADD FILE (
    NAME = 'USAData',
    FILENAME = 'C:\path\USAData.ndf',
    SIZE = 5MB,
    MAXSIZE = 100MB,
    FILEGROWTH = 5MB
) TO FILEGROUP USA;

ALTER DATABASE [2S23SalesDWH]
ADD FILE (
    NAME = 'CanadaData',
    FILENAME = 'C:\path\CanadaData.ndf',
    SIZE = 5MB,
    MAXSIZE = 100MB,
    FILEGROWTH = 5MB
) TO FILEGROUP Canada;

ALTER DATABASE [2S23SalesDWH]
ADD FILE (
    NAME = 'MexicoData',
    FILENAME = 'C:\path\MexicoData.ndf',
    SIZE = 5MB,
    MAXSIZE = 100MB,
    FILEGROWTH = 5MB
) TO FILEGROUP Mexico;

///

- the partition function. Note that you need to decide on the boundaries for partitioning, and that boundary values must be listed in ascending order:

///

CREATE PARTITION FUNCTION CountryRegionPF (VARCHAR(50))
AS RANGE LEFT FOR VALUES ('Canada', 'Mexico', 'USA');

///

- the partition scheme. The filegroup order must match the boundary order, with one extra filegroup for values above the last boundary:

///

CREATE PARTITION SCHEME CountryRegionPS
AS PARTITION CountryRegionPF
TO (Canada, Mexico, USA, [PRIMARY]);

///
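As a sanity check (not part of the assignment itself), the `RANGE LEFT` mapping can be modelled in Python with `bisect`: under `RANGE LEFT`, each boundary value belongs to the partition on its left, so a value maps to the first boundary that is greater than or equal to it:

```python
from bisect import bisect_left

# Model of the CountryRegionPF / CountryRegionPS mapping above.
boundaries = ['Canada', 'Mexico', 'USA']              # ascending, as required
filegroups = ['Canada', 'Mexico', 'USA', 'PRIMARY']   # one extra for values above 'USA'

def target_filegroup(country_region: str) -> str:
    """Return the filegroup a CountryRegion value would land in."""
    return filegroups[bisect_left(boundaries, country_region)]

print(target_filegroup('Canada'))     # Canada
print(target_filegroup('USA'))        # USA
print(target_filegroup('Venezuela'))  # PRIMARY  ('Venezuela' sorts after 'USA')
```

One caveat this makes visible: any CountryRegion value that is not one of the three expected strings still lands somewhere (for example, a value sorting before 'Canada' falls into the Canada partition), so the load should validate the column.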
- recreate the `FactSale` table on the partition scheme (drop it first if it already exists). On a partitioned table the primary key must include the partitioning column, so the key here is (SaleID, CountryRegion):

///

DROP TABLE IF EXISTS FactSale;

CREATE TABLE FactSale (
    SaleID INT IDENTITY(1,1),
    ProdID INT,
    CustID INT,
    AdrsID INT,
    SaleAmount DECIMAL(15,2),
    CountryRegion VARCHAR(50) NOT NULL,
    PRIMARY KEY (SaleID, CountryRegion),
    FOREIGN KEY (ProdID) REFERENCES DimProd (ProdID),
    FOREIGN KEY (CustID) REFERENCES DimCust (CustID),
    FOREIGN KEY (AdrsID) REFERENCES DimAdrs (AdrsID)
) ON CountryRegionPS (CountryRegion);

///
3.

The first part of the task, creating the views, is done in SQL. DataFlow tasks are typically created in SQL Server Integration Services (SSIS), which is a graphical tool, so they can't be represented as code.

- the views:

///

-- Create view for product information
CREATE VIEW vw_Products AS
SELECT ProdID, Size, Category
FROM AdventureWork.Product;

-- Create view for place of sale information
CREATE VIEW vw_Address AS
SELECT AdrsID, CountryRegion
FROM AdventureWork.Address;

-- Create view for sales transaction information
CREATE VIEW vw_Sales AS
SELECT SaleID, ProdID, CustID, AdrsID, SaleAmount
FROM AdventureWork.Sales;

///
You would create the DataFlow tasks in SSIS as follows:

1. Open SQL Server Data Tools (SSDT) and create a new Integration Services Project.

2. In the Control Flow tab, drag a Data Flow Task from the Toolbox onto the design surface.

3. Double-click the Data Flow Task to go to the Data Flow tab.

4. Drag a Source Assistant from the Toolbox onto the design surface.

5. Double-click the Source Assistant and configure it to use the AdventureWork connection and select
the appropriate view.

6. Drag a Destination Assistant from the Toolbox onto the design surface.

7. Connect the Source Assistant to the Destination Assistant.

8. Double-click the Destination Assistant and configure it to use the 2S23SalesDWH connection and
select the appropriate table.

9. Repeat steps 2 to 8 for each view.

To extract the customer data from an Excel file, you would use the Excel Source in SSIS and configure it to
use the Excel connection manager and select the appropriate sheet.

If you need to do transformations on the data, you would add Transformations between the Source and
Destination. For example, you might add a Lookup Transformation to match up identifiers between the
AdventureWork database and the 2S23SalesDWH database.
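The Lookup Transformation described above can be sketched in plain Python (SSIS does this graphically; the product codes and lookup table here are hypothetical): each incoming source row is matched to a warehouse surrogate key, and unmatched rows are diverted the way the SSIS no-match output would divert them:

```python
# Hypothetical lookup table: source-system product codes -> warehouse ProdID.
lookup = {'BK-M68B': 1, 'BK-R50R': 2}

source_rows = [
    {'ProductCode': 'BK-M68B', 'SaleAmount': 250.00},
    {'ProductCode': 'BK-R50R', 'SaleAmount': 120.50},
    {'ProductCode': 'BK-XXXX', 'SaleAmount': 99.99},  # no match in the warehouse
]

matched, no_match = [], []
for row in source_rows:
    prod_id = lookup.get(row['ProductCode'])
    if prod_id is None:
        no_match.append(row)  # SSIS would route these to the no-match output
    else:
        matched.append({'ProdID': prod_id, 'SaleAmount': row['SaleAmount']})

print(len(matched), len(no_match))  # 2 1
```

In SSIS the no-match rows would typically be redirected to an error destination or trigger an insert of the missing dimension member, depending on how the Lookup is configured.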

Once your DataFlow tasks are set up, you can run the package to load the data into the 2S23SalesDWH
database.
4.

- create the view in SQL Server. Note that `BuyBike` is the class label for the model; add it to `FactSale` with an `ALTER TABLE` if it is not there yet, as anticipated in part 1:

///

-- Create view for the predictive model
CREATE VIEW vw_PredictiveModel AS
SELECT dp.Size AS ProductSize,
       da.CountryRegion AS SalesCountry,
       dc.Age AS CustomerAge,
       dc.Gender, dc.AnnualIncome, dc.NumChildren,
       fs.BuyBike
FROM FactSale fs
JOIN DimProd dp ON fs.ProdID = dp.ProdID
JOIN DimCust dc ON fs.CustID = dc.CustID
JOIN DimAdrs da ON fs.AdrsID = da.AdrsID;

///

Then create a DataFlow task in SSIS to export the data from this view to an Excel file. As previously stated, this is a graphical process and can't be represented as code. Follow the same steps as for the other DataFlow tasks, but the Source would be the vw_PredictiveModel view and the Destination would be an Excel connection manager.

The rest of the steps involve using Weka, which is a graphical tool for machine learning and data mining.
Here are the steps you would follow, although they can't be represented as code:

1. Open Weka and choose the Explorer.

2. Click Open file and select the Excel file you created (you may need to convert it to CSV first).

3. Choose the `Classify` tab.

4. Click on `Choose` and select `trees.J48` for the DecisionTree model.

5. In the `Test options` section, choose `Cross-validation` and enter `10` for the number of folds.

6. Click `Start`.
After the DecisionTree model has been built, you can view the tree by clicking on the `Visualize tree`
button.
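Step 2 above notes that the exported file may need converting before Weka can open it. Besides CSV, Weka's native format is ARFF; a minimal sketch of writing rows out as ARFF using only the Python standard library (the attribute names follow the vw_PredictiveModel view, and the data rows are made up):

```python
# Write made-up vw_PredictiveModel rows to a Weka ARFF file.
rows = [
    ('M', 'USA', 34, 'Male', 60000.0, 2, 'Yes'),
    ('L', 'Canada', 45, 'Female', 85000.0, 0, 'No'),
]

header = """@relation PredictiveModel
@attribute ProductSize {S,M,L,XL}
@attribute SalesCountry {USA,Canada,Mexico}
@attribute CustomerAge numeric
@attribute Gender {Male,Female}
@attribute AnnualIncome numeric
@attribute NumChildren numeric
@attribute BuyBike {Yes,No}
@data
"""

with open('predictive_model.arff', 'w') as f:
    f.write(header)
    for r in rows:
        f.write(','.join(str(v) for v in r) + '\n')
```

The nominal value sets in braces are assumptions about the data; adjust them to the distinct values actually present in your view before loading the file into Weka.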

To compare this with a Bayesian model:

1. Click on `Choose` again and select `bayes.NaiveBayes`.

2. Click `Start` again.

Now you can compare the results of the DecisionTree and Bayesian models by comparing the output in
the classifier output area. You would typically look at measures like accuracy, precision, recall, and the F-
measure to compare the performance of the two models.
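Weka reports these measures directly, but they can also be computed from each classifier's confusion matrix. A small sketch, using made-up counts rather than actual Weka output:

```python
def metrics(tp, fp, fn, tn):
    """Accuracy, precision, recall and F-measure from confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f_measure = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, f_measure

# Hypothetical counts for the two classifiers on the BuyBike = Yes class:
j48 = metrics(tp=80, fp=10, fn=20, tn=90)
nb = metrics(tp=70, fp=5, fn=30, tn=95)
print(j48)  # accuracy 0.85, precision ~0.889, recall 0.8, F ~0.842
```

With numbers like these, the comparison is a trade-off: the Bayesian model would have higher precision but lower recall, and the F-measure summarizes the balance between the two.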
