BI - Lab Manual

The document outlines a series of assignments focused on importing legacy data, performing ETL processes, and creating OLAP models using tools like Power BI and Microsoft Excel. It introduces key concepts of Business Intelligence (BI), including data extraction, transformation, loading, and the creation of OLAP cubes. The assignments aim to provide practical experience with data integration and analysis techniques using various data sources and tools.


Title of the Assignment :

Import the legacy data from different sources (such as Excel, SQL Server, Oracle, etc.)
and load it into the target system. (You can download sample databases such as
AdventureWorks, Northwind, FoodMart, etc.)
Objective of the Assignment :
To introduce the concepts and components of Business Intelligence (BI)
Prerequisite:
1. Basics of dataset extensions.
2. Concept of data import.

Theory :
Legacy Data :
Legacy data, according to BusinessDictionary, is "information maintained in an old or
out-of-date format or computer system that is consequently challenging to access or handle."
Sources of Legacy Data
Where does legacy data come from? Virtually everywhere. Figure 1 indicates that there
are many sources from which you may obtain legacy data. These include existing
databases, often relational, but also non-relational databases such as hierarchical, network,
object, XML, object/relational, and NoSQL databases. Files, such as XML documents
or "flat files" like configuration files and comma-delimited text files, are also
common sources of legacy data. Software, including legacy applications that have been
wrapped (perhaps via CORBA) and legacy services such as web services or CICS
transactions, can also provide access to existing information. The point is that
there is often far more to gaining access to legacy data than simply writing an SQL query
against an existing relational database.

How to import legacy data step by step.


Step 1: Open Power BI.

Step 2: Click on Get Data; the list of available sources is displayed → select Excel.

Step 3: Select the required file and click Open; the Navigator screen appears.
Step 4: Select the file and click Edit.
Step 5: The Power Query Editor appears.
Step 6: Again, go to Get Data and select OData feed.

Step 7: Paste the URL http://services.odata.org/V3/Northwind/Northwind.svc/ and click OK.
Step 8: Select the Orders table and click Edit to view the table.
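The same Orders data can also be pulled programmatically. Below is a minimal Python sketch, assuming the requests and pandas packages are available, that requests the Orders entity set from the public Northwind OData demo service used above and loads it into a DataFrame:

import requests
import pandas as pd

# Public Northwind OData demo service used in the steps above
BASE_URL = "http://services.odata.org/V3/Northwind/Northwind.svc"

# Ask the service for JSON instead of the default Atom/XML payload
response = requests.get(BASE_URL + "/Orders?$format=json", timeout=30)
response.raise_for_status()
data = response.json()

# OData V3 JSON responses usually wrap the rows in a top-level "value" array;
# older "verbose" responses use a "d"/"results" wrapper, so handle both.
rows = data.get("value") or data.get("d", {}).get("results", [])
orders = pd.DataFrame(rows)

print(orders.shape)
print(orders[["OrderID", "CustomerID", "OrderDate"]].head())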

Conclusion : In this way, we imported the legacy datasets using the Power BI tool.
Group No: 1
Assignment No: 2

Title of the Assignment :-

Perform the Extraction, Transformation and Loading (ETL) process to construct the
database in SQL Server.

Objective of the Assignment :


To introduce the concepts and components of Business Intelligence (BI)

Prerequisite:
1. Basics of dataset extensions.
2. Concept of data import.

Theory:
Extraction Transformation and Loading (ETL) :-

ETL, which stands for extract, transform and load, is a data integration process that combines
data from multiple data sources into a single, consistent data store that is loaded into a data
warehouse or other target system.

As databases grew in popularity in the 1970s, ETL was introduced as a process for
integrating and loading data for computation and analysis, eventually becoming the primary
method of processing data for data warehousing projects.

ETL provides the foundation for data analytics and machine learning workstreams. Through a
series of business rules, ETL cleanses and organizes data in a way that addresses specific
business intelligence needs, like monthly reporting, but it can also tackle more advanced
analytics, which can improve back-end processes or end-user experiences. ETL is often used by
an organization to:
 Extract data from legacy systems
 Cleanse the data to improve data quality and establish consistency
 Load data into a target database

How ETL works


The easiest way to understand how ETL works is to understand what happens in each step of the
process.

Extract

During data extraction, raw data is copied or exported from source locations to a staging area.
Data management teams can extract data from a variety of data sources, which can be structured
or unstructured. Those sources include but are not limited to:

 SQL or NoSQL servers


 CRM and ERP systems
 Flat files
 Email
 Web pages

The benefits and challenges of ETL


ETL solutions improve data quality by performing data cleansing prior to loading the data into a
different repository. A time-consuming batch operation, ETL is recommended more often for
creating smaller target data repositories that require less frequent updating, while other data
integration methods—including ELT (extract, load, transform), change data capture (CDC), and
data virtualization—are used to integrate increasingly larger volumes of changing data or
real-time data streams.

What is the ETL process?


To put it simply, the ETL process involves extracting and compiling raw
data, transforming it to make it intelligible, and loading it into a target system, such as a database
or data warehouse, for easy access and analysis. ETL, short for Extract, Transform, Load, is an
important component in the data ecosystem of any modern business. The ETL process is what
helps break down data silos and makes data easier to access for decision-makers.

Since data coming from multiple sources has different schemas, every dataset must be
transformed differently before it can be used for BI and analytics. For instance, if you are
compiling data from source systems like SQL Server and Google Analytics, these two sources
will need to be treated individually throughout the ETL process. The importance of this process
has increased since big data analysis has become a necessary part of every organization.
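As an illustration of the three stages, here is a minimal Python sketch of a hand-written ETL job, assuming pandas, SQLAlchemy, and a SQL Server ODBC driver are installed; the file name, column names, and connection string are placeholders for your own environment:

import pandas as pd
from sqlalchemy import create_engine

# --- Extract: copy raw data from a source (a flat CSV file here) into a staging DataFrame
raw = pd.read_csv("sales_export.csv")          # hypothetical source file

# --- Transform: cleanse and reshape the data with simple business rules
raw = raw.drop_duplicates()
raw["order_date"] = pd.to_datetime(raw["order_date"], errors="coerce")
raw = raw.dropna(subset=["order_id", "order_date"])
raw["amount"] = raw["amount"].round(2)

# --- Load: write the cleansed data into a SQL Server target table
# (placeholder connection string; adjust server, database and driver for your setup)
engine = create_engine(
    "mssql+pyodbc://user:password@SERVER/TargetDB?driver=ODBC+Driver+17+for+SQL+Server"
)
raw.to_sql("fact_sales", engine, if_exists="append", index=False)
print(f"Loaded {len(raw)} rows into fact_sales")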

ETL tools :-
In the past, organizations wrote their own ETL code. There are now many open source and
commercial ETL tools and cloud services to choose from. Typical capabilities of these products
include the following:

 Comprehensive automation and ease of use: Leading ETL tools automate the entire data
flow, from data sources to the target data warehouse. Many tools recommend rules for
extracting, transforming and loading the data.

 A visual, drag-and-drop interface: This functionality can be used for specifying rules
and data flows.

 Support for complex data management: This includes assistance with complex
calculations, data integrations, and string manipulations.

 Security and compliance: The best ETL tools encrypt data both in motion and at rest and
are certified compliant with industry or government regulations, like HIPAA and GDPR.

Conclusion :-
In this way, we performed the Extraction, Transformation and Loading (ETL) process to
construct the database in SQL Server.
Group No: 1
Assignment No: 3

Title of the Assignment :-

Create a cube with suitable dimension and fact tables based on the OLAP, MOLAP and HOLAP
models.

Objective of the Assignment :

To introduce the concepts and components of the OLAP, MOLAP and HOLAP models.

Prerequisite:
1. Basics of dataset extensions.
2. Concept of data import.

Theory:

What is OLAP?

OLAP was introduced into the business intelligence (BI) space over 20 years ago, at a time when
computer hardware and software technology weren't nearly as powerful as they are today. OLAP
introduced a way for business users (typically analysts) to easily perform multidimensional
analysis of large volumes of business data.

Aggregating, grouping, and joining data are the most difficult types of queries for a relational
database to process. The magic behind OLAP derives from its ability to pre-calculate and pre-
aggregate data. Otherwise, end users would be spending most of their time waiting for query results
to be returned by the database. However, it is also what causes OLAP-based solutions to be
extremely rigid and IT-intensive.
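To make pre-aggregation concrete, here is a small, hypothetical pandas sketch: the detailed fact rows are aggregated once up front, and later queries are answered from the much smaller summary table instead of re-scanning the raw data (the table and column names are illustrative):

import pandas as pd

# Hypothetical detailed fact data: one row per individual sale
sales = pd.DataFrame({
    "region":  ["East", "East", "West", "West", "West"],
    "product": ["Bike", "Helmet", "Bike", "Bike", "Helmet"],
    "amount":  [1200.0, 45.0, 1150.0, 1300.0, 50.0],
})

# Pre-aggregate once, the way an OLAP cube pre-computes its cells
cube = sales.groupby(["region", "product"], as_index=False)["amount"].sum()

# Later "queries" read the small summary table, not the raw fact rows
print(cube[cube["region"] == "West"])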

Limitations of OLAP cubes

 OLAP requires restructuring of data into a star/snowflake schema

 There is a limited number of dimensions (fields) a single OLAP cube can hold

 It is nearly impossible to access transactional data in the OLAP cube

 Changes to an OLAP cube require a full update of the cube – a lengthy process

What is ROLAP?

ROLAP stands for Relational Online Analytical Processing. ROLAP stores data in columns and
rows (also known as relational tables) and retrieves the information on demand through user-
submitted queries. A ROLAP database can be accessed through complex SQL queries to
calculate information. ROLAP can handle large data volumes, but the larger the data, the slower
the processing times.

Because queries are made on demand, ROLAP does not require the storage and pre-computation
of information. However, the disadvantage of ROLAP implementations is the potential
performance constraints and scalability limitations that result from large and inefficient join
operations between large tables. Examples of popular ROLAP products include Metacube by
Stanford Technology Group, Red Brick Warehouse by Red Brick Systems, and AXSYS Suite by
Information Advantage.
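Because this assignment asks for a cube built on dimension and fact tables, here is a minimal, hypothetical sketch of a star schema and a ROLAP-style aggregation query, using Python's built-in sqlite3 module as a stand-in for SQL Server; all table and column names are illustrative:

import sqlite3

conn = sqlite3.connect(":memory:")  # stand-in for the target database
cur = conn.cursor()

# Dimension tables describe the "by what" of the analysis
cur.execute("CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, product_name TEXT)")
cur.execute("CREATE TABLE dim_region  (region_id  INTEGER PRIMARY KEY, region_name  TEXT)")

# Fact table holds the measures, keyed by the dimensions
cur.execute("""CREATE TABLE fact_sales (
    product_id INTEGER REFERENCES dim_product(product_id),
    region_id  INTEGER REFERENCES dim_region(region_id),
    amount     REAL)""")

cur.executemany("INSERT INTO dim_product VALUES (?, ?)", [(1, "Bike"), (2, "Helmet")])
cur.executemany("INSERT INTO dim_region  VALUES (?, ?)", [(1, "East"), (2, "West")])
cur.executemany("INSERT INTO fact_sales  VALUES (?, ?, ?)",
                [(1, 1, 1200.0), (2, 1, 45.0), (1, 2, 1150.0), (1, 2, 1300.0)])

# A ROLAP-style query: aggregate the fact table across the dimensions on demand
for row in cur.execute("""
    SELECT r.region_name, p.product_name, SUM(f.amount) AS total_sales
    FROM fact_sales f
    JOIN dim_product p ON p.product_id = f.product_id
    JOIN dim_region  r ON r.region_id  = f.region_id
    GROUP BY r.region_name, p.product_name"""):
    print(row)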

What is MOLAP?

MOLAP stands for Multidimensional Online Analytical Processing. MOLAP uses a
multidimensional cube that accesses stored data through various combinations. Data is pre-
computed, pre-summarized, and stored (a difference from ROLAP, where queries are served on
demand).
A multicube approach has proved successful in MOLAP products. In this approach, a series of
dense, small, precalculated cubes make up a hypercube. Tools that incorporate MOLAP include
Oracle Essbase, IBM Cognos, and Apache Kylin.

Its simple interface makes MOLAP easy to use, even for inexperienced users. Its speedy data
retrieval makes it the best for “slicing and dicing” operations. One major disadvantage of MOLAP is
that it is less scalable than ROLAP, as it can handle a limited amount of data.

What is HOLAP?
HOLAP stands for Hybrid Online Analytical Processing. As the name suggests, the HOLAP
storage mode connects attributes of both MOLAP and ROLAP. Since HOLAP involves storing
part of your data in a ROLAP store and another part in a MOLAP store, developers get the
benefits of both.

With this use of the two OLAPs, the data is stored in both multidimensional databases and
relational databases. The decision to access one of the databases depends on which is most
appropriate for the requested processing application or type. This setup allows much more
flexibility for handling data: summary-level, aggregated data is typically kept in the
multidimensional store, while detailed, transaction-level data is kept in the relational store.

Microsoft Analysis Services and SAP AG BI Accelerator are products that run off HOLAP.

Sisense and Elasticubes


Similar to OLAP-based solutions, Sisense is Business Intelligence software designed to enable
solutions where multiple business users perform ad-hoc data analysis on a centralized data
repository. However, Sisense does not achieve this by pre-calculating query results, but
rather by utilizing state-of-the-art technology called ElastiCube, a sophisticated columnar
database that was specifically designed for Business Intelligence solutions. Its unique storage
and memory-processing technology radically changes the way business intelligence solutions
access data.

Powered by ElastiCube, Sisense delivers distinct advantages over OLAP-based


solutions:

 Instant query response times, without pre-calculation or pre-aggregation of data

 Creation of complicated star/snowflake schemas is not required
 A data warehouse is not required, but is easily supported
 There are no physical limits to the number of dimensions an ElastiCube can hold
 ElastiCube provides access to data at any granularity (not merely to aggregated data)
 Changes to ElastiCubes can be made without re-building the entire data model
 An ElastiCube requires significantly less powerful hardware than a similar OLAP cube.

Conclusion :-

In this way, we created a cube with suitable dimension and fact tables based on the OLAP, MOLAP and HOLAP models.


Group No: 1
Assignment No: 4

Title of the Assignment :-

Import the data warehouse data into Microsoft Excel and create a Pivot Table and Pivot Chart.

Objective of the Assignment :

To introduce the concepts of importing data warehouse data into Microsoft Excel and creating a
Pivot Table and Pivot Chart.

Prerequisite:
1. Basics of dataset extensions.
2. Concept of data import.

Theory:

Import Data into Excel, and Create a Data Model

This is the first tutorial in a series designed to get you acquainted and comfortable using
Excel and its built-in data mash-up and analysis features. These tutorials build and
refine an Excel workbook from scratch, build a data model, then create amazing
interactive reports using Power View. The tutorials are designed to demonstrate
Microsoft Business Intelligence features and capabilities in Excel, PivotTables,
Power Pivot, and Power View.

In these tutorials you learn how to import and explore data in Excel, build and refine
a data model using Power Pivot, and create interactive reports with Power View that
you can publish, protect, and share.

This tutorial series uses data describing Olympic Medals, hosting countries, and
various Olympic sporting events. We suggest you go through each tutorial in order.
The tutorials use Excel 2013 with Power Pivot enabled.

Import data from a database

We start this tutorial with a blank workbook. The goal in this section is to connect to
an external data source, and import that data into Excel for further analysis.

Let’s start by downloading some data from the Internet. The data describes Olympic
Medals, and is a Microsoft Access database.
1. Download the four sample files used during this tutorial series (including the
OlympicMedals.accdb Access database and the OlympicSports.xlsx workbook referenced
below) to a location that's easily accessible, such as Downloads or My Documents, or
to a new folder you create.

2. In Excel 2013, open a blank workbook.


3. Click DATA > Get External Data > From Access. The ribbon adjusts
dynamically based on the width of your workbook, so the commands on
your ribbon may look slightly different from the following screens. The
first screen shows the ribbon when a workbook is wide, the second image
shows a workbook that has been resized to take up only a portion of the
screen.
4. Select the OlympicMedals.accdb file you downloaded and click Open. The
following Select Table window appears, displaying the tables found in the
database. Tables in a database are similar to worksheets or tables in Excel.
Check the Enable selection of multiple tables box, and select all the
tables. Then click OK.

5. The Import Data window appears.

Select the PivotTable Report option, which imports the tables into Excel
and prepares a PivotTable for analyzing the imported tables, and click OK.
6. Once the data is imported, a PivotTable is created using the imported tables.

With the data imported into Excel, and the Data Model automatically created, you’re
ready to explore the data.

Explore data using a PivotTable

Exploring imported data is easy using a PivotTable. In a PivotTable, you drag fields
(similar to columns in Excel) from tables (like the tables you just imported from the
Access database) into different areas of the PivotTable to adjust how it presents your
data. A PivotTable has four areas: FILTERS, COLUMNS, ROWS, and VALUES.
It might take some experimenting to determine which area a field should be dragged
to. You can drag as many or few fields from your tables as you like, until the
PivotTable presents your data how you want to see it. Feel free to explore by
dragging fields into different areas of the PivotTable; the underlying data is not
affected when you arrange fields in a PivotTable.

Let’s explore the Olympic Medals data in the PivotTable, starting with Olympic
medalists organized by discipline, medal type, and the athlete’s country or region.
1. In PivotTable Fields, expand the Medals table by clicking the arrow
beside it. Find the NOC_CountryRegion field in the
expanded Medals table, and drag it to the COLUMNS area. NOC stands
for National Olympic Committees, which is the organizational unit for a
country or region.
2. Next, from the Disciplines table, drag Discipline to the ROWS area.
3. Let’s filter Disciplines to display only five sports: Archery, Diving,
Fencing, Figure Skating, and Speed Skating. You can do this from within
the PivotTable Fields area, or from the Row Labels filter in the PivotTable
itself.

Click anywhere in the PivotTable to ensure the Excel PivotTable is selected. In

the PivotTable Fields list, where the Disciplines table is expanded, hover over its
Discipline field and a dropdown arrow appears to the right of the field. Click the
dropdown, click (Select All) to remove all selections, then scroll down and select
Archery, Diving, Fencing, Figure Skating, and Speed Skating. Click OK.

Or, in the Row Labels section of the PivotTable, click the dropdown next to Row
Labels in the PivotTable, click (Select All) to remove all selections, then scroll
down and select Archery, Diving, Fencing, Figure Skating, and Speed Skating.
Click OK.

In PivotTable Fields, from the Medals table, drag Medal to the VALUES area.
Since Values must be numeric, Excel automatically changes Medal to Count of
Medal.

From the Medals table, select Medal again and drag it into the FILTERS area.
Let’s filter the PivotTable to display only those countries or regions with more than
90 total medals. Here’s how.

a. In the PivotTable, click the dropdown to the right of Column Labels.


b. Select Value Filters and select Greater Than…. Type 90 in the last field (on the right), then click OK.

Your PivotTable looks like the following screen.

With little effort, you now have a basic PivotTable that includes fields from three
different tables. What made this task so simple were the pre-existing relationships
among the tables. Because table relationships existed in the source database, and
because you imported all the tables in a single operation, Excel could recreate those
table relationships in its Data Model.
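For readers who want to see the same idea in code, here is a small, hypothetical pandas sketch of the equivalent pivot: two toy tables are merged on a shared key, mirroring the Data Model relationship, and then pivoted to produce a count of medals by discipline and country. The data and key names are illustrative, not the actual Olympic dataset.

import pandas as pd

# Toy stand-ins for the Medals and Disciplines tables (columns mirror the tutorial's fields)
medals = pd.DataFrame({
    "NOC_CountryRegion": ["USA", "USA", "FRA", "FRA", "KOR"],
    "DisciplineKey":     [1, 2, 1, 2, 2],
    "Medal":             ["Gold", "Silver", "Bronze", "Gold", "Gold"],
})
disciplines = pd.DataFrame({
    "DisciplineKey": [1, 2],
    "Discipline":    ["Archery", "Fencing"],
})

# The merge plays the role of the table relationship in Excel's Data Model
joined = medals.merge(disciplines, on="DisciplineKey")

# Count of Medal by Discipline (rows) and country/region (columns), like the PivotTable
pivot = joined.pivot_table(index="Discipline",
                           columns="NOC_CountryRegion",
                           values="Medal",
                           aggfunc="count",
                           fill_value=0)

# On the full dataset, the "more than 90 total medals" filter would be, for example:
# pivot.loc[:, pivot.sum() > 90]
print(pivot)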

But what if your data originates from different sources, or is imported at a later time?
Typically, you can create relationships with new data based on matching columns. In
the next step, you import additional tables, and learn how to create new relationships.
Import data from a spreadsheet

Now let’s import data from another source, this time from an existing workbook,
then specify the relationships between our existing data and the new data.
Relationships let you analyze collections of data in Excel, and create interesting and
immersive visualizations from the data you import.

Let’s start by creating a blank worksheet, then import data from an Excel workbook.

1. Insert a new Excel worksheet, and name it Sports.


2. Browse to the folder that contains the downloaded sample data files, and
open OlympicSports.xlsx.
3. Select and copy the data in Sheet1. If you select a cell with data, such as
cell A1, you can press Ctrl + A to select all adjacent data. Close the
OlympicSports.xlsx workbook.
4. On the Sports worksheet, place your cursor in cell A1 and paste the data.
5. With the data still highlighted, press Ctrl + T to format the data as a table.
You can also format the data as a table from the ribbon by selecting HOME
> Format as Table. Since the data has headers, select My table has
headers in the Create Table window that appears, as shown here.

Formatting the data as a table has many advantages. You can assign a name
to a table, which makes it easy to identify. You can also establish
relationships between tables, enabling exploration and analysis in
PivotTables, Power Pivot, and Power View.
6. Name the table. In TABLE TOOLS > DESIGN > Properties, locate
the Table Name field and type Sports. The workbook looks like the
following screen.

7. Save the workbook.

Import data using copy and paste

Now that we’ve imported data from an Excel workbook, let’s import data from a
table we find on a web page, or any other source from which we can copy and paste
into Excel. In the following steps, you add the Olympic host cities from a table.

1. Insert a new Excel worksheet, and name it Hosts.


2. Select and copy the following table, including the table headers.

Conclusion :-

In this way, we imported the data warehouse data into Microsoft Excel and created a Pivot Table and Pivot Chart.
Group No: 1
Assignment No: 5

Title of the Assignment :-

Perform data classification using a classification algorithm, or perform data clustering using a
clustering algorithm.

Objective of the Assignment :

To introduce the concepts of data classification using a classification algorithm, or data
clustering using a clustering algorithm.

Prerequisite:
1. Basics of dataset extensions.
2. Concept of data import.

Theory:

K-Means Clustering Algorithm


K-Means Clustering is an unsupervised learning algorithm that is used to
solve clustering problems in machine learning or data science. In this
topic, we will learn what the K-means clustering algorithm is, how the
algorithm works, along with a Python implementation of K-means
clustering.
What is K-Means Algorithm?
K-Means Clustering is an Unsupervised Learning algorithm, which groups
an unlabeled dataset into different clusters. Here K defines the number of
pre-defined clusters that need to be created in the process: if K=2, there
will be two clusters, for K=3 there will be three clusters, and so on.
It is an iterative algorithm that divides the unlabeled dataset into K different
clusters in such a way that each data point belongs to only one group of
points with similar properties.
It allows us to cluster the data into different groups and is a convenient way
to discover the categories of groups in an unlabeled dataset on its own,
without the need for any training.
It is a centroid-based algorithm, where each cluster is associated with a
centroid. The main aim of this algorithm is to minimize the sum of
distances between the data points and their corresponding cluster centroids.

How does the K-Means Algorithm Work?


The working of the K-Means algorithm is explained in the steps below:
Step-1: Select the number K to decide the number of clusters.
Step-2: Select K random points as the initial centroids (they may be points
other than those in the input dataset).
Step-3: Assign each data point to its closest centroid, which will form
the predefined K clusters.
Step-4: Calculate the variance and place a new centroid for each cluster.
Step-5: Repeat the third step, i.e., reassign each data point to the new
closest centroid of its cluster.
Step-6: If any reassignment occurs, then go to Step-4; else go to FINISH.
Step-7: The model is ready.
Let's understand the above steps with a simple example: suppose we have
two variables, M1 and M2, plotted on an x-y scatter plot, as in the sketch
that follows.
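As an illustration (the original manual's plot is not reproduced here), the following scikit-learn sketch generates two-variable data in the role of M1 and M2, runs K-Means with K=3, and plots the resulting clusters and centroids; it assumes scikit-learn and matplotlib are installed:

import matplotlib.pyplot as plt
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Generate toy data with two features (playing the role of M1 and M2)
X, _ = make_blobs(n_samples=300, centers=3, n_features=2, random_state=42)

# Fit K-Means with K=3 clusters
kmeans = KMeans(n_clusters=3, n_init=10, random_state=42)
labels = kmeans.fit_predict(X)

# Scatter plot of the points coloured by cluster, with the final centroids marked
plt.scatter(X[:, 0], X[:, 1], c=labels, s=20)
plt.scatter(kmeans.cluster_centers_[:, 0], kmeans.cluster_centers_[:, 1],
            c="red", marker="x", s=100, label="centroids")
plt.xlabel("M1")
plt.ylabel("M2")
plt.legend()
plt.show()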

KMeans Clustering for Classification


Clustering as a method of finding subgroups within observations is used
widely in applications like market segmentation wherein we try and find
some structure in the data. Although an unsupervised machine learning
technique, the clusters can be used as features in a supervised machine
learning model.
KMeans is a clustering algorithm which divides observations into k clusters.
Since we can dictate the number of clusters, it can easily be used in
classification, where we divide data into clusters which can be equal to or
more than the number of classes.

I'll be using the MNIST digits dataset that comes with scikit-learn, which is a
collection of labelled handwritten digits, and will use KMeans to find clusters
within the dataset and test how good they are as features.

I have created a class named clust for this purpose which, when initialized,
takes in a scikit-learn dataset and divides it into train and test datasets.

The function KMeans applies KMeans clustering to the train data, with the
number of classes as the number of clusters to be made, and creates labels
for both the train and test data. The parameter output controls how we want
to use these new labels: 'add' will add the labels as a feature in the dataset,
and 'replace' will use the labels instead of the train and test data to train
our classification model. A sketch of what such a class might look like is
given below.
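The class itself is not included in the manual, so the following is only a hedged reconstruction of what clust might look like, based on the description above: it uses scikit-learn's digits dataset, a train/test split, and an output parameter accepting 'add' or 'replace':

import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split

class clust:
    """Splits a scikit-learn dataset and derives KMeans cluster labels as features."""

    def __init__(self, dataset, test_size=0.3, random_state=42):
        self.X_train, self.X_test, self.y_train, self.y_test = train_test_split(
            dataset.data, dataset.target,
            test_size=test_size, random_state=random_state)
        self.n_classes = len(np.unique(dataset.target))

    def KMeans(self, output="add"):
        # One cluster per class, fitted on the training data only
        km = KMeans(n_clusters=self.n_classes, n_init=10, random_state=42)
        train_labels = km.fit_predict(self.X_train).reshape(-1, 1)
        test_labels = km.predict(self.X_test).reshape(-1, 1)

        if output == "add":
            # Append the cluster label as an extra feature column
            return (np.hstack([self.X_train, train_labels]),
                    np.hstack([self.X_test, test_labels]))
        elif output == "replace":
            # Use the cluster label alone in place of the original features
            return train_labels, test_labels
        raise ValueError("output must be 'add' or 'replace'")

# Usage: derive cluster-based features from the digits dataset
c = clust(load_digits())
Xtr, Xte = c.KMeans(output="add")
print(Xtr.shape, Xte.shape)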

Conclusion :-
In this way, we performed data clustering using a clustering algorithm.
