Building Data Pipeline With Pentaho Lab Guide
Contents
Guided Demonstration: Data Source to Dashboard
Review the InputData Transformation
Review and Run the CT2000 Job
Create an Analysis Using the RenewableEnergy Model
View the CT2000 Dashboard
Resources
Guided Demonstration: Data Source to Dashboard
Introduction In this guided demonstration, you will review a Pentaho Data Integration (PDI)
transformation that obtains data about energy generation and usage around the
world, prepares the data for analytics by building a data model (cube), and
publishes the data to the repository as a data service. You will then review a PDI
job that runs the transformation and publishes the cube to the repository so it
can be used for analytics. Finally, you will use Analyzer to analyze and visualize
the data.
Objectives After completing this guided demonstration, you will be able to:
• Review the InputData transformation
• Review and run the CT2000 job
• Create an analysis using the RenewableEnergy model
• View the CT2000 dashboard
Note The transformation and job reviewed in this demonstration use a sampling of
PDI steps and job entries. The steps and job entries used in production vary
depending on the incoming data and the business objectives.
Review the InputData Transformation
Transformations are used to describe the data flows for Extract, Transform, and Load (ETL) processes, such as reading from a source, transforming data, and loading it into a target location. Each “step” in a transformation applies specific logic to the data flowing through the transformation. The steps are connected with “hops” that define the pathways the data follows through the transformation. The data flowing through the transformation is referred to as the “stream.”
The InputData transformation receives data from a Microsoft Excel file containing data about energy
generation and usage around the world. It then fine-tunes the data, creates a data model (OLAP cube),
and publishes the data to the repository as a Pentaho Data Service.
The Microsoft Excel Input step reads data from one or more Excel or OpenOffice files. In this example, the Excel file contains data about energy generation and usage by country for the
years 2000-2015.
3. To preview the data, click Preview Rows, and then click OK.
4. To close the preview, click Close, and then to close the step dialog, click OK.
The Select Values step is useful for selecting, removing, and renaming fields, changing their data types, and configuring the length and precision of the fields in the stream. In this example, the fields are reordered, and the
Technology field is replicated four times to create the Tech1, Tech2, Tech3, and Tech4 fields. You will
see the purpose of those fields later in this demonstration.
The Modified Java Script Value step provides an expression-based user interface for building JavaScript expressions, and it allows you to create multiple scripts within a single step. The Technology field from the spreadsheet contains the specific type of energy (for example, Renewable Municipal Waste). Since the specific energy sources can be grouped into higher-level categories, the expressions in this step assign each energy source to a set of categories, creating a hierarchy that will be used in the OLAP cube.
For example, the Technology “Renewable Municipal Waste” is expanded into the following four fields:
Tech1: Total Renewable Energy
Tech2: Bioenergy
Tech3: Solid Biofuels
Tech4: Renewable Municipal Waste
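The expressions themselves run as JavaScript inside the Modified Java Script Value step, but the underlying idea can be sketched in Java for illustration. Only the Renewable Municipal Waste mapping comes from this demonstration; the fallback behavior is an assumption:

```java
// Illustrative sketch only: the real logic runs as JavaScript expressions inside
// the Modified Java Script Value step. Apart from the Renewable Municipal Waste
// example above, the fallback behavior here is an assumption.
public class TechnologyHierarchy {

    /** Returns the four hierarchy levels {Tech1, Tech2, Tech3, Tech4} for a Technology value. */
    static String[] categorize(String technology) {
        if ("Renewable Municipal Waste".equals(technology)) {
            return new String[] {"Total Renewable Energy", "Bioenergy", "Solid Biofuels", technology};
        }
        // Fallback (assumed): repeat the technology at every level of the hierarchy.
        return new String[] {technology, technology, technology, technology};
    }

    public static void main(String[] args) {
        // Prints: Total Renewable Energy > Bioenergy > Solid Biofuels > Renewable Municipal Waste
        System.out.println(String.join(" > ", categorize("Renewable Municipal Waste")));
    }
}
```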
The Filter Rows step filters rows based on conditions and comparisons. The rows are then directed
based on whether the filter evaluates to ‘true’ or ‘false.’ In this example, the previous JavaScript step
results in some redundant data, so those rows are filtered out of the stream.
The Sort rows step sorts rows based on the fields you specify and on whether they should be sorted in
ascending or descending order.
1. Double-click the Sort rows step, and then review the configuration.
The Row Denormaliser step allows you to denormalize data by looking up key-value pairs. It also allows you to immediately convert data types. In this example, the Indicator field is used to denormalize the rows and create two additional fields: Total Generated GWh and Total Capacity MW.
2. To close the step dialog, click OK, and then click Close.
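To make the key-value pivot performed by the Row Denormaliser step more concrete, here is a minimal Java sketch of the same idea. The country name, indicator labels, and figures are placeholder assumptions, not values from the demonstration spreadsheet, and the real step is configured in its dialog rather than coded:

```java
import java.util.*;

// Sketch of the key-value pivot idea behind the Row Denormaliser step.
// Country, indicator labels, and figures are placeholder assumptions.
public class DenormaliseSketch {

    record InputRow(String country, String indicator, double value) {}

    public static void main(String[] args) {
        List<InputRow> rows = List.of(
            new InputRow("CountryA", "Total Generated GWh", 1000.0),
            new InputRow("CountryA", "Total Capacity MW", 250.0));

        // Group rows by country, turning each indicator/value pair into its own column.
        Map<String, Map<String, Double>> pivoted = new LinkedHashMap<>();
        for (InputRow r : rows) {
            pivoted.computeIfAbsent(r.country(), k -> new LinkedHashMap<>())
                   .put(r.indicator(), r.value());
        }

        // One output row per country with Total Generated GWh and Total Capacity MW columns.
        pivoted.forEach((country, columns) -> System.out.println(country + " -> " + columns));
    }
}
```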
The second Filter Rows step removes rows with Total Capacity MW of zero.
The Annotate Stream step helps you refine your data for the Streamlined Data Refinery by creating measures, link dimensions, or attributes on the stream fields you specify. In this example, Total Generated GWh and Total Capacity MW are defined as measures, and the remaining fields are defined as dimensions within hierarchies for the location and the technologies. The Annotate Stream step modifies the default model produced by the Build Model job entry. You will review the Build Model job entry later in this demonstration.
Prototyping a data model can be time consuming, particularly when it involves setting up databases, creating the data model, setting up a data warehouse, and then negotiating access so that analysts can
visualize the data and provide feedback. One way to streamline this process is to make the output of a
transformation step a Pentaho Data Service. The output of the transformation step is exposed by the
data service so that the output data can be queried as if it were stored in a physical table, even though
the results of the transformation are not stored in a physical database. Instead, results are published to
the Pentaho Server as a virtual table. The results of this transformation are being used to create a
Pentaho Data Service called DataServiceCT2000.
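As a rough sketch of what this enables, a client application could query the DataServiceCT2000 virtual table over JDBC. The driver class name, connection URL, credentials, and SQL identifier quoting below are typical defaults and assumptions to verify against your own Pentaho installation:

```java
import java.sql.*;

// Sketch: querying the DataServiceCT2000 virtual table through Pentaho's thin JDBC
// driver. The driver class name, URL format, credentials, and SQL quoting are
// assumptions to verify against your environment.
public class QueryDataService {
    public static void main(String[] args) throws Exception {
        Class.forName("org.pentaho.di.trans.dataservice.jdbc.ThinDriver"); // assumed driver class
        String url = "jdbc:pdi://localhost:8080/kettle?webappname=pentaho-di"; // assumed URL format

        try (Connection conn = DriverManager.getConnection(url, "admin", "password");
             Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery(
                 "SELECT Country, SUM(\"Total Generated GWh\") AS total_gwh "
               + "FROM DataServiceCT2000 GROUP BY Country")) {
            while (rs.next()) {
                System.out.println(rs.getString("Country") + ": " + rs.getDouble("total_gwh"));
            }
        }
    }
}
```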
Review and Run the CT2000 Job
Jobs are used to coordinate ETL activities, such as defining the flow and dependencies that determine the order in which transformations run, or preparing for execution by checking various conditions, such as ensuring that a source file is available.
The CT2000 job executes the InputData transformation, builds the data model (cube) based on the
Annotate Stream step, and then publishes the model to the repository. After the job runs, the data
service and model are available for reporting, analysis, and dashboarding.
The Build Model job entry creates Data Source Wizard (DSW) data models. In this example, the
RenewableEnergy model is created from the DataServiceCT2000 data service based on the annotations
defined in the Annotate Stream step.
The Publish Model job entry allows you to publish the data model created with the Build Model job
entry so it is available for use on the Pentaho Server.
Notice the green checkmarks indicating that each job entry successfully completed.
Create an Analysis Using the RenewableEnergy Model
4. To add Total Generated (GWh) to the Measures, double-click Total Generated (GWh).
5. To add Continent to the Rows, double-click Continent.
12. To return to the table, on the toolbar, click the Switch to table format icon.
13. To close the analysis, on the Analysis Report tab, click the X, and then click Yes. (It is not
necessary to save this analysis.)
View the CT2000 Dashboard
The CT2000 dashboard was created with CTools, using the RenewableEnergy data model and the
DataServiceCT2000 data service to provide an interactive dashboard that allows users to explore the
data from various perspectives.
To view the CT2000 dashboard:
1. From the Home Perspective, click Browse Files.
2. In the Folders panel, navigate to the Public>CT2000>dashboards folder.
3. In the Files panel, double-click the CDE sample file.
Resources
https://www.hitachivantara.com
https://www.hitachivantara.com/en-us/solutions/data-analytics.html
https://www.hitachivantara.com/en-us/products/big-data-integration-analytics/pentaho-data-integration.html
https://www.hitachivantara.com/en-us/products/big-data-integration-analytics/pentaho-business-analytics.html
Training
https://www.hitachivantara.com/en-us/services/training-certification/training/pentaho.html
CTools