Installation of Pentaho Data Integration and Database Driver For Module 5
Installation of Pentaho Data Integration and Database Driver For Module 5
To install Pentaho, you should follow the steps below. It is highly recommended that you use
the community edition from SourceForge as the instructions in this document follow the
community edition interface.
The latest stable version is 5.0.1 although the latest version on SourceForge is 5.4. You
should be able to use either version to complete the tutorial and assignment although the
guided tutorial is written for the latest stable version 5.0.1.
Windows uses should copy the folder data-integration to the folder C:\Program
Files\Pentaho. Mac and Linux users (including Oracle Virtual Box) may move the file to any
folder. Note that the Oracle Virtual Box is a Linux environment.
If you are using the Oracle Database Virtual Box Appliance, you should download and
unzip the PDI zip file inside the Virtual Box. If you download the PDI zip file in Windows, you
may have difficulty making a connection to an Oracle database in the module 5 assignment.
To ensure that the installation worked, you should launch Pentaho Data Integration.
Run the file Spoon.bat by double clicking on it. You may want to create a shortcut to the
spoon.bat file so starting data integration is easier. If you get a permission error or cannot
execute the bat file, you should right click and select “Run as Administrator”. For Mac and
Linux users, run the Spoon.sh from terminal (./spoon.sh). Note that the Oracle Virtual Box is
a Linux environment.
After you launch Pentaho Data Integration, you will see the Welcome window (Figure 1)
and then the Spoon designer (Figure 2).
Exit Spoon before installing the database driver file in the next part of the instructions.
22 September 2021 Installation of Pentaho Data Integration Page 2
You need to install the JDBC driver for the specific version of the DBMS that you previously
installed.
For Oracle 12C, the JDBC driver is “ojdbc7.jar”. You can download it from the following page.
You can also find JDBC files for other Oracle server versions on this page.
http://www.oracle.com/technetwork/database/features/jdbc/index-091264.html
http://dev.mysql.com/downloads/connector/j
22 September 2021 Installation of Pentaho Data Integration Page 4
The JDBC for MySQL comes in two formats: .zip and .msi. Each format has a different
extraction method. Mac and Linux users (including Oracle Virtual Box) cannot use the
.msi format.
For the .zip file, the unzipped folder has many files. You only need to copy the mysql-
connector-java-5.1.36-bin.jar file.
For the .msi file, you should double click to extract it to the folder C:\Program
Files(x86)\MySQL\MySQL Connector J. In this folder, you will find many files and the
only needed one is mysql-connector-java-5.1.36-bin.jar.
For Windows users, after downloading the JDBC file, you should copy the file to the following
folder:
C:\Program Files\Pentaho\data-integration\lib
For Mac and Linux users (including Oracle Virtual Box), you should copy the driver to the data-
integration/lib directory.
After copying the JDBC file to the specified folder, you should restart Pentaho Data Integration.