Pentaho Documentation

Select Data Integration (DI) Installation Options

There are several methods to install Pentaho Data Integration (DI). This article helps you decide which method is best for you. If you want to upgrade your current DI components to the newest version instead, see Upgrade BA and DI Systems.

Prerequisites

There are no prerequisites for this section.

No special expertise is required to understand the concepts presented in this article.

About DI Components

This table lists the components and plugins that come with DI.

Table 1. DI Components Table

Component Type | Name | Description
Server | Data Integration Server | The DI Server is a core component that executes data integration jobs and transformations using the Pentaho Data Integration Engine. It allows you to manage users and roles (default security) or integrate security with your existing security provider, such as LDAP or Active Directory. The DI Server lets you centrally store and manage your data integration jobs and transformations. It also provides the services for scheduling activities and monitoring scheduled activities, which are accessed from Spoon.
Design Tool | Spoon | Graphical design environment for modeling jobs and transformations. Transformations are metadata models describing the flow of data. Jobs are workflow-like models for coordinating resources, execution, and dependencies of data integration activities. Spoon also provides a user interface to the DI Server for managing security and schedules.
Design Tool | Kitchen | Command line tool for executing jobs modeled in Spoon. You can use Kitchen to run jobs at a specific time. For instance, you can call Kitchen from a cron job that generates a daily sales report at 3:00 a.m.
Design Tool | Pan | Command line tool for executing transformations modeled in Spoon. Pan is similar to Kitchen, but where Kitchen executes jobs, Pan executes transformations.
Design Tool | Carte | Web server that enables remote execution of jobs and transformations.
Plugin | Agile BI | Provides a set of tools that allows developers to collaborate on BI project prototyping. It provides Modeling and Visualization perspectives.

Additionally, there are plugins that are developed by the community and are therefore not documented here.
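
The Kitchen scheduling example in Table 1 can be sketched in a few lines of Python. This is only an illustration: the installation directory, the job file name, and the helper functions kitchen_command and cron_line are hypothetical, and the sketch assumes a Linux installation where Kitchen is started with kitchen.sh and its -file and -level options.

```python
import shlex

# Hypothetical locations; adjust them to your own installation and job files.
PDI_HOME = "/opt/pentaho/data-integration"
JOB_FILE = "/opt/etl/daily_sales_report.kjb"


def kitchen_command(job_file, pdi_home=PDI_HOME, log_level="Basic"):
    """Return the shell command that runs one job file with Kitchen."""
    parts = [f"{pdi_home}/kitchen.sh", f"-file={job_file}", f"-level={log_level}"]
    # Quote each part so that paths containing spaces survive the shell.
    return " ".join(shlex.quote(part) for part in parts)


def cron_line(command, hour, minute=0):
    """Return a crontab entry that runs `command` every day at hour:minute."""
    if not (0 <= hour <= 23 and 0 <= minute <= 59):
        raise ValueError("hour must be 0-23 and minute must be 0-59")
    # crontab field order: minute, hour, day of month, month, day of week
    return f"{minute} {hour} * * * {command}"


if __name__ == "__main__":
    # Print a crontab entry for a daily 3:00 a.m. run of the sample job.
    print(cron_line(kitchen_command(JOB_FILE), hour=3))
```

The printed line can be added to a crontab with crontab -e. A transformation could be scheduled the same way by substituting pan.sh and a .ktr file, since Pan runs transformations where Kitchen runs jobs.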

Decide Where to Install Components

If you are evaluating DI components, we strongly recommend that you install all components on one computer. This enables you to take full advantage of the tutorials and samples we provide to help you evaluate and learn about our products and tools. If you plan to use DI components for development or production, consider distributing components across server and workstation computers. One popular configuration is to install the server on one computer and the design tools on client workstations. DI plugins that are needed at both design time and run time should be installed on the server and on the client workstations.

Select Installation Method

To determine the best installation method for you, answer the following questions.

  • Why do you want to install DI components? Do you want to evaluate them or develop solutions? Do you want to run them in a production environment?

  • Do you want to house the Pentaho Repository on the PostgreSQL database that Pentaho supplies? Or would you rather house the Pentaho Repository on another database that you supply, such as Oracle or MySQL? The Pentaho Repository stores information related to transformations, jobs, and security, including database connection information and schedules. By default, PDI is configured so that the Pentaho Repository runs on PostgreSQL. The database that houses the Pentaho Repository is different from the database that houses your data.

As you consider these questions, review the Installation Decision Table to determine the best installation method for you.

Table 2. DI Component Installation Decision Table
Explore the considerations and choose the best option for you. The four installation methods are Install All DI Components, Install with Your Own Pentaho Repository, Use DI Manual Installation, and Install Only DI Tools and Plugins.
Summary
  • Install All DI Components: We recommend using this installation method for evaluation purposes only. This method installs all DI components. It uses an installation wizard, is the quickest and easiest installation option, and includes everything you need to run Pentaho DI.
  • Install with Your Own Pentaho Repository: This method installs all DI components. This manual method requires that you download and install Pentaho installation files. You can choose the Pentaho Repository database that you want to use, but you must download and install it yourself. Consider using this method if your company has a standard database platform with which you must comply.
  • Use DI Manual Installation: This method installs all DI components. This manual method requires that you download and install Pentaho installation files. You can choose your web application server and Pentaho Repository database, but you must download and install them yourself. Consider using this method if your company has a standard database platform or web application server with which you must comply.
  • Install Only DI Tools and Plugins: This method installs the DI Design Tools and Plugins only. Use this method if you want to install the design tools only, not the DI Server. This method is often used to install design tools and plugins on client workstations.
Installed Components
  • Install All DI Components:
      • Servers: DI Server
      • DI Design Tools, Plugins, and Utilities: Spoon, Kitchen, Pan, Carte
  • Install with Your Own Pentaho Repository:
      • Servers: DI Server
      • DI Design Tools, Plugins, and Utilities: Spoon, Kitchen, Pan, Carte
  • Use DI Manual Installation:
      • Servers: DI Server
      • DI Design Tools, Plugins, and Utilities: Spoon, Kitchen, Pan, Carte
  • Install Only DI Tools and Plugins:
      • DI Design Tools, Plugins, and Utilities: Spoon, Kitchen, Pan, Carte
You Supply
  • Install All DI Components: A computer that meets Pentaho's operating system and hardware requirements.
  • Install with Your Own Pentaho Repository: The following items, each of which must meet or exceed the requirements in the supported technologies matrix.
      • Computer with a supported operating system and hardware configuration.
      • Oracle Java Runtime Environment (JRE) or Oracle Java Development Kit (JDK).
      • Pentaho Repository Database (PostgreSQL, MySQL, MS SQL Server, or Oracle).
      • Pentaho Repository Database JDBC Drivers (PostgreSQL, MySQL, MS SQL Server, or Oracle).
  • Use DI Manual Installation: The following items, each of which must meet or exceed the requirements in the supported technologies matrix.
      • Computer with a supported operating system and hardware configuration.
      • Oracle Java Runtime Environment (JRE) or Oracle Java Development Kit (JDK).
      • Pentaho Repository Database (PostgreSQL, MySQL, MS SQL Server, or Oracle). You can also use a preexisting Pentaho Repository platform if it is compatible with the supported technologies matrix.
      • Pentaho Repository Database JDBC Drivers (PostgreSQL, MySQL, MS SQL Server, or Oracle).
      • Web Application Server (JBoss or Tomcat). You can also use a preexisting application server if it is compatible with the supported technologies matrix.
  • Install Only DI Tools and Plugins: A computer that meets Pentaho's operating system and hardware requirements.
We Supply
  • Install All DI Components:
      • Installation Package
      • Oracle JRE
      • Repository Platform (PostgreSQL)
      • Repository Platform JDBC Driver
      • Web Application Server (Tomcat)
  • Install with Your Own Pentaho Repository:
      • Installation Package
      • Web Application Server (Tomcat)
  • Use DI Manual Installation:
      • Installation Package
  • Install Only DI Tools and Plugins:
      • Installation Package
Technologies Used
  • Install All DI Components:
      • Tomcat
      • PostgreSQL
  • Install with Your Own Pentaho Repository:
      • Tomcat
      • PostgreSQL, MySQL, MS SQL Server, or Oracle
  • Use DI Manual Installation:
      • Tomcat or JBoss
      • PostgreSQL, MySQL, MS SQL Server, or Oracle
  • Install Only DI Tools and Plugins: N/A
Expertise
  • Install All DI Components:
      • Basic computer knowledge.
  • Install with Your Own Pentaho Repository:
      • Knowledge of your networking environment, including IP addresses and database port numbers if they differ from the defaults.
      • Permission to access installation directories.
      • Root or Administrative access.
  • Use DI Manual Installation:
      • Knowledge of your networking environment, including IP addresses and database port numbers if they differ from the defaults.
      • Permission to access installation directories.
      • Root or Administrative access.
  • Install Only DI Tools and Plugins: N/A
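Time
  • Install All DI Components: Approximately 30 minutes.
  • Install with Your Own Pentaho Repository: Approximately 60 to 90 minutes.
  • Use DI Manual Installation: Approximately 2 to 3 hours.
  • Install Only DI Tools and Plugins: Approximately 5 to 10 minutes.
Availability
  • Install All DI Components: Download this software from the Pentaho website or the Support Portal.
  • Install with Your Own Pentaho Repository: Download this software from the Support Portal.
  • Use DI Manual Installation: Download this software from the Support Portal.
  • Install Only DI Tools and Plugins: Download this software from the Support Portal.
Recommendation
  • Install All DI Components: Use for the Pentaho Trial Download or if you want to evaluate DI.
  • Install with Your Own Pentaho Repository: Suitable for rapid development and production use.
  • Use DI Manual Installation: Suitable for rapid development and production use.
  • Install Only DI Tools and Plugins: Suitable for rapid development and production use.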
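
One way to read the decision table is as a short series of checks on your answers to the questions above. The following sketch is only an illustration of that reading, not an official selection procedure: the Answers class, its field names, and the order of the checks are assumptions made for this example. The method names come from Table 2.

```python
from dataclasses import dataclass


@dataclass
class Answers:
    """Hypothetical summary of the answers to the questions in this article."""
    evaluating: bool          # True for evaluation or trial use only
    needs_di_server: bool     # False when only the design tools are wanted
    own_repository_db: bool   # you supply the Pentaho Repository database
    own_web_app_server: bool  # you supply the web application server


def choose_installation_method(a: Answers) -> str:
    """Return the name of the Table 2 method that matches the answers."""
    if not a.needs_di_server:
        # Client workstations that only need Spoon, Kitchen, Pan, and Carte.
        return "Install Only DI Tools and Plugins"
    if a.own_web_app_server:
        # Only the manual installation lets you supply the application server.
        return "Use DI Manual Installation"
    if a.evaluating and not a.own_repository_db:
        # The wizard-based method is recommended for evaluation only.
        return "Install All DI Components"
    # Development or production use, or a repository database you supply.
    return "Install with Your Own Pentaho Repository"


if __name__ == "__main__":
    trial = Answers(evaluating=True, needs_di_server=True,
                    own_repository_db=False, own_web_app_server=False)
    print(choose_installation_method(trial))
```

The checks run from the most restrictive requirement to the least, so a need for your own web application server takes precedence over the choice of repository database.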

Next Steps

Now that you are familiar with the different installation methods, you are ready to install the DI components.

If you need installation help, contact Pentaho support.