Overview of the different ways to install Pentaho Data Integration
There are several methods to install Pentaho Data Integration (DI). This article helps you make decisions about which method is best for you. If you want to upgrade your current version of the DI components to the newest version, see Upgrade BA and DI Systems.
There are no prerequisites for this section.
No special expertise is required to understand the concepts presented in this article.
About DI Components
This table lists the components and plugins that come with DI.
|Component Type|| |
|Server||Data Integration Server||The DI Server is a core component that executes data integration jobs and transformations using the Pentaho Data Integration Engine. It allows you to manage users and roles (default security) or integrate security to your existing security provider such as LDAP or Active Directory. The DI Server provides the ability to centrally store and manage your data integration jobs and transformations. It also provides the services allowing you to schedule and monitor scheduled activities, which are accessed from Spoon.|
|Design Tool||Spoon||Graphical design environment for modeling Jobs and Transformations. Transformations are metadata models describing the flow of data. Jobs are workflow-like models for coordinating resources, execution and dependencies of data integration activities. It also provides a user interface to the DI Server for managing security and schedules.|
|Design Tool||Kitchen||Command line tool for executing jobs modeled in Spoon. You can use Kitchen to run jobs at a specific time. For instance, you can use Kitchen to write a cron job that will generate a daily sales report at 3:00 a.m. each morning.|
|Design Tool||Pan||Command line tool for executing transformations modeled in Spoon. This is similar to Kitchen, but where Kitchen executes jobs, Pan executes transformations.|
|Design Tool||Carte||Carte is a web server that enables remote execution of jobs and transformations.|
|Plugin||Instaview||Instaview is software that allows you to use templates to manage the complexities of data access and preparation. Instaview automatically generates transform and metadata models, executes them, and allows you to visualize the results.|
|Plugin||Agile BI||Provides a set of tools that allows developers to collaborate on BI project prototyping. It provides Modeling and Visualization perspectives.|
Additionally there are plugins that are developed by the community and therefore, are not documented here.
Decide Where to Install Components
If you are evaluating DI components, we strongly recommend that you install all components on one computer. This enables you to take full advantage of the tutorials and samples we provide to help you evaluate and learn about our products and tools. If you plan to use DI components for development or production, consider distributing components across server and workstation computers. One popular configuration is to install the server on one computer and the design tool on client workstations. DI Plugins that are needed at design and runtime should be installed on the server and on client workstations.
Select Installation Method
To determine the best installation method for you, answer several questions.
Why do you want to install DI components? Do you want to evaluate it or develop solutions? Do you want to run it in a production environment?
Do you want to house the DI Repository on the PostgreSQL database that Pentaho supplies? Or would you rather house the DI Repository on another database that you supply such as Oracle or MySQL? The DI Repository stores information related to transformations, jobs, and security. The DI Repository also stores information related to transformations and jobs, such as database connection information and schedules. PDI is configured so that the DI Repository runs on PostgreSQL. The database that houses the DI Repository is different than the database that houses your data.
As you consider these questions, review the Installation Decision Table to determine the best installation method for you.
|Explore Considerations||Choose the Best Option For You|
|Install All DI Components||Install with Your Own DI Repository||Use DI Manual Installation||Install Only DI Tools and Plugins|
|Summary||This method installs all DI components. It uses an installation wizard and is the quickest and easiest installation option, and includes everything you need to run Pentaho DI.||This method installs all DI components. This manual method requires that you download and install Pentaho installation files. You can choose the DI Repository database that you want to use, but you download and install it yourself. Consider using this method if your company has a standard database platform with which you must comply.||This method installs all DI components. The manual method requires that you download and install Pentaho installation files. You can choose your web application server and DI Repository database, but you must download and install them yourself. Consider using this method if your company has a standard database platform or web application server with which you must comply.||This method installs the DI Design Tools and Plugins only. Use this method if you want to install the design tools only, not the DI Server. This method is often used to install design tools and plugins on client workstations.|
|Installed Components||This method installs these things. ||This method installs these things. ||This method installs these things. ||This method installs these things. |
|You Supply||You need to supply a computer that meets Pentaho's operating system and hardware requirements.||You need to supply these things. Each item must meet or exceed the requirements in the supported technologies matrix. ||You need to supply these things. Each item must meet or exceed the requirements in the supported technologies matrix. ||You need to supply a computer that meets Pentaho's operating system and hardware requirements.|
|We Supply||We supply these things. ||We supply these things. ||We supply the Installation Package.||We supply the Installation Package.|
|Technologies Used||These technologies are used in this installation method. ||These technologies are used in this installation method. ||These technologies are used in this installation method. ||N/A|
|Expertise||You need this knowledge to perform this type of installation. ||You need this knowledge to perform this type of installation. ||You need this knowledge to perform this type of installation. ||N/A|
|Time||Approximately 30 minutes.||Approximately 60 - 90 minutes.||Approximately 2 - 3 hours.||Approximately 5 - 10 minutes.|
|Availability||Download this software from the Pentaho website or the support portal.||Download this software from the Support portal.||Download this software from the Pentaho Support Portal.||Download this software from the Support portal.|
|Recommendation||Use for the Pentaho Trial Download or if you want to Evaluate DI.||Suitable for Rapid Development and Production use.||Suitable for Rapid Development and Production use.||Suitable for Rapid Development and Production use.|