Pentaho Documentation

Prepare Your Linux and Mac Environment for Installation

Create the Pentaho User

Create a pentaho user account that has administrative privileges.  You will use this account to complete the rest of the installation instructions.

  1. Create an administrative user on the computer that will host the DI Server, and name it pentaho.
  2. Verify that you have the appropriate permissions to read, write, and execute commands in the pentaho user's home directory.

Create Linux Directory Structure

  1. Log into the machine on which you will run the DI Server. Make sure that you are logged in as the pentaho user.
  2. Create these directories in the pentaho user's home directory.
    <your home directory>/pentaho/server/data-integration-server
    <your home directory>/.pentaho
    
  3. Verify that you have the appropriate permissions to read, write, and execute commands in the directories you created.
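
The two steps above can be sketched in shell as follows. This is a minimal illustration, not part of the official procedure: PENTAHO_USER_HOME is a hypothetical override introduced for this example, and it falls back to the home directory of the current user.

```shell
#!/bin/sh
# Sketch: create the directory layout from step 2 and confirm access (step 3).
# PENTAHO_USER_HOME is a hypothetical override; it defaults to $HOME.
base="${PENTAHO_USER_HOME:-$HOME}"

# -p creates missing parent directories and is harmless if they already exist.
mkdir -p "$base/pentaho/server/data-integration-server" "$base/.pentaho"

# Report whether the current user can read, write, and enter each directory.
for dir in "$base/pentaho/server/data-integration-server" "$base/.pentaho"; do
    if [ -r "$dir" ] && [ -w "$dir" ] && [ -x "$dir" ]; then
        echo "OK: $dir"
    else
        echo "Check permissions: $dir" >&2
    fi
done
```

Run it while logged in as the pentaho user so that the directories are created with that user as owner.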

Install the Web Application Server

The DI Server can be deployed on either the Tomcat or JBoss web application server. By default, DI Server software is configured for Tomcat. This means that if you choose to use Tomcat, you will need to make fewer configuration changes than you would if you choose to use JBoss.

You must install the web application server yourself. If you already have a Tomcat or JBoss web application server installed and you want to deploy the DI Server on it, please skip this step.

  1. To download and install the web application software, use the instructions in the documentation for the web application server of your choice. We recommend that you install the web application server in the pentaho/server/data-integration-server directory.
  2. Verify the web application server is installed correctly by starting it and viewing the default page. If the web application server does not start, troubleshoot it using the web application server's documentation before you continue with the DI Server installation process.
  3. Stop the web application server.

Install the DI Repository Host Database

The DI Repository houses data needed for Pentaho tools to provide scheduling and security functions. It also stores transformations and jobs. You can host the DI Repository on any of these databases.

  • PostgreSQL
  • MySQL
  • Oracle
  • MS SQL Server

To install the DI Repository's host database, do these things.

  1. Check the Supported Technologies section to determine which versions of databases Pentaho supports.
  2. Download and install the database of your choice. 
  3. Verify that the database is installed correctly.

Install Java 

Install a supported version of Java.

  1. Check the Supported Technologies list to see which version of Java Pentaho supports.
  2. Download the supported version of the JRE or JDK from the Oracle site and install it. 
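
After installing Java, a quick check such as the one below can confirm that a runtime is reachable from the shell. It is only a sketch: it reports whatever java is first on the PATH, and you still need to compare the reported version with the Supported Technologies list yourself. The function name check_java is made up for this example.

```shell
#!/bin/sh
# Sketch: report which Java runtime, if any, is first on the PATH.
# check_java is a hypothetical helper name used only in this example.
check_java() {
    if command -v java >/dev/null 2>&1; then
        # java -version writes to stderr, so redirect it to stdout.
        java -version 2>&1
    else
        echo "java not found on PATH"
        return 1
    fi
}

check_java || echo "Install a supported JRE or JDK before continuing."
```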

Download and Unpack Installation Files

The Pentaho DI Server software, data files, and examples are stored in pre-packaged .zip files. You will need to manually copy these files to the correct directories.

  1. Download the following installation and plug-in files from the Pentaho Customer Support Portal in the archive build folder.
    • DI Server Installation File: pdi-manual-ee-<current version>-dist.zip
  2. Unzip the DI Server Installation file.
  3. To unpack the file, run install.sh. The IzPack window appears.

If you are unpacking the file in a non-graphical environment, open a Terminal or Command Prompt window, type java -jar installer.jar -console, and follow the instructions presented in the window.

  4. Read the license agreement, select I accept the terms of this license agreement, and click Next.
  5. In the Select the installation path text box, enter the place where you want to create the pentaho directory, then click Next.
  6. A message indicating that a target directory will be created appears. Click OK.
  7. When the installation is complete, click Quit.

Put Files in Directories

  1. Navigate to the pentaho directory where you unpacked the files, unzip the zip files, and place their contents in the appropriate directories listed below.
    File                        Unzip the Contents of the File to This Directory
    di-license-installer.zip    pentaho/server
    di-pentaho-data.zip         pentaho/server/data-integration-server
    di-pentaho-solutions.zip    pentaho/server/data-integration-server
  2. Copy these files to the following directories.
    File                                   Copy Files to This Directory
    pentaho-di.war                         • Tomcat: pentaho/server/data-integration-server/<your tomcat installation directory>/webapps
                                           • JBoss: pentaho/server/data-integration-server/<your jboss installation directory>/standalone/deployments
    pentaho-style.war                      • Tomcat: pentaho/server/data-integration-server/<your tomcat installation directory>/webapps
                                           • JBoss: pentaho/server/data-integration-server/<your jboss installation directory>/standalone/deployments
    PentahoBIPlatform_OSS_Licenses.html    pentaho/server/data-integration-server
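
The copy step for the two WAR files could be scripted along these lines. This is a sketch under assumptions: deploy_wars and its two arguments are names invented for this example, and you must supply the actual deploy directory of your own Tomcat or JBoss installation.

```shell
#!/bin/sh
# Sketch: copy both WAR files into the web application server's deploy directory.
# deploy_wars and its arguments are hypothetical names chosen for this example:
#   $1  directory where the installer unpacked pentaho-di.war and pentaho-style.war
#   $2  deploy directory: <tomcat>/webapps or <jboss>/standalone/deployments
deploy_wars() {
    src="$1"
    dest="$2"
    if [ ! -d "$dest" ]; then
        echo "Deploy directory not found: $dest" >&2
        return 1
    fi
    for war in pentaho-di.war pentaho-style.war; do
        # Stop at the first file that cannot be copied.
        cp "$src/$war" "$dest/" || return 1
    done
}

# Example call for Tomcat; replace the placeholder with your own directory name:
# deploy_wars "$HOME/pentaho" "$HOME/pentaho/server/data-integration-server/<your tomcat installation directory>/webapps"
```

The function refuses to run when the deploy directory does not exist, which guards against copying the files into a mistyped path.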

Verify Directory Structure

Verify that the files are in the correct locations by comparing the following directory structure with yours.

If your web application server is not in the pentaho/server/data-integration-server directory, the pentaho-di.war and pentaho-style.war files should appear where you've chosen to install your web application server.

Tomcat File Locations:

  • <your home directory>/.pentaho
  • pentaho/server/license-installer
  • pentaho/server/license-installer/lib
  • pentaho/server/data-integration-server/<your tomcat installation directory>/webapps/pentaho-di.war
  • pentaho/server/data-integration-server/<your tomcat installation directory>/webapps/pentaho-style.war
  • pentaho/server/data-integration-server/data
  • pentaho/server/data-integration-server/pentaho-solutions

JBoss File Locations:

  • <your home directory>/.pentaho
  • pentaho/server/license-installer
  • pentaho/server/license-installer/lib
  • pentaho/server/data-integration-server/<your jboss installation directory>/standalone/deployments/pentaho-di.war
  • pentaho/server/data-integration-server/<your jboss installation directory>/standalone/deployments/pentaho-style.war
  • pentaho/server/data-integration-server/data
  • pentaho/server/data-integration-server/pentaho-solutions
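
The comparison can also be done with a small script. The sketch below assumes the pentaho directory sits directly under the pentaho user's home directory, as in the directory structure created earlier. verify_layout and its arguments are hypothetical names for this example.

```shell
#!/bin/sh
# Sketch: report which of the expected paths exist.
# verify_layout and its arguments are hypothetical names for this example:
#   $1  home directory of the pentaho user
#   $2  deploy directory that should contain the two WAR files
verify_layout() {
    home_dir="$1"
    deploy_dir="$2"
    missing=0
    for path in \
        "$home_dir/.pentaho" \
        "$home_dir/pentaho/server/license-installer" \
        "$home_dir/pentaho/server/license-installer/lib" \
        "$deploy_dir/pentaho-di.war" \
        "$deploy_dir/pentaho-style.war" \
        "$home_dir/pentaho/server/data-integration-server/data" \
        "$home_dir/pentaho/server/data-integration-server/pentaho-solutions"
    do
        if [ -e "$path" ]; then
            echo "found:   $path"
        else
            echo "MISSING: $path"
            missing=$((missing + 1))
        fi
    done
    # Succeed only when nothing is missing.
    [ "$missing" -eq 0 ]
}
```

Pass the webapps directory for Tomcat or the standalone/deployments directory for JBoss as the second argument.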

Set Environment Variables

Set the PENTAHO_JAVA_HOME and PENTAHO_INSTALLED_LICENSE_PATH environment variables. If you do not set these variables, Pentaho will not start correctly.

If you are using a JRE, set the JRE_HOME environment variable as well.

  1. Set the path of the PENTAHO_JAVA_HOME variable to the path of your Java installation, like this.
    export PENTAHO_JAVA_HOME=/usr/lib/jvm/java-7-sun
    
  2. Set the path of the PENTAHO_INSTALLED_LICENSE_PATH variable to the path of the installed licenses, like this.
    export PENTAHO_INSTALLED_LICENSE_PATH=/home/pentaho/.pentaho/.installedLicenses.xml
    
  3. Log out and log back in, then verify that the variables have been set properly.
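
One way to perform the verification in step 3 is sketched below. It assumes the export lines were added to a login profile such as ~/.profile so that they survive logging out. The function name check_pentaho_env is invented for this example. The license file itself is not checked, because it may not exist until licenses are installed.

```shell
#!/bin/sh
# Sketch: confirm that both variables are set after logging back in.
# check_pentaho_env is a hypothetical helper name used only in this example.
check_pentaho_env() {
    status=0
    if [ -z "$PENTAHO_JAVA_HOME" ]; then
        echo "PENTAHO_JAVA_HOME is not set"
        status=1
    elif [ ! -d "$PENTAHO_JAVA_HOME" ]; then
        echo "PENTAHO_JAVA_HOME is not a directory: $PENTAHO_JAVA_HOME"
        status=1
    fi
    if [ -z "$PENTAHO_INSTALLED_LICENSE_PATH" ]; then
        echo "PENTAHO_INSTALLED_LICENSE_PATH is not set"
        status=1
    fi
    return "$status"
}

if check_pentaho_env; then
    echo "Both variables are set."
fi
```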

Advanced Linux and Mac Topics

Complete the instructions in this section only if you have a headless node or if you plan to install on Mac OS.

Prepare a Headless Linux or Solaris Server

There are two headless server scenarios that require special procedures on Linux and Solaris systems. One is for a system that has no video card; the other is for a system that has a video card, but does not have an X server installed. In some situations, particularly if your server doesn't have a video card, you will have to perform both procedures to properly generate reports with the DI Server.

Systems without video cards

The java.awt.headless option enables systems without video output and/or human input hardware to execute operations that require them. To set this application server option when the DI Server starts, you will need to modify the startup scripts for either the DI Server or your Java application server. You do not need to do this now, but you will near the end of these instructions when you perform the Start DI Server step. For now, add the following item to the list of CATALINA_OPTS parameters: -Djava.awt.headless=true.

The entire line should look something like this:

export CATALINA_OPTS="-Djava.awt.headless=true -Xms4096m -Xmx6144m -XX:MaxPermSize=256m -Dsun.rmi.dgc.client.gcInterval=3600000 -Dsun.rmi.dgc.server.gcInterval=3600000"

If you intend to create a DI Server service control script, you must add this parameter to that script's CATALINA_OPTS line.
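
If you script the change rather than editing the line by hand, a guard like the one below avoids adding the parameter twice. This is an illustrative sketch only: with_headless is a hypothetical helper name, and where you place the logic depends on which startup or service script you maintain.

```shell
#!/bin/sh
# Sketch: add -Djava.awt.headless=true to an options string unless it is already there.
# with_headless is a hypothetical helper name used only in this example.
with_headless() {
    opts="$1"
    case " $opts " in
        *" -Djava.awt.headless=true "*)
            # Already present: return the options unchanged.
            printf '%s\n' "$opts"
            ;;
        *)
            # Prepend the flag, adding a separating space only if opts is non-empty.
            printf '%s\n' "-Djava.awt.headless=true${opts:+ $opts}"
            ;;
    esac
}

CATALINA_OPTS=$(with_headless "$CATALINA_OPTS")
export CATALINA_OPTS
echo "CATALINA_OPTS=$CATALINA_OPTS"
```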

Note: If you do not have an X server installed, you must also follow the instructions below.

Systems without X11

To generate charts, the Pentaho Reporting engine requires functionality found in X11. If you are unwilling or unable to install an X server, you can install the xvfb package instead. xvfb provides X11 framebuffer emulation, which performs all graphical operations in memory instead of sending them to the screen.

Use your operating system's package manager to properly install xvfb.
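
Once the package is installed, a check like the following can confirm that the Xvfb binary is available. It is a sketch under assumptions: the package name varies by distribution, the display number :1 and the screen geometry are arbitrary choices for this example, and xvfb_hint is a made-up helper name. How the DI Server should be pointed at the virtual display is not covered here.

```shell
#!/bin/sh
# Sketch: check for the Xvfb binary and print commands that would start it.
# xvfb_hint is a hypothetical helper; display :1 and the geometry are arbitrary.
xvfb_hint() {
    if command -v Xvfb >/dev/null 2>&1; then
        echo "Xvfb found. To use it, start a virtual display and point DISPLAY at it:"
        echo "  Xvfb :1 -screen 0 1024x768x24 &"
        echo "  export DISPLAY=:1"
    else
        echo "Xvfb not found. Install the xvfb package with your package manager."
        return 1
    fi
}

xvfb_hint || echo "Re-run this check after installing the package."
```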

Adjust Amount of Memory Mac OS Allocates for PostgreSQL

If you plan to install the software on Mac OS and you choose to use PostgreSQL, you need to increase the amount of memory that Mac OS allocates for PostgreSQL. You can skip these instructions if you plan to install the software on Windows or Linux.

PostgreSQL is the name of the default database that contains audit, schedule and other data that you create.  PostgreSQL starts successfully only if your computer has allocated enough memory. Go to http://www.postgresql.org/docs/devel/static/kernel-resources.html and follow the instructions there on how to adjust the memory settings on your computer.