Skip to main content
Pentaho Documentation

Big Data

The PDI transformation steps in this section pertain to Big Data operations.

Note: PDI is configured by default to use the Apache Hadoop distribution. If you are working with a Cloudera or MapR distribution instead, you must install the appropriate patch before using any Hadoop functions in PDI. Patch installation is covered in Select DI Installation Options and Getting Started with PDI and Hadoop.