HBase Input

Last updated
Save as PDF

This step reads data from an HBase table according to user-defined column metadata. HBase is a distributed, column-oriented database that provides random read and write access to the Hadoop File System. HBase stores all data as raw bytes without any associated metadata. A mapping provides metadata that allows the step to decode the binary values properly.

Select an engine

You can run the HBase Input step on the Pentaho engine or on the Spark engine. Depending on your selected engine, the transformation runs differently. Select one of the following options to view how to set up the HBase Input step for your selected engine.

Using the HBase Input step on the Pentaho engine: Learn how to set up this step when using the Pentaho engine.
Using the HBase Input step on the Spark engine: Learn how to set up this step when using the Spark engine.

For instructions on selecting an engine for your transformation, see Run configurations.

Pentaho+ documentation has moved!

The new product documentation portal is here. Check it out now at docs.hitachivantara.com.

Select an engine