Skip to main content

Pentaho+ documentation has moved!

The new product documentation portal is here. Check it out now at docs.hitachivantara.com

 

Hitachi Vantara Lumada and Pentaho Documentation

Use the Transformation menu

Parent article

The Transformation menu contains transformation settings, options, and properties.

Right-click any step in the transformation canvas to view the Transformation menu. Each menu item is described in the following table:

Menu ItemDescription
New HopCreates a new hop.
EditShows the configuration window for the step.
DescriptionAllows you to add a description to the step.
Open Referenced ObjectAllows you to map a sub-transformation. Mapping a sub-transformation is covered in detail in Mapping.
Inspect DataAllows you to inspect the data stream of a step once the transformation has run.
NoteThis option runs your transformation only if it was not previously executed.
Run and Inspect DataRuns the transformation up to the selected step, then lets you inspect your data.
Data MovementDescribes the way data moves through the transformation when there is more than one hop. The following options are available:
  • Round Robin: Partitions the output stream and sends a portion of all output records down each hop.
  • Load Balance: Checks the output row sets to see how much room is left in the buffer. It selects the one that is most empty. If the rows are distributed to steps that take very little processing time per row (or the exact same amount of time for each step to process a row), Load Balance is identical to Round Robin. If the rows are sent down one path that takes a long time to process, such as Sort or Group By and down another path that processes rows more quickly, the "quick path" will likely have more rows sent to it, as it will empty its buffer before the "slow path" has a chance to empty its buffer. This is typically used for clustered transformations, where the same processing occurs on different nodes. The row buffer is set, by default, to 10000. To change the row buffer size, open the Transformation Settings window, then select Nr of rows in rowset on the Miscellaneous tab.
  • Copy Data to Next Steps: Copies the data to subsequent steps.
Change Number of Copies to StartStarts several instances of a step in parallel.
CopyCopies selected items to the clipboard.
DuplicateMakes a copy of the selected items, then pastes them to the canvas.
DeleteDeletes selected items from the canvas.
Hide

Hides the step from the PDI client canvas.

Caution

If you hide the step, you will need to open the transformation or job XML file and hand edit it to view it again. For more details, see the troubleshooting section.

DetachDetaches the step or entry from the transformation or job.
Input FieldsShows metadata, like the field name and type, for fields that come into the step.
Output FieldsShows metadata, like the field name and type, for fields that go out of the step.
Sniff Test During ExecutionThe sniff test displays data as it travels from one step to another in the stream. To use this, right-click a step in the transformation as it runs and select Sniff Test During Execution. The following options are available:
  • Sniff test input rows: Shows the data inputted into the step.
  • Sniff test output rows: Shows the data outputted from the step.
  • Sniff test error handling: Shows error handling data.

For more information on how to use this tool, see the Sniff Test tool article.

Check Selected Step(s)Checks transformation steps for problems that could interfere with successfully running the transformation. Right-click the transformation step that you want to check and click Check Selected Step(s). Warnings and errors appear in the Results of transformation checks window.
Error HandlingIndicates how to apply error handling for a step. When this is selected, the Step error handling settings window appears.
PreviewAllows you to preview the results of the transformation. Launches the Transformation Debug Dialog.
Align/DistributeArranges steps on the canvas so that they are aligned properly or distributed evenly. This helps create a visually pleasing transformation that is easier to read and digest.

Align refers to where the steps are permitted along the x (horizontal) or y (vertical) axis. Distribute makes the horizontal and vertical spacing between steps consistent. Typically, you turn on the grid, then move the different steps or entries on the canvas so that they form some sort of pattern, like a straight or branching line.

You select steps and apply the following options, as needed:

  • Align Left: Positions all steps so their left sides start on the same "x" (horizontal) coordinate as the left-most step. After applied, steps are arranged in a straight vertical line. No changes are made to the spaces between steps.
  • Align Right: Positions all steps so their right sides start on the same "x" (horizontal) coordinate as the right-most step. After applied, steps are arranged in a straight vertical line. No changes are made to the spaces between steps.
  • Align Top: Positions all steps so their top sides start on the same "y" (vertical) coordinate as the step positioned closest to the top of the canvas. After applied, steps are arranged in a straight horizontal line. No changes are made to the spaces between steps.
  • Align Bottom: Positions all steps so their bottom sides start on the same "y" (vertical) coordinate as the step positioned closest to the bottom of the canvas. After applied, steps are arranged in a straight horizontal line. No changes are made to the spaces between steps.
  • Distribute Horizontally: Positions all steps so that they are evenly spaced horizontally. After applied, steps are arranged evenly. No changes are made to the alignment.
  • Distribute Vertically: Positions all steps so that they are evenly spaced vertically. After applied, steps are arranged evenly. No changes are made to the alignment.
  • Snap to Grid: Aligns steps on the canvas to the grid. If grid markers do not appear on the canvas, select Tools Options Look & Feel Show Canvas Grid. See PDI client options for more information on how to customize the PDI client.
Data ServicesCreate, edit, delete, or test a Pentaho Data Service. The Pentaho Data Service allows others to obtain the results of a transformation, even if the person does not have the PDI client or Pentaho Server installed. The Pentaho Data Service is discussed in great detail in Pentaho Data Services.
MappingProvides a way for you to map target fields from the step to source columns in a database. When selected, the Mapping window appears containing the following fields:
  • Source Fields: Lists the field names from the incoming stream.
  • Target Fields: Lists the column names in a target table.
  • Auto Target Selection: Automatically selects a matching table column if the target field is selected.
  • Auto Source Select: Automatically selects a matching target field if the table column is selected.
  • Add: Allows you to move the mapped target and source information to the mappings grid.
  • Guess: Makes mappings based on a computer algorithm.
  • Hide assigned source fields and Hide assigned target fields: Removes mappings from the Source Fields and Target Fields lists those fields are added to the mapping grid.
  • Delete: Removes mappings from the mapping grid so that they reappear in the Target Fields and Source Fields lists again.

When you click OK, the Mapping window closes and a Select / Rename Values step appears on the canvas. It is usually named after the step that right-clicked. The Select / Rename Values window contains the mappings. If you are not able to make mappings, the step still appears, but the properties are blank.

PartitionsSplits data into subsets according to a rule that is applied on a row of data. Partitions are discussed in detail in the Partitioning data article.
ClustersAllows you to create Carte clusters. For more information, see Use Carte Clusters.