Skip to main content
Pentaho Documentation

Inspecting Your Data

When working with your transformation, you can gain valuable insights by visualizing and interacting with your data in many ways. The ability to quickly inspect step data reduces the amount of iterative work needed while building your transformation and enables you to rapidly publish a data source to share with either your teams or across your organization.

Depending on your operating system, you may need to upgrade your Web browser for the full experience. See our list of supported components here.

Begin Inspecting

Begin inspecting your transformation by clicking on a step. This displays the fly-out inspection bar at the top of the canvas area. The bar displays the name of the step selected and offers two options:

  • Inspect Data - Lets you inspect the data of a step once the transformation has run.
    Note: This option is not available until after you run your transformation.
  • Run and Inspect Data - Runs the transformation up to the selected step, then lets you inspect your data.

Additionally, you can begin inspecting in the following ways:

  • Step Context Menu - Right-click on a step and choose either Inspect Data or Run and Inspect Data.

  • Preview Data Panel - Select the Preview Data tab. Click the Inspect Data button located at the top right of the Preview Data bar.
  • Actions Menu - Select a step. From the Menu bar, click Action>Inspect Data or Action>Run and Inspect Data.
  • Keyboard Shortcuts - Select a step. Then using your keyboard:
    • In Windowspress either Shift+Ctrl+F9 (Inspect Data) or Ctrl+F9 (Run and Inspect Data).
    • In OS X, press Shift+Command+F9 (Inspect Data) or Command+F9 (Run and Inspect Data).

Tour the Environment

When you decide to inspect your data, the transformation presents options to visualize your data.

By default, table data is displayed with all available fields selected in Stream View.

000 - DII Opening Screen - 2016-10-06_14-21-13 - View.png

The following sample screen shows a visualization using data field values from the default Stream View for a step.

Inspect Your Data Model View

Use the number locators in the sample screen to reference the sections of the inspection environment.

Key Name Description
Circle 1 Header bar

Use the Header bar to access:

  • The title of the step being inspected.
  • The row count of the data sampled.
  • The Publish button, used to create a data source for collaborative use later via a data service.
  • The Exit button, to return to the transformation canvas
Circle 2

Stream View / Model View

Toggle between Stream View and Model View to inspect data and build visualizations based on the data sampled.

  • Use Stream View to inspect the data using a flat table or visualization types that do not require modeling.
  • Model View extends the analytic capabilities by allowing you to view your selected fields with hierarchical capabilities.
Search Box Use the Search Box to find a specific field in a long list of available fields. This is especially useful in Stream View where the order of the fields is determined by the transformation.
Available Fields Panel

The Available Fields Panel lists all available fields from the subset of data being inspected and allows you to select the specific fields you want to inspect. Click a field to select or clear it. You can also select a field by dragging it into the Layout Panel. Selected fields display with a blue disk icon to the left of their names.

  • Use Clear All to remove all fields from the Layout panel. The Canvas area will be automatically updated.
  • For the flat table in Stream View, you can click Select All to include all fields in the flat table in the order they are listed.
Circle 3 Visualization Selector Use the Visualization Selector to choose a visualization type. Selecting a visualization from the drop-down menu displays it in the Canvas area.
Circle 4 Layout Panel Displays the properties associated with a selected category or field.
Circle 5 Applied Filters

In Model View, the Applied Filters panel displays all the filters you have applied to a visualization. To apply a filter, double-click an available drill-down field on the visualization.

  • Drill-down fields are only available in the Model View since you can only drill down in hierarchies. To drill down, double-click in a visualization data point, such as a column in a graph or a slice in a pie chart, or on a member label. 
  • When drilling down, the next field in the hierarchy replaces its parent in the Layout panel, except in the pivot table and sunburst chart where the next field is added. The filter for the drill-down field is added to the Applied Filters panel and the visualization is updated according to those filters.
Circle 6 Canvas The Canvas displays the selected visualization.
007-number.png Visualization Tabs Use the Visualization Tabs to compare multiple views of your step data.

Explore with Visualizations

When you begin inspecting your data, you are presented with the Stream View with all available data fields selected. The selected data fields are represented in the Canvas area by a flat table. To reduce the number of data fields selected, click anywhere on a data field name. The blue dot to the left of the data field name will disappear, indicating that it is no longer selected. In some cases, it may be faster to deselect all data fields first by clicking the Clear All actions first, then select only the data fields you want to inspect. Your selections will be listed in the order that they are selected.

Once you have the desired data fields selected, you can change the table to a different visualization type by using the Visualization Selector. Alternately, you can create a new visualization by clicking the plus symbol button located to the right of the current tab. Once you have a new visualization created, switch to Model View to display a multidimensional representation of your selected fields. If you selected a visualization that requires a multidimensional model, it will automatically switch to Model View.

You can customize your model by adding, moving, and deselecting fields in the Layout panel, or by drilling down into fields in the visualization itself. When you double-click on drill down fields in your visualization, these fields display in the Applied Filters panel. The Layout panel automatically updates based on the selected filters. To remove a filter, hover over the field in the Applied Filters panel and click the Close button.

You can keep tabs open between sessions and always return to the inspection canvas to fine tune your transformation at any time until you are satisfied with the results. When you exit the inspection canvas, the step displays with the Inspection icon in the transformation canvas so you know it contains a remembered inspection session.

Note that when you reopen a remembered inspection session where some of the selected fields were removed from the transformation or step, the tabs using those fields are now marked as ‘invalid’. To validate those tabs, you can deselect the fields from the visualization in the inspection canvas, or exit your session and add the fields back to the transformation or step itself. The only exception is the flat table, where all invalid fields are removed automatically.

Once you are satisfied with your step data, you can make the content available for further collaboration by publishing a data source.

Publish for Collaboration

When you’re ready to make your content available for others, publish it as a data source. The data source will use a data service that is automatically created on the step, which can be used by other tools at a later time.


To publish, perform the following steps:
1. Click the Publish button at the top right of the Header bar. The Publish Data Source window opens.
2. Click Get Started to open the Publish Details window.

Enter the data source information in the following fields:

Fields Description
Data Source Name The name used by other Pentaho applications when accessing your data source.
Server The default value for this field is your current repository. You can select other repository connections if you have created them through the Repository Manager.
URL The base URL string used to connect to the server.
User Name

The user name required to access the server.

The user must also have publish permissions. 

Password The password associated with the provided user name

3. When you are done, click Finish

4. Once your data source is created, a confirmation will appear. The data source should now be available on the server. Click Close to continue inspecting your data or click View this in User Console to go to PUC and work with the data source in Analyzer.