When working with your transformation, you can gain valuable insights by visualizing and interacting with your data in many ways. The ability to quickly inspect step data reduces the amount of iterative work needed while building your transformation and enables you to rapidly publish a data source to share with either your teams or across your organization.
Depending on your operating system, you may need to upgrade your Web browser for the full experience. See our list of supported components here.
Begin inspecting your transformation by clicking on a step. This displays the fly-out inspection bar at the top of the canvas area. The bar displays the name of the step selected and offers two options:
- Inspect Data - Lets you inspect the data of a step once the transformation has run.
Note: This option is not available until you run your transformation.
- Run and Inspect Data - Runs the transformation, then lets you inspect the data of a step.
Additionally, you can begin inspecting in the following ways:
Step Context Menu - Right-click on a step and choose either Inspect Data or Run and Inspect Data.
- Preview Data Panel - Select the Preview Data tab. Click the Inspect Data button located at the top right of the Preview Data bar.
- Actions Menu - Select a step. From the Menu bar, click Action>Inspect Data or Action>Run and Inspect Data.
- Keyboard Shortcuts - Select a step. Then using your keyboard:
- In Windows,press either Shift+Ctrl+F9 (Inspect Data) or Ctrl+F9 (Run and Inspect Data).
- In OS X,press Shift+Command+F9 (Inspect Data) or Command+F9 (Run and Inspect Data).
Tour the Environment
When you decide to inspect your data, the transformation presents options to visualize your data.
By default, table data is displayed with all available fields selected in Stream View.
The following sample screen shows a visualization using data field values from the default Stream View for a step.
Use the number locators in the sample screen to reference the sections of the inspection environment.
Use the Header bar to access:
|Available Fields Panel||
Toggle between Stream View and Model View to begin inspecting data and building visualizations based on the data sampled.
|Layout Panel||Displays the properties associated with a selected category or field.|
|Visualization Selector||Use the Visualization Selector to choose a visualization type. Selecting a visualization from the drop-down menu displays it in the Canvas area.|
|Canvas||The Canvas displays the selected visualization.|
|Visualization Tabs||Use the Visualization Tabs to compare multiple views of your step data.|
Explore with Visualizations
When you begin inspecting your data, you are presented with the Stream View, with all available data fields selected. The selected data fields are represented in the Canvas area by a flat table. To reduce the number of data fields selected, click anywhere on a data field name. The blue dot to the left of the data field name will disappear, indicating that it is no longer selected. In some cases, it may be faster to deselect all data fields first, by clicking the Clear All actions first, then select only the data fields you want to inspect. Your selections will be listed in the order that they are selected.
Once you have the desired data fields selected, you can change the table to a different visualization type by using the Visualization Selector. Alternately, you can create a new visualization by clicking the plus symbol button located to the right of the current tab. Once you have a new visualization created, switch to Model View to display a multidimensional representation of your selected fields. If you selected a visualization that requires a multidimensional model, it will automatically switch to Model View. The Model View allows you to customize the layout, based on placement of the data fields shelves.
You can always return to the inspection canvas and fine tune your transformation at any time until you are satisfied with the results.
When you exit the inspection environment and return to the Data Integration canvas, all tabs will be discarded.
One you feel that your step data is in a good place, you can make the content available for further collaboration by publishing a data source.
Publish for Collaboration
When you’re ready to make your content available for others, publish it as a data source. The data source will use a data service that is automatically created on the step, which can be used by other tools at a later time.
To publish, perform the following steps:
1. Click the Publish button ( ) at the top right of the Header bar. The Publish Data Source window opens.
2. Click Get Started to open the Publish Details window.
Enter the data source information in the following fields:
|Data Source Name||The name used by other Pentaho applications when accessing your data source.|
|Server||The default value for this field is your current repository. You can select other repository connections if you have created them through the Repository Manager.|
|URL||The base URL string used to connect to the server.|
The user name required to access the server.
The user must also have publish permissions.
|Password||The password associated with the provided user name|
3. When you are done, click Finish.
4. Once your data source is created a confirmation will appear. Click Close to continue inspecting or return to PUC to work with the data source. The data source should now be available on the server.