Pentaho Documentation

Correct Data Quality

Overview

Explains how to correct a data quality issue by adding a Value Mapper step to the transformation.

Follow these instructions to correct the data quality issue.

  1. Click on the Data Integration perspective in the main toolbar.
  2. Right-click the Write to Database step from the flow and choose Detach step. Both hops are detached.
  3. Expand the Transform folder in the Design tab and add a Value Mapper step to the transformation.
  4. Draw a hop from the Filter Missing Zips step to the Value Mapper step and select Result is TRUE.
  5. Draw a hop from the Prepare Field Layout step to the Value Mapper step. When prompted to select the output type, select Main Output of Step.
  6. Draw a hop from the Value Mapper step to the Write to Database step.
  7. Double-click the Value Mapper step to open its edit step properties dialog box.
  8. In the Fieldname to use field, select COUNTRY.
  9. In the first row of the Field Values table, type United States as the Source value and USA as the Target value. Click OK to exit the dialog box.
  10. Save and run the transformation.
  11. Click Visualize in the main toolbar.
  12. Select the More actions and options button, then select Administration > Clear Cache. When a note appears indicating that the Analyzer and Mondrian caches have been cleared, click OK.
  13. From the menu select View > Show Visualization Properties.
  14. Click Refresh under the Data section of the Visualization Properties panel and notice that the data has been cleansed.
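
To make the effect of steps 8 and 9 concrete, the following is a minimal Python sketch of what the Value Mapper configuration does to the COUNTRY field. It is an illustration only and is not PDI code: the map_values function and the sample rows are hypothetical, and it assumes that values without a matching Source value pass through unchanged.

```python
# Illustrative sketch only; map_values and the sample rows are hypothetical
# and are not part of Pentaho Data Integration.

def map_values(rows, field, mapping):
    """Return copies of rows with `field` replaced according to `mapping`.

    Assumption: values with no matching source value are left unchanged.
    """
    cleaned = []
    for row in rows:
        new_row = dict(row)  # copy so the input rows are not modified
        if field in new_row:
            new_row[field] = mapping.get(new_row[field], new_row[field])
        cleaned.append(new_row)
    return cleaned


# Made-up sample rows standing in for the stream that reaches the Value Mapper step.
rows = [
    {"CITY": "Boston", "COUNTRY": "United States"},
    {"CITY": "Paris", "COUNTRY": "France"},
    {"CITY": "Austin", "COUNTRY": "USA"},
]

# Source value "United States" maps to target value "USA", as in step 9.
cleaned = map_values(rows, "COUNTRY", {"United States": "USA"})

for row in cleaned:
    print(row)
```

After the mapping, both spellings of the country are stored as USA, so the visualization groups them under a single value.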