Pentaho Data Integration

Error Handling

Handling errors in a transformation


Last updated 1 month ago

Workshop - Error Handling

This workshop introduces the Select values step and error handling to find invalid date values.

  • Explicit data conversion using the Select values step

  • Defining error handling

  • Configure the following steps:

    • CSV File Input

    • Select values

    • Text File output


Create a new Transformation

Any one of these actions opens a new Transformation tab for you to begin designing your transformation.

  • By clicking File > New > Transformation

  • By using the CTRL-N hot key

Select Values

The Select Values step in Pentaho Data Integration (PDI) is a powerful transformation component that allows you to manipulate fields in your data stream. It performs three primary functions:

  • selecting specific fields to include or exclude

  • renaming fields, and

  • converting field data types.

This versatile step enables you to clean and restructure your data by removing unnecessary columns, standardizing field names, or ensuring proper data typing. The Select Values step is commonly used early in transformations to prepare data before more complex operations are performed.
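The three functions above can be pictured with a small Python sketch. This is only an analogy for what the Select Values step does to each row, not PDI code: the `select_values` helper, the sample row, and the field names (`id`, `order_dt`, `notes`) are all hypothetical.

```python
from datetime import datetime

def select_values(row, keep, rename, convert):
    """Keep only the fields in `keep`, convert their types, then rename them."""
    out = {}
    for name in keep:
        value = row[name]
        if name in convert:
            value = convert[name](value)      # explicit type conversion
        out[rename.get(name, name)] = value   # rename if a new name is given
    return out

# Hypothetical input row: every value arrives as a string, as from a CSV file.
row = {"id": "7", "order_dt": "03/15/2021", "notes": "n/a"}

result = select_values(
    row,
    keep=["id", "order_dt"],                  # "notes" is dropped
    rename={"order_dt": "order_date"},
    convert={
        "id": int,
        "order_dt": lambda s: datetime.strptime(s, "%m/%d/%Y").date(),
    },
)
print(result)
```

Here the `notes` field is removed, `order_dt` is renamed to `order_date`, and both remaining fields change type from string to integer and date.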

  1. In the Select Values step, check the Date format: yyy-MM-dd. This format is deliberately incorrect, so the records that fail conversion are captured in the Text file output - Error step.

  2. Preview the Text file output - Error step.

  3. Scroll to the end of the execution pane.

Notice the error fields. These have been configured in the hop.
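As a rough mental model of what the error-handling hop does, the sketch below routes rows that fail date conversion to a separate error list and appends extra error fields to them. It is a Python analogy under stated assumptions, not PDI's implementation: the function name, the sample rows, and the error field names (`nr_errors`, `error_desc`, `error_field`, `error_code`) are hypothetical stand-ins for whatever names are configured on the hop.

```python
from datetime import datetime

def convert_with_error_handling(rows, field, fmt):
    """Split rows into (good, errors); error rows gain descriptive error fields."""
    good, errors = [], []
    for row in rows:
        try:
            parsed = datetime.strptime(row[field], fmt).date()
        except ValueError as exc:
            # The failing row keeps its original values and gains error fields.
            errors.append({
                **row,
                "nr_errors": 1,
                "error_desc": str(exc),
                "error_field": field,
                "error_code": "DATE_PARSE",   # hypothetical code
            })
        else:
            good.append({**row, field: parsed})
    return good, errors

# Hypothetical sample data: one valid date, one invalid value.
rows = [
    {"id": 1, "order_dt": "03/15/2021"},
    {"id": 2, "order_dt": "not a date"},
]
good, errors = convert_with_error_handling(rows, "order_dt", "%m/%d/%Y")
print(len(good), len(errors))
for err in errors:
    print(err["id"], err["error_field"], err["error_desc"])
```

The design point is that a bad row does not stop the run: it is diverted to the error output together with enough context to diagnose it.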

  1. Double-click on the red cross.

  2. Set the format to: MM/dd/yyyy
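To see why the format mask matters, the following sketch counts how many values fail to parse under a mismatched mask versus a matching one. It assumes, for illustration only, that the source dates look like `01/31/2020`; the sample values are hypothetical. Python's `strptime` directives are used here as a loose analog of the PDI masks (`%m/%d/%Y` for `MM/dd/yyyy`, and a dash-separated, year-first mask for the incorrect one), and Python's strictness may differ from PDI's behavior.

```python
from datetime import datetime

def count_failures(values, fmt):
    """Return how many strings in `values` cannot be parsed with `fmt`."""
    failures = 0
    for value in values:
        try:
            datetime.strptime(value, fmt)
        except ValueError:
            failures += 1
    return failures

# Hypothetical sample: two valid dates and one impossible date (February 30).
sample = ["01/31/2020", "12/05/2021", "02/30/2021"]

wrong_mask = "%Y-%m-%d"   # year-first with dashes: does not match the data
right_mask = "%m/%d/%Y"   # analog of MM/dd/yyyy: matches the data layout

print(count_failures(sample, wrong_mask))
print(count_failures(sample, right_mask))
```

With the mismatched mask every value is rejected; with the matching mask only the genuinely invalid date remains, which is the kind of record this workshop sets out to find.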

RUN the Transformation

This is the final part of creating a transformation: executing it locally.


Figures: Finally RUN the transformation; Error handling; Error fields; Error handling - hop.