Monitoring & Scheduling
Monitoring and Scheduling your Jobs & Transformations ..
Last updated
Monitoring and Scheduling your Jobs & Transformations ..
Last updated
There are several methods to monitor a job or transformation in Pentaho Data Integration. Here are some of them:
You can use the logging feature to record information about the execution of a job or transformation, such as the start and end time, the number of records processed, the errors encountered, and the debugging details. You can configure the logging settings in the Transformation Properties or Job Properties dialog box, and specify the database connection, the log table name, the logging interval, and the log record timeout. You can also view the log entries in the Execution Results panel or in the Schedule perspective.
You can use the step performance monitoring feature to track the performance of individual steps in a transformation, such as the number of rows read and written, the speed of processing, and the percentage of total processing time. You can enable this feature in the Transformation Properties dialog box, and view the performance metrics in the Execution Results panel or in a separate window. You can also export the performance data to a CSV file for further analysis.
You can use the sniff test feature to inspect the data flowing through a step in a transformation during execution. This feature allows you to see the input and output rows of a step, as well as any errors or warnings generated by the step. You can access this feature by right-clicking a step in the transformation and selecting one of the options from the Sniff Test During Execution menu.
To schedule a transformation in Pentaho, you can use one of the following methods:
Use the command line tool Pan to execute the transformation with the option -file:“your-transformation.ktr”. You can also specify parameters, variables, or arguments for the transformation. You can then use any scheduling tool on your system, such as cron on Linux or Task Scheduler on Windows, to run the command at a specific time or interval.
Use the Schedule perspective in the PDI client (Spoon) to create and manage schedules for transformations and jobs stored in the Pentaho Repository. You can specify the start and end date and time, the repeat frequency, the log level, and the safe mode for the transformation. You can also edit, delete, enable, disable, or stop the schedules from the Schedule perspective.