Monitoring & Scheduling
Schedule Pentaho Jobs / Transformations and Monitor results ..
Last updated
Schedule Pentaho Jobs / Transformations and Monitor results ..
Last updated
In this guided demonstration, you will:
Configure a Repository connection.
Monitor a Job / Transformation.
Schedule a Job / Transformation.
One way to monitor a Pentaho Transformations / Jobs is to use the PDI Status page, which shows you the details of remotely executed and scheduled transformations, such as the date and time they were run, their status and results.
To access the PDI Status page, you need to navigate to the /pentaho/kettle/status page on your Pentaho Server, and change the host name and port to match your configuration.
Another way to monitor a Pentaho transformation is to enable logging and step performance monitoring in the PDI client. Logging provides you with summarized and detailed information about a transformation, such as the number of records inserted, the total elapsed time, and any errors or exceptions.
Step performance monitoring allows you to see how each step in your transformation is performing in terms of speed, memory usage, and input/output rates.
Now that executed the transformation against the Pentaho server kettle engine you can remotely log into the service to monitor the tasks.
Click on the following URL:
Log into the service.
Username
admin
Password
password
From here you can perform a number of operations:
• RUN the Transformation / Job.
• Stop the running Transformation / Job.
• View Transformation / Job details.
• Remove Transformation / Job.
For monitoring remote servers, log in with the following URL format:
• http://[IP address / FQDN]: [Port]/kettle/status
• Default Username / Password: cluster/cluster
If you have a transformation or a job stored in the Pentaho Repository, you can use the Schedule perspective in the PDI client to create and manage schedules. You can specify the start and end date and time, the repeat frequency, the log level, and the safe mode for the transformation or job.
You can also edit, delete, enable, disable, or stop the schedules from the Schedule perspective.
Ensure that the Pentaho server is up and running ..
Pentaho Repository.
Click on the Connect button (top right on canvas).
Click Add and Enter the following details:
Save & Close.
Click on the Connect button.
Select the connection (Pentaho).
Enter credentials:
Username
admin
Password
password
Click on login.
Lets upload tr_hello_world.ktr and RUN.
As you're connected to the 'Pentaho' Repository, you will need to browse for the transformation locally.
Select File -> Import from an XML file ..
Browse to:
/home/pentaho/Workshop--Data-Integration/Labs/Module 5 - Enterprise Solution/Topic 4 - Monitor
Select: tr_hello_world.ktr & Open.
Change the File type: *.ktr
Click Save.
Let's create a Public / Demo folder.
Enter demo and Save.
Let's now RUN the transformation on the Pentaho server.
Log into the Pentaho User Console.
Username
Admin
Password
password
Select Browse Files -> Public -> Demo
Highlight the tr_hello_world and under File Actions, click open.
Click the Close window.
This indicates that the transformation has been successfully executed.
Ensure that the Pentaho server is up and running and that you have connected to the Pentaho Repository - see previous Monitor section.
Transformations /Jobs need to uploaded into the Pentaho Repository.
Connect to the 'Pentaho' Repository.
Open the tr_hello_world.ktr
From the main menu select: Action -> Schedule
Enter the following settings:
Monitor the status (Periodically, refresh the browser).
To manage the Schedule, switch to the Schedule perspective.
Highlight the tr-hello_world schedule.
6. Disable the schedule an switch back to the Data Integration perspective.