Pentaho Data Integration

Pentaho Pro Data Integration Practitioner

This course is a work in progress!

This course is in early access. The content will be updated continually over the next few months.

These Workshops are not intended for production environments.

Introduction

Pentaho Data Integration, often referred to as PDI, is a tool for building data pipelines. Its main features include an intuitive graphical interface, support for a wide range of data sources and destinations, extensive transformation capabilities, and job scheduling and automation.

The Pentaho Data Integration Practitioner workshop introduces the key concepts behind building data pipelines with Pentaho Data Integration.

Once you have completed the course, you will be able to explain:

  • The Pentaho Data Integration components

  • Key concepts and terminology

  • How to onboard various data sources

  • How to transform and enrich datasets

  • How to scale out an enterprise solution


Overview

The following video introduces the main topics covered in the workshops.

To hear the audio, copy and paste the website URL into the Chrome browser on your host machine, because the Lab environment has no sound card.

The following content has been automatically generated by an AI system and should be used for informational purposes only. We cannot guarantee the accuracy, completeness, or timeliness of the information provided. Any actions taken based on this content are at your own risk. We recommend seeking qualified expertise or conducting further research to validate and supplement the information provided.


Lab Environment

The video highlights the menu options that will help you get the best Lab experience.

Login OS
  • Username: pentaho
  • Password: password

Portainer
  • Username: admin
  • Password: portainer123

PGAdmin4
  • Username: pentaho@pgadmin4.com
  • Password: Passw0rd123


FAQs

The VM is unresponsive!
  • Refresh the browser session to reconnect.

  • Try another browser. The recommended browser is Google Chrome.

  • A corporate VPN may cause connection issues. If you connect through one, contact your IT department to get the Lab URL whitelisted.

When does the Lab expire?

The initial duration is 5 days. You will receive an email asking if you wish to extend your time limit.

Videos aren't loading?
  • Hard refresh your browser with CTRL + F5.

Is there sound?

Yes, but not inside the Lab. There's no sound card attached to the Lab, so you'll need to copy and paste the Lab Guide URL into a browser on your host machine. 😊

Where can I get a copy of the workshop files?

All the collateral can be found in the Lab at: ~/Workshop--Data-Integration.

You can also clone or fork the Git repository:

gh repo clone jporeilly/Workshop--Data-Integration
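
If the GitHub CLI is not available, plain git can fetch the same repository over HTTPS. The following is a minimal sketch, not an official procedure: it assumes git is installed, the function name clone_workshop and the DEST variable are hypothetical, and the URL is the repository address listed on this page. It also assumes that any existing directory at DEST is already a Git clone.

```shell
# Repository URL as listed on this page.
REPO_URL="https://github.com/jporeilly/Workshop--Data-Integration.git"

# Hypothetical destination; override by exporting DEST before sourcing this snippet.
DEST="${DEST:-$HOME/Workshop--Data-Integration}"

# Hypothetical helper: clone the workshop files, or update an existing clone.
clone_workshop() {
  if [ -d "$DEST/.git" ]; then
    # An existing clone is fast-forwarded to the latest commit.
    git -C "$DEST" pull --ff-only
  else
    # Otherwise make a fresh clone into DEST.
    git clone "$REPO_URL" "$DEST"
  fi
}
```

Run clone_workshop after defining it. Nothing is downloaded until the function is called.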

Corporate Headquarters Regional Contact Information

© Hitachi Vantara LLC 2025. All rights reserved. HITACHI is a trademark or registered trademark of Hitachi, Ltd. VSP is the trademark or registered trademark of Hitachi Vantara Corporation. All other trademarks, service marks and company names are properties of their respective owners.


Last updated 20 days ago

Americas: +1 866 374 5822 or info@hitachivantara.com

Europe, Middle East and Africa: +44 (0) 1753 618000 or info.emea@hitachivantara.com

Asia Pacific: +852 3189 7900 or info.marketing.apac@hitachivantara.com
Workshop files repository: https://github.com/jporeilly/Workshop--Data-Integration.git