Pentaho Data Integration

CRUID

CRUID database operations (Create, Read, Update, Insert / Update, Delete) are a set of five basic functions for manipulating data in a persistent storage system, such as a relational database.


Last updated 1 month ago

Workshops

Understanding these core CRUID operations is essential for anyone working with databases, as virtually all database interactions can be categorized into one of these five fundamental actions.

DB Connections

In this guided Workshop, you will:

Configure DBeaver / Pentaho database connections:

  • Configure the Pentaho JDBC database connection.

Create

To create a new record in a database, we can use the INSERT statement in SQL or the POST method in HTTP.

In this Workshop, you will onboard data from a CSV file into a database table:

  • CSV File input step

  • Table output step
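
The Create operation can be sketched outside PDI as follows. This is a minimal sketch, assuming an in-memory SQLite database; the `orders` table, its columns, and the CSV content are hypothetical and do not come from the workshop. It parses CSV rows and writes them with INSERT, which is roughly what the CSV File input and Table output steps do together.

```python
import csv
import io
import sqlite3

# Hypothetical CSV content standing in for the workshop's input file.
CSV_TEXT = "order_id,customer,amount\n1,Alice,120.50\n2,Bob,75.00\n"


def load_csv(conn, csv_text):
    """Insert every CSV row into the (hypothetical) orders table; return the row count."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    conn.executemany(
        "INSERT INTO orders (order_id, customer, amount) VALUES (?, ?, ?)",
        [(int(r["order_id"]), r["customer"], float(r["amount"])) for r in rows],
    )
    conn.commit()
    return len(rows)


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
)
inserted = load_csv(conn, CSV_TEXT)
print(inserted, "rows inserted")
```

Binding values with `?` placeholders, rather than building the SQL string by hand, keeps the CSV content from being interpreted as SQL.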

Read

To read data from a database, we can use the SELECT statement in SQL or the GET method in HTTP.

In this guided demonstration, you will Read from a database table to determine the state of customer orders:

  • Table input step

  • Calculator step

  • Number range step

  • Sort

  • Select values
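
The Read pipeline above can be approximated in a few lines. This is a sketch under stated assumptions: the `orders` table, its columns, the sample rows, and the band thresholds are all hypothetical, and the comments only indicate which PDI step each line loosely corresponds to.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (order_id INTEGER PRIMARY KEY, quantity INTEGER, unit_price REAL)"
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, 2, 10.0), (2, 50, 4.0), (3, 10, 12.5)],
)


def order_band(total):
    """Bucket a total into a label, similar in spirit to a Number range step."""
    if total < 50:
        return "small"
    if total < 150:
        return "medium"
    return "large"


# SELECT reads the rows (Table input); the total is a derived field (Calculator).
rows = conn.execute(
    "SELECT order_id, quantity * unit_price AS total FROM orders"
).fetchall()

# Sort by total, largest first, then keep only the fields of interest (Select values).
ordered = sorted(rows, key=lambda row: row[1], reverse=True)
report = [(order_id, order_band(total)) for order_id, total in ordered]
print(report)
```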

Update

To update an existing record in a database, we can use the UPDATE statement in SQL or the PUT or PATCH method in HTTP.

In this guided demonstration, you will Update fields (Type 1) in the employee table:

  • Update step
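
As an illustration of a Type 1 update, where the old value is overwritten and no history is kept, here is a minimal sketch. It assumes an in-memory SQLite database and a hypothetical `employee` table layout; the workshop's actual table will differ.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE employee (emp_id INTEGER PRIMARY KEY, name TEXT, department TEXT)"
)
conn.executemany(
    "INSERT INTO employee VALUES (?, ?, ?)",
    [(1, "Alice", "Sales"), (2, "Bob", "Finance")],
)


def update_department(conn, emp_id, department):
    """Overwrite the department in place (Type 1); return the number of rows changed."""
    cur = conn.execute(
        "UPDATE employee SET department = ? WHERE emp_id = ?",
        (department, emp_id),
    )
    conn.commit()
    return cur.rowcount


changed = update_department(conn, 1, "Marketing")
missing = update_department(conn, 99, "Marketing")  # no matching key, nothing is updated
```

An UPDATE never creates rows: when the key is not found, as with `emp_id` 99 here, the statement succeeds but affects zero rows.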

Insert / Update

To update a record if it already exists, or insert it if it does not (an "upsert"), we can use the MERGE statement in SQL or the PUT method in HTTP.

In this guided demonstration, you will Update database fields and Insert new records into the employee table:

  • Insert / update step
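
The insert-or-update behaviour can be sketched as an UPDATE followed by an INSERT when no row matched. This is not a MERGE statement: SQLite, used here for a self-contained example, does not provide one, so the sketch swaps in an update-then-insert fallback. The `employee` table and the helper name `insert_or_update` are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE employee (emp_id INTEGER PRIMARY KEY, name TEXT, department TEXT)"
)
conn.execute("INSERT INTO employee VALUES (1, 'Alice', 'Sales')")


def insert_or_update(conn, emp_id, name, department):
    """Update the row with this key if present, otherwise insert it; return the action taken."""
    cur = conn.execute(
        "UPDATE employee SET name = ?, department = ? WHERE emp_id = ?",
        (name, department, emp_id),
    )
    if cur.rowcount == 0:
        # No existing row matched the key, so create a new record.
        conn.execute(
            "INSERT INTO employee (emp_id, name, department) VALUES (?, ?, ?)",
            (emp_id, name, department),
        )
        action = "insert"
    else:
        action = "update"
    conn.commit()
    return action


actions = [
    insert_or_update(conn, 1, "Alice", "Marketing"),  # key exists
    insert_or_update(conn, 2, "Bob", "Finance"),      # key is new
]
print(actions)
```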

Delete

To delete a record from a database, we can use the DELETE statement in SQL or the DELETE method in HTTP.

In this guided demonstration, you will Delete records based on a criterion set by a parameter and passed in as a variable:

  • Get variables step

  • Delete step
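
A parameter-driven delete can be sketched as follows. The assumptions are labeled in the code: the `orders` table, the `STATUS_TO_DELETE` variable name, and the plain dictionary standing in for PDI variables are all hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (order_id INTEGER PRIMARY KEY, status TEXT)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [(1, "shipped"), (2, "cancelled"), (3, "cancelled")],
)

# Stand-in for a variable populated from a parameter (hypothetical name).
variables = {"STATUS_TO_DELETE": "cancelled"}


def delete_by_status(conn, status):
    """Delete every order with the given status; return the number of rows removed."""
    cur = conn.execute("DELETE FROM orders WHERE status = ?", (status,))
    conn.commit()
    return cur.rowcount


deleted = delete_by_status(conn, variables["STATUS_TO_DELETE"])
remaining = conn.execute("SELECT order_id FROM orders ORDER BY order_id").fetchall()
print(deleted, "rows deleted;", len(remaining), "remaining")
```

Passing the criterion as a bound value means the same statement can be reused with a different parameter on each run.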
