JSON
JSON
JSON (JavaScript Object Notation) is a lightweight data interchange format that's easy for humans to read and write, and simple for machines to parse and generate. It uses a text-based structure with key-value pairs and arrays to represent data. JSON is language-independent and widely used for transmitting data in web applications.
Now, to extract key-value pairs from this JSON object in Pentaho Data Integration, you would typically use the "JSON Input" step.
In the JSON Input step, the data stream field name, path and data type are defined.
id
$.customer.id
Integer
name
$.customer.name
String
$.customer.email
active
$.customer.active
Boolean
Workshops
Pentaho Data Integration offers several specialized steps for working with JSON data in your ETL processes.
The JSON Input step reads JSON data from files or fields, supporting complex nested structures and JSON Path expressions for precise data extraction. It handles arrays and provides options for managing missing values.
JSON Output converts your transformation data into JSON format, with control over formatting, file output options, and the ability to create both objects and arrays.
The REST Client step connects with REST APIs that typically use JSON, handling authentication, headers, and processing the returned JSON responses for further transformation.
Common workflows include API integration, JSON file processing, and complex JSON transformations, often using these steps in combination for effective data handling.
Last updated