SMB
File sharing ..
Workshop - SMB/CIFS
The Server Message Block (SMB) protocol is a network file sharing protocol that allows applications on a computer to read and write to files and to request services from server programs in a computer network. The SMB protocol can be used on top of its TCP/IP protocol or other network protocols
Objective of this workshop is to:
install & configure a basic Samba server.
share user home directories as well as provide read-write anonymous access to selected directory.

Pentaho Data Integration
Pentaho Data Integration utilizes Virtual File System (VFS) as the abstraction layer within the kernel to expose different filesystems.
In PDI, you can add a VFS connection and then reference that connection whenever you want to access files or folders on your Virtual File System.
(Optional) Download the latest jcifs driver.
(OptionalCopy the JCIFS JAR file into Pentaho Data Integration "lib" folder.
Download CIFS driver
Pentaho Data Integration ships with jcifs-1.3.3.jar
If you wish to replace the current driver, rename to: jcifs-1.3.3.jar -> jcifs-1.3.3.jar.bak
Start Pentaho Data Integration.
cd
cd ~/Pentaho/design-tools/data-integration
./spoon.sh
Create a new Transformation.
Click on the 'View' tab.
Highlight 'VFS Connections' and select 'New'.

Configure with the following details:

Click 'Test'.

Transformation - SMB File Retrieval
Let's create a simple Transformation to onboard data via an SMB VFS connection.
Create the following transformation:

Double-click on Text file input > File tab
Click on Browse and ensure you select:
VFS Connections > SMB > Pentaho/design-tools/data-integration/samples/transformations/files/sales_data.csv
Add the path.

Click on Content tab & configure with the following settings:

Click on Fields tab & click on 'Get Fields'

Preview the rows.

Click OK.
Last updated
Was this helpful?