The video course is composed by 80 lectures and more then 10 hours of content. It is a walk-through of a real ETL project using Pentaho Data Integration (also known as Kettle), starting from the beginning of the design of the ETL with some easy steps that becomes more and more complex, layer by layer, as you go forward. This Pentaho Data Integration video course also cover some basic concepts of data integration and data warehouse techniques.
The change log page describes some more details but the most important feature is about the used of the Apache Chemistry libraries updated at the v0.12. This version is useful to support the CMIS 1.1 for Alfresco and more.
During the next hours the new version will be available in the Pentaho marketplace… stay tuned!
In this tutorial we are going to see how to install Pentaho Data Integration 5. PDI 5 (called Kettle) is one of the most powerful tool of the Pentaho Suite that develop a pure (and complete) ETL tool. This tutorial is an extraction of the complete wiki section dedicated to this amazing tool.
If you have a Linux based operating system or a Windows based platform, the tutorial should work in any case because, you will see, the steps are very simple and easy and not related on the platform.
Before start to install the PDI 5 (aka Kettle) you have to check to have Java installed in you system. To check it, you have simply to execute the command below from a terminal.
If you don’t have it, below you can find a tutorial on how to install it. Please, remember that PDI 5 requests Java7.
Download and install PDI 5 Community Edition
After Java7 is available into your system, you have to download the PDI 5 package from the official website or the sourceforge web page. In our case we are going to install the Pentaho Data Integration 5 Community Edition.
Once the ‘pdi-ce-5.0.1-stable.zip’ file will be downloaded in the system, unzip it on the desktop or everywhere else you will like. All the PDI 5 tool is available in a folder described below:
Probably you cannot believe me but this is enough to install PDI 5 into your system. 😉
PDI 5 tool is composed by different executables and services (Spoon, Kitchen, Pan) everyone designed for a specific purpose. To create all the configuration folders and files, you have to run the Spoon tool for the first time. Spoon is a graphical user interface that allows you to design transformations and jobs that can be run with the other Kettle tools (Pan and Kitchen). To run the spoon tool, follow the instructions described here.
After the first run you are ready to use it.. for example with the A.A.A.R. solution. 😉
This simple video has been developed for the next Pentaho Community Meeting in Sintra (Portugal), next 5-th of October, where the A.A.A.R. and the CMIS Input Plugin will be presented for the first time. The video shows how to extract metadata from an Alfresco repository using the CMIS Input Plugin.