05 Dec

How to install Pentaho Data Integration 7 (aka Kettle)

Few weeks ago, close to the annual Pentaho Community Meeting, the Pentaho Team released the brand new Pentaho Suite v7 with a complete restyle of the layout (of course, this is only one of the improvements). This a good opportunity for me to update the step by step tutorial on how to install the Pentaho Data Integration (aka Kettle) after the one about the past version 5.

Read More

16 Nov

CMIS Input plugin is confirmed as compliant with Pentaho 6.0.0.0-353

cmisinput.png

During the last days I had the time to test the CMIS Input plugin for Pentaho Data Integration on the latest version 6.0.0.0-353. The CMIS Input plugin version is the latest: version 1.3. The results of the tests are that the latest version is confirmed as compliant to the brand new Pentaho release. Read More

04 Nov

A.A.A.R. v4.0 major release

Some months are gone from the latest A.A.A.R. release, but this doesn’t mean that things are not going ahead. 🙂 During the past months I received some concerns about the extraction performance. Today the A.A.A.R. v4.0 is released with a couple of relevant features: the transparent authentication from Alfresco to Pentaho (using the Pentaho Transparent Authentication plugin) and the repository extraction significantly improved in performance. Read More

28 Jul

slf4j conflict during AAAR_Extract execution

slf4jDuring my support activities on the A.A.A.R. solution, I receive few contacts reporting about the error described below. The context is the first execution of the AAAR_Extract script, immediately after the first installation.

2015/07/25 16:25:15 - Cmis Input documents before last update.0 - ERROR (version 5.4.0.1-130, build 1 from 2015-06-14_12-34-55 by buildguy) : Unexpected error
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - ERROR (version 5.4.0.1-130, build 1 from 2015-06-14_12-34-55 by buildguy) : java.lang.LinkageError: loader constraint violation: when resolving method "org.slf4j.impl.StaticLoggerBinder.getLoggerFactory()Lorg/slf4j/ILoggerFactory;" the class loader (instance of org/pentaho/di/core/plugins/KettleURLClassLoader) of the current class, org/slf4j/LoggerFactory, and the class loader (instance of java/net/URLClassLoader) for resolved class, org/slf4j/impl/StaticLoggerBinder, have different Class objects for the type LoggerFactory; used in the signature
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at org.slf4j.LoggerFactory.getILoggerFactory(LoggerFactory.java:299)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at org.slf4j.LoggerFactory.getLogger(LoggerFactory.java:269)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at org.slf4j.LoggerFactory.getLogger(LoggerFactory.java:281)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at org.apache.chemistry.opencmis.client.bindings.cache.impl.CacheImpl.<clinit>(CacheImpl.java:38)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at org.apache.chemistry.opencmis.client.bindings.impl.RepositoryInfoCache.<init>(RepositoryInfoCache.java:56)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at org.apache.chemistry.opencmis.client.bindings.impl.CmisBindingImpl.clearAllCaches(CmisBindingImpl.java:253)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at org.apache.chemistry.opencmis.client.bindings.impl.CmisBindingImpl.<init>(CmisBindingImpl.java:150)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at org.apache.chemistry.opencmis.client.bindings.CmisBindingFactory.createCmisAtomPubBinding(CmisBindingFactory.java:146)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at org.apache.chemistry.opencmis.client.runtime.CmisBindingHelper.createAtomPubBinding(CmisBindingHelper.java:98)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at org.apache.chemistry.opencmis.client.runtime.CmisBindingHelper.createBinding(CmisBindingHelper.java:56)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at org.apache.chemistry.opencmis.client.runtime.SessionFactoryImpl.getRepositories(SessionFactoryImpl.java:133)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at org.apache.chemistry.opencmis.client.runtime.SessionFactoryImpl.getRepositories(SessionFactoryImpl.java:112)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at it.francescocorti.kettle.cmisinput.CmisSessionFactory.getNewSession(Unknown Source)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at it.francescocorti.kettle.cmisinput.CmisSessionFactory.getSession(Unknown Source)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at it.francescocorti.kettle.cmisinput.CmisInputMeta.getSession(Unknown Source)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at it.francescocorti.kettle.cmisinput.CmisInputMeta.getFields(Unknown Source)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at it.francescocorti.kettle.cmisinput.CmisInput.processRow(Unknown Source)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at org.pentaho.di.trans.step.RunThread.run(RunThread.java:62)
2015/07/25 16:25:15 - Cmis Input documents before last update.0 - at java.lang.Thread.run(Thread.java:722)

In this post I would like to face this issue, describing the reasons of this behaviour and focusing on the solution (because there is a solution). Read More

23 Dec

Review of the Pentaho Data Integration video course by Itamar Steinberg

pentaho data integration video courseIn this post I have the opportunity to share the review of a brand new Pentaho Data Integration video course by Itamar Steinberg. The full name of the course is mastering data integration (ETL) with pentaho kettle PDI and is available for purchasing on the Udemy website.

The video course is composed by 80 lectures and more then 10 hours of content. It is a walk-through of a real ETL project using Pentaho Data Integration (also known as Kettle), starting from the beginning of the design of the ETL with some easy steps that becomes more and more complex, layer by layer, as you go forward. This Pentaho Data Integration video course also cover some basic concepts of data integration and data warehouse techniques.

Read More

11 Dec

Uploading a mondrian schema to Pentaho using PDI

In this post is shared the solution to upload a mondrian schema to Pentaho BA Server, using the REST API through a transformation of PDI. If you take a look to this thread of the Pentaho forum, the goal seems to be a common problem so we think it could be a good idea to share the solution with the community. I hope this post will be helpful.

Development environment

The source code is developed and tested on a Windows platform and a Linux Ubuntu 14.04 LTS platform. Pentaho BA Server and Pentaho Data Integration are both in the 5.2 version.

Use case

Starting from a file containing the mondrian schema (a XML file), our goal is to develop a PDI transformation to define a Pentaho BA Server Data Source. Of course we would like to define the data source on the mondrian schema so we would like to define a so called “Analysis Data Source”.

Read More

08 Dec

CMIS queries on Alfresco CE 5.0.c

CMIS

CMIS is the most important standard for ECM interoperability. Alfresco is compliance with CMIS 1.1 with Apache Chemistry and CMIS queries are one of the most powerful way to use this ECM. In this post is shared some tests on CMIS queries on the brand new Alfresco Community Edition v5.0.c.

Description of the test environment

I have already shared the test environment I prefer to test Alfresco and CMIS. This is composed by:

  • An Alfresco v5.0.c installation. In this case I use a vanilla installation with uploaded 10K documents.
  • A Pentaho Data Integration (Kettle) v5.2 installation.
  • CMIS Input plugin for PDI to develop queries on Alfresco.

To understand how the CMIS Input plugin works, take a look at the demonstration page here.

Read More

01 Dec

A.A.A.R. v2.2 with interactive dashboards and free analysis

Finally the date came!

Starting from the requests and the collaboration of some of you, the brand new A.A.A.R. v2.2 has been released. I would like to explicitly thank strategicfunctions.com for the contribution. As usual it has been an interesting experience.

The main new feature is obviously the interactive dashboard on audit trail.

Read More

10 Nov

CMIS Input plugin v1.3 for PDI v5.2… extract your data from your ECM.

PDI CMIS Input plugin

The change log page describes some more details but the most important feature is about the used of the Apache Chemistry libraries updated at the v0.12. This version is useful to support the CMIS 1.1 for Alfresco and more.

During the next hours the new version will be available in the Pentaho marketplace… stay tuned!