During the past months, many books have been published about Hadoop technology and for sure this is one of the most popular IT buzzword. Of course, the majority part of the books are for neophytes or people that want to know better the MapR solution and all the Hadoop ecosystem. From this point of view, this book could be classified as an introductory book to the solution architect and IT developers. This classification is clearly represented in the first part of the book (in particular in chapter 1 – Big data overview). Read More
Do you really know what Apache Hadoop is?
Are you sure to understand the meaning of “big data” in the real world scenario?
How big data storage issue and data warehouse issue meet Hadoop implementation?
Which are the main tools Apache Hadoop is based on?
If you completely don’t know (but you are interested in) or you want to have a clear and final picture of those topics (and probably much more) you should read this book. Read More
In this post I would like to share my personal review about the recent book from Packt Publishing called Scaling Big Data with Hadoop and Solr (Second Edition) by Hrishikesh Vijay Karambelkar.
The goal of the book is quite clear in the title too: describe in practice how Apache Hadoop and Apache Solr, help organizations to resolve the problem of information extraction from big data. Don’t you think this is a very interesting problem to face? I think so.
The step by step tutorial describing how to install Alfresco 5.0.a Community Edition on Ubuntu 14.04 LTS, is definitely one of the most accessed pages of this blog. Nothing new, considering how was accessed the same tutorial on Alfresco 4.2.c. This has been a nice surprise and I receive lots of feedback sharing that this is helpful for someone. I like it.
In the past days I modified the tutorial, following some hints and suggestions of some Alfresco experts and the community in general. I’m very angry with some of them because they did not accept to be quoted (I’m joking) but in every case I’m very happy to submit to your attention the revised version of the tutorial.
One of the conclusions of this review is that would be the time to develop another new version with the Solr4 installation tutorial, that has been introduced as mandatory from Alfresco 5.0.c. I’ll working on it…
Until then: thank you community for the feedback. 🙂
In this post I would like to submit to you the brand new book from Packt Publishing called Learning Alfresco Web Scripts by Ramesh Chauhan. The book is easy to read with its 182 pages, full of practical examples and portion of source code.
Alfresco web scripts are fundamentals to develop a real enterprise solution for the Customers. For this reason, having a complete (and updated) guide on how to understand and properly develop them, is the core of the success of a project.
In this post I have the opportunity to share the review of a brand new Pentaho Data Integration video course by Itamar Steinberg. The full name of the course is mastering data integration (ETL) with pentaho kettle PDI and is available for purchasing on the Udemy website.
The video course is composed by 80 lectures and more then 10 hours of content. It is a walk-through of a real ETL project using Pentaho Data Integration (also known as Kettle), starting from the beginning of the design of the ETL with some easy steps that becomes more and more complex, layer by layer, as you go forward. This Pentaho Data Integration video course also cover some basic concepts of data integration and data warehouse techniques.
If you are interested in Hadoop technology probably this is an interesting video course you should evaluate. As you probably know, Apache Hadoop is an open-source software framework for storage and large-scale processing of data-sets on clusters of commodity hardware. All the modules in Hadoop are designed with the assumption that hardware failures are common and thus should be automatically handled in software by the framework.
Talking about the video course, we can divide the content in three main macro-sections:
1. how to create and set up a three machines cluster using Amazon EC2,
2. how to install an Hadoop cluster using Apache Ambari,
3. how to start using Hadoop cluster, in particular with Apache Hadoop User Interface (HUE).
The description of all the topics is clear and well done (Sean Mikha, the author, did a good job). All the relevant topics are always detailed before with an explanation of the logic structure and approach and only after with a demostration on how to do it in practice.
Useful also for other purposes, the creation of the virtual machines on Amazon EC2. The practical description and the step by step creation, is not limited to the server’s creation but is detailed also in what concerns the security and connection using, for example, putty ssh client.
In my opinion the most relevant value of this video course is on the hidden details of the Hadoop cluster installation process. As you will see if you will decide to follow it, the tasks are quite easy to do (probably this a Sean’s merit) but the configuration details and settings are very important if you want to make it work in practice. Following the hints I’m sure every neophyte will gain days of work and lot of nights in googling. 😉
As you probably know (or you read it now for the first time) CMIS is an open standard that allows different ECMs to inter-operate over the Internet through the definition of a collection of services and a powerful query language (CMIS-QL), modeled along a subset of SQL.
The goal of this book is to share and explain all the basics of the CMIS, using a practical and technical approach that starts from the history (why the CMIS was born), going through the definition of the (several) services and the query language, and ending with a collection of examples describing how to use CMIS in practice.
As you can read from the title, Martin Bergljung focuses his description on Alfresco. And this is true because all the examples are developed using an Alfresco repository as referred architecture. But inside the book you can find something more about Alfresco. Personally I have found very interesting the description of the Alfresco Surf together with CMIS standard. Probably this topic is less useful for the most part of the readers (and practical cases) but is an interesting example related to the basics of the Alfresco Share application. Quite interesting also the example on how to make Alfresco and Drupal interact, using CMIS.
Last but not least, I read the book very easily in the first part (the more descriptive) and in the last (full of practical examples in the different languages). I think I will use the book also as manual of the several CMIS services when I will develop something because I suggest you to remember that…
Standard is good!
During this last period I read the new book about Alfresco. The book is titled “Alfresco 4 Enterprise Content Management Implementation” and is Published by Packt Publishing.
The book it’s the natural update and evolution of the famous “Alfresco 3 Enterprise Content Management Implementation”, published in June 2009 and today not so updated. And before, of the “Alfresco Enterprise Content Management Implementation”, published in January 2007 by Packt Publishing.
You know, Alfresco features are growing very rapidly, month after month and release after release.
The fact that the book has more than 500 pages and it is written by 8 different authors (all of them are skilled IT professionists of Cignex Datamatics) clearly means that the content is relevant and detailed. Reading this book, infact, you will find that the content is not so narrative but oriented on practical topics and solutions.
The goal of the authors is relevant: explain Alfresco to IT specialists, starting from scratch and diving deep in the user experience, the customization, the development and the administration. Not bad in a single book!
The book is not specifically for developers (like the authors says) but for administrators, experienced users, developers, in particular for IT consultants.
I have found the description of the installation of the so called “Alfresco bundle” release together with the installation of all the Alfresco components, very useful because it’s not so easy to find, around in the web.
Very interesting, in my opinion, the description of the workflow using Activiti (JBPM is not described even if it supported by Alfresco 4) because it is concrete, developing practical examples for real life cases.
Last but not least, one final consideration about the explanation strategy: in my opinion too much descriptions are developed on Alfresco Explorer. We all know that Alfresco Explorer won’t be improved by Alfresco developers and Alfresco Share is the “modern” user interface. I think could be a good idea, for the next edition of the book, to move the most part of the examples (and images) to it.
In conclusion, I suggest this book to all the people that are not Alfresco Experts but would like to understand better what Alfresco is and how Alfresco works, with a good level of detail.
Have a good read during the next holidays. 😉