Pentaho is owned by Hitachi Vantara, and is a separate business unit. Pentaho started out as business intelligence (BI) software developed by the Pentaho Corporation in 2004. It comprises Pentaho Data Integration (PDI) and Pentaho Business Analytics (PBA). These provide data integration, OLAP services, reporting, information dashboards, data mining and extract, transform, load (ETL) capabilities.
Pentaho was acquired by Hitachi Data Systems in 2015 and in 2017 became part of Hitachi Vantara. In November 2023, Hitachi Vantara launched the Pentaho+ Platform, comprising the original Pentaho Data Integration and Pentaho Business Analytics software, and new Pentaho Data Catalog, and Pentaho Data Optimiser software products. Hitachi Vantara intends to extend the Pentaho+ platform with tools for Data Quality and Data Mastering.
Pentaho Data Optimizer allows organizations to manage, maintain and tier their data based on its business value, the cost of managing it, and regulatory requirements. It uses the auto-discovery features of the Pentaho Data Catalog to achieve this.
In March 2020 and June 2021 Hitachi Vantara acquired Waterline Data and Io-Taho respectively, and amalgamated both into its Pentaho Data Catalog (PDC). PDC automatically finds, analyzes, and tags structured and unstructured data and contextualizes it with business glossary terms and governance policies.
Pentaho Data Integration (PDI) and Pentaho Business Analytics (PBA) use a Java framework to create business intelligence solutions. Although most known for its Business Analysis Server (formerly known as Business Intelligence Server), the PDI/PBA software is indeed a couple of Java classes with specific functionality. On top of those Java classes one can build any business intelligence solution.
The only exception to this model is the ETL tool Pentaho Data Integration - PDI (formerly known as Kettle.) PDI is a set of software used to design data flows that can be run either in a server or standalone processes. PDI encompasses Kitchen, a job and transformation runner, and Spoon, a graphical user interface to design such jobs and transformations.
Features such as reporting and OLAP are achieved by integrating sub-projects into the PDI/PBA framework, like Mondrian OLAP engine and jFree Report. For some time by now those projects have been brought into Pentaho's curating. Some of those subprojects even have standalone clients like Pentaho Report Designer, a front-end for jFree Reports, and Pentaho Schema Workbench, a GUI to write XMLs used by Mondrian to serve OLAP cubes.
Pentaho offers enterprise and community editions of those PDI software. The enterprise software is obtained through an annual subscription and contains extra features and support not found in the community edition. PDI & PBA's core offering is frequently enhanced by add-on products, usually in the form of plug-ins, from the company and the broader community of users.
Pentaho Enterprise Edition (EE) and Pentaho Community Edition (CE).
It supports the MDX (multidimensional expressions) query language and the XML for Analysis and olap4j interface specifications. It reads from SQL and other data sources and aggregates data in a memory cache. Mondrian can be run separately from the Pentaho BI Platform, but is always bundled with the platform itself in both EE and CE versions.
All of these plug-ins function with Pentaho Enterprise Edition (EE) and the older Pentaho Community Edition (CE).
CDC stands for Community Distributed Cache and allows for high-performance, scalable and distributed memory clustering cache based on Hazelcast for both CDA and Mondrian. CDC is a Pentaho plug-in that provides the following features:
Pentaho plug-in that allows the user to export CCC/CDE charts as images, enabling the inclusion of CDE charts inside Pentaho Report Designer reports. In short, this plug-in is able to render server-side exactly the same chart that is rendered on the browser by CDE/CDF.Main characteristics:
A RESTful server connects to existing OLAP systems, which then powers user-friendly, intuitive analytics via a lightweight frontend.
Pentaho followed an open core business model for several years, however with their 10.2 release in 2024 switched to non-OSS licensing.15 The new licensing doesn't allow for running in production without subscribing to their Enterprise Edition.
It provides two different editions of Pentaho Business Analytics: a Developer Edition (non production use only) and an Enterprise Edition. The enterprise edition needs to be purchased on a subscription model. The subscription model includes support, services, and product enhancements via annual subscription.16 The enterprise edition is available under a commercial license. Enterprise license goes with 3 levels of Pentaho Enterprise Support: Enterprise, Premium and Standard.
Michael Terallo, Pentaho Data Access Wizard Retrieved July 29, 2012 ↩
Surya Mukherjee, Ovum. "Pentaho expands coverage for Big Data." March 8, 2012. Retrieved April 11, 2012. http://ovum.com/2012/03/08/pentaho-expands-big-data-coverage/ ↩
James Kobielus, Forrester Research. "The Forrester Wave: Enterprise Hadoop Solutions." February 2, 2012. Retrieved May 10, 2012. http://www.forrester.com/The+Forrester+Wave+Enterprise+Hadoop+Solutions+Q1+2012/fulltext/-/E-RES60755 ↩
David Menninger, Ventana Research. "Pentaho 4 Unites Enterprise Business intelligence and Data Integration Archived 2012-04-20 at the Wayback Machine." June 22, 2011. Retrieved April 8, 2012. http://www.ventanaresearch.com/blog/commentblog.aspx?id=1577 ↩
Nikos Mastorakis, Valeria Mladenov and Vassiliki Kontargyri. "Proceedings of the European Computing Conference." Heidelberg, Germany: Springer Science and Business Media, 2009. ISBN 978-0387848136. p. 789. Retrieved July 11, 2012. https://books.google.com/books?id=avPz1cUPlKgC&dq=Proceedings%20of%20the%20European%20Computing%20Conference%2C&pg=PA789 ↩
Ed Woord, FLOSS FOR SCIENCE. "Machine Learning with WEKA: AN Interview with Mark Hall." July 1, 2012. Retrieved July 25, 2012 http://www.floss4science.com/machine-learning-with-weka-mark-hall/ ↩
Webdetails Consulting Company, Portugal http://www.webdetails.pt ↩
Pedro, Alves "Back to basics: Step by step Pentaho + Ctools installation" December 15, 2011, Retrieved July 27, 2012 http://pedroalves-bi.blogspot.com/2011/12/back-to-basics-step-by-step-pentaho.html ↩
Will, Gorman Pentaho Wiki "Pentaho BI Server Marketplace Plugin February 17, 2012, Retrieved July 27, 2012 http://wiki.pentaho.com/display/PMOPEN/Pentaho+BI+Server+Marketplace+Plugin ↩
Stanford Visualization Group, Protovis https://mbostock.github.com/protovis/ https://mbostock.github.com/protovis/ ↩
CDA Documentation Retrieved July 26, 2012. http://cda.webdetails.org/?q=content/documentation-data-accesses ↩
CDA web API reference: doQuery Retrieved July 27, 2012 http://cda.webdetails.org/?q=content/documentation-web-api-reference ↩
"CDF Documentation". Archived from the original on 2012-06-21. Retrieved 2012-07-26. https://web.archive.org/web/20120621204833/http://cdf.webdetails.org/ ↩
"CST Documentation". Archived from the original on 2011-07-12. Retrieved 2012-07-26. https://web.archive.org/web/20110712224901/http://cst.webdetails.org/ ↩
"Pentaho Server Community Version 10.2 is not available to download". Hitachi Vantara. August 29, 2024. https://community.hitachivantara.com/question/pentaho-server-community-version-102-is-not-available-to-download#ViewAnswer_8a2357c2-683e-470b-bbde-01919e44fbbd-answer-content ↩
Torben Pedersen and Mukesh Mohania. "Data Warehousing and Knowledge Discovery." Heidelberg, Germany: Springer Science and Business Media, 2009. ISBN 978-3642037290. p.296-298. Retrieved April 6, 2012. https://books.google.com/books?id=6292rhpRJWMC&dq=Open%20Source%20BI%20Platforms%3A%20A%20Functional%20and%20Architectural%20Comparison&pg=PP1 ↩