Hitachi Vantara Pentaho Community Wiki
Child pages
  • Pentaho Software Architecture

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migration of unmigrated content due to installation of a new plugin

...

http://wiki.pentaho.com/display/ServerDoc2x/Pentaho+Prompting+API

Pentaho Coding Standards

Cross cutting coding standards for all modules of the Pentaho suite can be found on our github project, this includes configurations for the most popular IDEs.

https://github.com/pentaho/pentaho-coding-standards

Additional content needed around:

Visualizations

Logging

Plugin Architectures

Platform / BA Server Related

Scheduling and Background Execution in Pentaho User Console: Plugin Scheduling and Background Execution 

Intro for creating a REST service for the BA Server: How to create and register a new REST service from a plugin

Developing Plugins Developing Plugins

Kettle Related

Extending Kettle (Infocenter SDK)

UI Technologies

Datasources

...

Pentaho ChartBeans Flash components, which are still used by Pentaho Dashboards and Action Sequences, are based on Open Flash Chart.  OFC4J is a Java to JSON converter that is used to generate the correct metadata for the charts on the server that is no longer maintained by the creator of the project.

Source Path: svn https://sourcegithub.pentaho.orgcom/svnroot/pentaho-commons/pentaho-ofc4j

Architectural Owner: Will Gorman

...

This commons project provides an API for interacting with platform connections, usually within the context of xactions.  Pentaho Metadata also uses this API for providing access to metadata in result sets.

Source Path: svn https://sourcegithub.pentaho.orgcom/svnroot/pentaho-commons/pentaho-connections

Architectural Owner: Will Gorman

...

This commons project provides an apache-vfs implementation of Hadoop HDFS.  This is used by Kettle for accessing HDFS in a number of contexts.

Source Path: svn https://sourcegithub.pentaho.orgcom/svnroot/pentaho-commons/pentaho-hdfs-vfs

Architectural Owner: Will Gorman

...

This project hosts the APIs and basic logic for checking the version information of a currently running platform.

Source Path: svn https://sourcegithub.pentaho.orgcom/svnroot/pentaho-commons/pentaho-versionchecker

Architectural Owner: Will Gorman

Architectural Area: Platform Services

pentaho-palo-core

This project is an abstraction layer that insulates the uses of Palo within the product so there is no compile time dependency, due to Palo's GPL licensing.

Source Path: svn https://sourcegithub.com/pentaho.org/svnkettleroot/plugins/PaloCore/pdi-palo-core

Architectural Owner: Matt Casters

...

This is a legacy project that defines the Pentaho to CWM object model API mapping, the code in the project is mostly generated, and is deprecated with the new Pentaho Metadata thin API.  This module can be completely removed once Pentaho Metadata Editor is replaced by a new editor.

Source Path: svn https://sourcegithub.pentaho.orgcom/svnroot/pentaho-commons/pentaho-cwm

Architectural Owner: Will Gorman

...

This project provides a simple UI for business users to define Pentaho Metadata Queries.  It is based on XUL so it can be used in Swing, SWT and GWT.

Source Path: svn https://sourcegithub.pentaho.orgcom/svnroot/pentaho-commons/pentaho-mql-editor

Architectural Owner: Nick Baker

...

This project provides a document object model for core action sequence file parsing.  It also manages validation of action sequence files (.xaction).  In the long term Pentaho is replacing Action Sequence functionality with Kettle Transformations and Job capabilities.

Source Path: svn https://sourcegithub.pentaho.orgcom/svnroot/pentaho-commons/pentaho-actionsequence-dom

Architectural Owner: Will Gorman

...

This project was Pentaho's early attempt at a common Chart definition and API.  We have sense began a transition to the Visualization API, which is part of the pentaho platform plugin common-ui.  This project is still in use today by Dashboard Designer and Action Sequences, but will be phased out in a future release.

Source Path: svn https://sourcegithub.pentaho.orgcom/svnroot/pentaho-commons/pentaho-chartbeans

Architectural Owner: Will Gorman

Architectural Area: Information Delivery

publisher (deprecated)

This project contains a utility class allowing basic publishing to the Pentaho Platform.  This project may not be used in 5.0, this needs investigation.

Source Path: svn://source.pentaho.org/svnroot/pentaho-commons/pentaho-publisher

Architectural Owner: Will Gorman

Architectural Area: Shared Services

Anchor
pillars
pillars

Pillars

...

Kettle is a metadata-based data integration engine.  There are two runtimes, data transformation and job orchestration.  Kettle includes an SWT based designer called Spoon for design and execution, as well as a server called Carte for remote execution and command line tools for execution as well.

Source Location: svn https://sourcegithub.com/pentaho.org/svnkettleroot/Kettle/pentaho-kettle

Architectural Owner: Matt Casters

...

Architectural Area: Platform Services

Related Architectural Documentation:

weka

Weka is a suite of machine learning software written in Java.

...

The Aggregation Designer is a wizard that allows for manual and automatic creation of aggregates for use with Mondrian.  This application is based on pentaho-xul, and by default runs within the Swing container. This functionality will eventually be merged into the Pentaho Modeler.

Source Location: svn https://sourcegithub.com/pentaho.org/svnroot/pentaho-aggdesigner

Architectural Owner: Will Gorman

...

The Pentaho Metadata Editor allows for editing of Pentaho Metadata schemas.  This application was developed using SWT.  This functionality will eventually be merged into the Pentaho Modeler.

Source Location: svn https://sourcegithub.com/pentaho.org/svnroot/pentaho-metadata-editor

Architectural Owner: Will Gorman

...

Architectural Owner: Pedro Alves

*Architectural Area: * Data Management / Integration

reporting

The reporting plugin contains the elements necessary for executing Pentaho Reporting within the context of the platform.  Included is a thin- client report viewer based on CDF, and backend execution components for running and scheduling reports.

...

Architectural Area: Information Delivery

Architectural overview: DOC and PPT

pdi

The PDI plugin allows native execution of transformations and jobs within the BI Server.  At this time it is not bundled by default in the BI Server.

Source Location: svn https://sourcegithub.com/pentaho.org/svnroot/platform-plugins/pdi-platform-plugin

Architectural Owner: Nick Baker

...

The Agile BI Plugin allows a Spoon user to quickly visualize a relational database table in a free embedded version of Pentaho Analyzer.  Agile BI will automatically generate a mondrian schema, It includes the pentaho-modeler for editing the schema after auto-creation.

Source Location: svn https://sourcegithub.com/pentaho.org/svnkettleroot/pdi-agile-bi/trunk-plugin

Archtiectural Owner: Nick Baker

...

The ARFF output plugin is a tool that allows you to output data from Kettle to a file in WEKA's Attribute Relation File Format (ARFF).

Source Location: svn https://sourcegithub.com/pentaho.org/svnkettleroot/plugins/ArffOutput/trunk/pdi-weka-arff-output-plugin

Architectural Owner: Mark Hall

...

The Weka scoring plugin is a tool that allows classification and clustering models created with Weka to be used to "score" new data as part of a Kettle transform.

Source Location: svn https://sourcegithub.com/pentaho.org/svnkettleroot/plugins/WekaScoring/trunk/pdi-weka-scoring-plugin

Archtiectural Owner: Mark Hall

...

This plugin to Mondrian Schema Workbench allows for publishing of a Mondrian schema to the Pentaho Business Analytics Server.

Source Location: svn https://sourcegithub.com/pentaho.org/svnroot/pentaho-mondrianschemaworkbench-plugins/trunk

Architectural Owner: Will Gorman

...