Hitachi Vantara Pentaho Community Wiki
Pentaho Software Architecture

Introduction

The purpose of this document is to provide a detailed view of the software components that, when combined, make up the entire Pentaho open source software suite.

At a high level, the software components can be categorized in two ways.  The first is by general organization, which the detailed list below follows: third party libraries and components that Pentaho has needed to fork and maintain, common libraries and projects that are used in general ways, pillars that are core business analytics or data integration elements, tools that allow access to pillars, and plugins across the pillars that provide additional functionality.  The second is by architectural purpose, which covers four general areas: information delivery, data movement, analytics, and platform services.  For each project below we categorize in both manners to give a multi-faceted view of the overall architecture of Pentaho.
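The two-way categorization described above can be sketched as a small data model. This is only an illustration, not part of any Pentaho codebase: the names `Organization`, `Area`, `Project`, and `group_by` are hypothetical, the organization value given to each project is inferred from its position in the listing, and the architectural area assigned to each project is a guess made for the sake of the example.

```python
from dataclasses import dataclass
from enum import Enum


class Organization(Enum):
    """First view: how a project is organized within the suite."""
    THIRD_PARTY_FORK = "third party fork"
    COMMON = "common component"
    PILLAR = "pillar"
    TOOL = "tool"
    PLUGIN = "plugin"


class Area(Enum):
    """Second view: the architectural purpose a project serves."""
    INFORMATION_DELIVERY = "information delivery"
    DATA_MOVEMENT = "data movement"
    ANALYTICS = "analytics"
    PLATFORM_SERVICES = "platform services"


@dataclass(frozen=True)
class Project:
    name: str
    organization: Organization
    area: Area


def group_by(projects, attribute):
    """Group project names by one of the two categorization attributes."""
    grouped = {}
    for project in projects:
        grouped.setdefault(getattr(project, attribute), []).append(project.name)
    return grouped


# Organization is inferred from list position; every Area value is an assumption.
PROJECTS = [
    Project("Kettle-VFS", Organization.THIRD_PARTY_FORK, Area.DATA_MOVEMENT),
    Project("Hive JDBC", Organization.THIRD_PARTY_FORK, Area.DATA_MOVEMENT),
    Project("Pentaho OFC4J", Organization.THIRD_PARTY_FORK, Area.INFORMATION_DELIVERY),
]

by_organization = group_by(PROJECTS, "organization")
by_area = group_by(PROJECTS, "area")
```

Keeping both attributes on each project lets the same list be viewed either way without maintaining two separate listings.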

Cross Cutting Architectures and Use Cases

This section discusses high-level, cross-cutting software architectures and use cases.

Version Control

At this time, Pentaho uses a combination of SVN and Git for managing source code.  Here is a related article:

http://wiki.pentaho.com/display/PEOpen/Advanced+Git+Topics

Metadata Definitions

As we continue to build a community of projects, it's important that they share terminology and common metadata.  Here is the beginning of an effort to capture shared metadata for use across all Pentaho projects:

http://wiki.pentaho.com/display/COM/Standard+MetaStore+Element+types

Detailed Software Listing

This detailed software listing is organized roughly in dependency order, with components appearing after the ones they depend on, although it should not be used as the official build order of Pentaho.

...

 Kettle-VFS (Fork of Apache VFS) (MattC)
 Hive JDBC (Will)
 Pentaho OFC4J (Will)

Kettle-VFS

Kettle-VFS is a maintained fork of Apache Commons VFS.

...

 Hive JDBC (Will)

 Pentaho OFC4J (Will)

Common Components

Pillars

Tools

Plugins