Hitachi Vantara Pentaho Community Wiki

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migration of unmigrated content due to installation of a new plugin

...

  • Drill down into running job entries
  • Visual indicators of running and completed job entries with success and failure mini-icons
  • Mouse over completion mini-icons shows details of execution results
  • Log capturing of completed job entries

Running transformations in Spoon

...

  • Reduced memory consumption
  • Incremental log updates
  • Maximum memory consumption Global log buffer size limit for long running jobs/transformations
  • Interval logging
  • Auto clean-cleanup up of old log tablesrecords
  • Log record time-outs
  • Log record lineage
  • Log record color codingcolour coding in Spoon (blue and red for error lines)
  • Step Logging
  • Job entry logging
  • Execution lineage logging
  • Renaming individual columns
  • Global configuration options for all log tables

New

...

plug-in architecture

  • Unified plugin plug-in architecture
  • Easier deployment and packaging
  • Step, job entry, partitioner, database type, spoon perspective, life-cycle, ... : all pluggable

New repository

...

plug-in architecture

  • Allowing for 3rd party repositories like the Pentaho Unified Enterprise Repository
  • ...
  • Removed dependencies to relational database repository (still supported though)
  • Added support for repositories capable of team-development (file locking)
  • Added support for repositories capable of fine-grained security repositories
  • Added support for repositories capable of storing and retrieving revision history

Step changes

Anchor
Step changes
Step changes

...

  • SAP Input: Reads data from an SAP/R3 application server. (needs jsapco.jar not included in PDI)
  • Data Grid : Allows you to enter static rows of data for reference or testing purposes
  • OLAP Input: read data from an OLAP server using olap4j over XML/A: Mondrian, Palo, SSAS, SAP B/W
  • Salesforce Delete, Insert, Update, Upsert
  • Add fields changing sequence: a sequence that gets reset when the values in a set of fields changes. (group sequence)
  • User Defined Java Class: create your own plugin on the fly in a step (coming out of incubation)
  • Send information using Syslog: Send a message to a Syslog server. http://en.wikipedia.org/wiki/Syslog
  • Java Filter : Filter based on a User Defined Java Expression
  • Memory Group By: for smaller groups you can keep the intermediate statistical results in memory leading to faster results
  • Farrage LucidDB streaming bulk loader
  • Teradata Fastload Bulk loader
  • Experimental steps added: Get table names, Email messages input, ...

...

Anchor
Repository
Repository

  • New repository plugin plug-in architecture
  • New Pentaho Unified Enterprise Repository type
  • New File repository type
  • New repository explorer
  • ...

...

This is what that gave us over the last releases:  

Version

Lines of code

Increase

% inc.

2.1.4

160,000

 

 

2.2.2

177,450

  17,450

10.9%

2.3.0

213,489

  36,039

20.3%

2.4.0

256,030

  42,541

19.9%

2.5.0

292,241

  36,211

14.1%

3.0.0

348,575

  56,334

19.3%

3.1.0

456,772

108,197

31.0%

3.2.0

529,277

  72,505

15.8%

4.0.0

607,180

  77,903

14.7%


Libraries

The total library portfolio of Pentaho Data Integration consists of these libs:

...