Added by Jens Bleuel, last edited by Mark Hall on Aug 10, 2010

This space lists the available plug-ins for Pentaho Data Integration and keeps their internal IDs unique.

Notes
  • If you want your plug-in listed here, please contact communityconnection@pentaho.org. Include the information for the table below, the Kettle version (2 or 3) and your wiki ID. The link can point to a web page that you host, or we can create a wiki page that you maintain. If you also need space in our Subversion plugin repository, we can add a project for you at svn://source.pentaho.org/svnkettleroot/plugins.
  • Unless stated otherwise, the plug-ins listed on this page are free to download and use, but are not officially supported as part of a paid Pentaho Subscription.
  • Plugins for version 2 are listed here.

 Plugins - Job Entries

Unique ID | Name | Short Description / Remarks | PDI Version / Download
DummyJob | Dummy Job Entry | Dummy plugin test job entry; this can serve as a blueprint for other job entries. | 3.x
- General
JabberMessage | JabberJob | Allows sending Jabber messages (for example, to Google Talk). | 3.x (bin) / (src)
MondrianOutput | Mondrian Output | Outputs the result of a Mondrian MDX query to an Excel file or chart. Jason Chu | 3.x (info) / (bin) / (src)

 Plugins - Transformation Steps

Unique ID | Name | Short Description / Remarks | PDI Version / Download
DummyPlugin | Dummy Plugin | Dummy plugin test step; this can serve as a blueprint for other steps. | 3.x
- Input Steps
DateGenerator | Date Generator | Generates a sequence of consecutive dates. | 3.0, 3.1
IMS Database Loader | Proventa IMS Database Loader | Reads (large) IBM IMS database dumps quickly and easily. This is a commercial (closed source) plugin by Proventa AG. | 3.x
ProERPCONN | PRORATIO - SAP Connector | ProERPCONN offers convenient and quick access to the entire SAP dataset. This is a commercial (closed source) plugin by Proratio. | 3.x
SalesforceInputPlugin | SalesforceInputPlugin | Allows you to read data from Salesforce. (Translations: US & FR) Updated: 10 Dec. 2008 | 3.1
ShapeFileReader | ESRI Shapefile Reader | Reads shape data from an ESRI shapefile and its linked DBF file. | 3.x
SugarCRMModule | SugarCRM | The SugarCRM plugin allows you to access all data available in SugarCRM modules. | 3.x
SuperCsvInput | Super CSV Input | Reads CSV input files. | 3.1
- Input/Output Steps
PaloDimInput / PaloDimOutput / PaloCellInput / PaloCellOutput | PaloKettlePlugin | Use the Palo MOLAP database in ETL processes designed with Kettle. | 3.x
Rss Input/Rss Output | Rss Input/Output | One plugin reads RSS feeds and another writes feeds to a file. Note: This step has become a standard step in 3.2 and Pentaho Support is available. | 3.x
- Output Steps
ArffOutput | ARFF Output | Saves data to a file in WEKA's ARFF (Attribute Relation File Format). Note: Pentaho Support is available for this plug-in as part of a Pentaho Subscription. | 3.x, 4.x
SuperCsvOutput | Super CSV Output | Outputs to a CSV file. | 3.1
- Transform Steps
Asciify | Asciify | Asciifies strings: replaces characters outside the 7-bit ASCII range (e.g. diacritics) with a close equivalent, and replaces or removes any other character it cannot convert. | 3.0, 3.1
EncryptDecryptPlugin | Encrypt/Decrypt | Provides the ability to encrypt and decrypt the values of any field of a table/fixed file/CSV file. Plug-in posted by Persistent Systems Ltd. | 3.x
FieldCalculatorPlugin | Field Calculator | Performs arithmetic operations on the columns of a table. The derived values can then be stored in a table/fixed file/CSV file. Plug-in posted by Persistent Systems Ltd. | 3.x
Formula | Formula | Calculates values and evaluates expressions. This is based on the Open Office standard for expressions. Note: This step has become a standard step in 3.2 and Pentaho Support is available. | 3.x
Head | Head | Reads the first x rows of the stream. | 3.1
ReservoirSampling | Reservoir Sampling | Samples a fixed number of rows (with uniform probability) from the incoming stream. Note: This step has become a standard step in 3.2 and Pentaho Support is available. | 3.x
Tail | Tail | Reads the rows after the first x rows of the stream. | 3.1
TrimCut | TrimCut (Experimental) | Trims and cuts string values to size. | 3.0, 3.1
UnivariateStats | Univariate Stats | Computes simple univariate statistics. Available statistics include: N, minimum, maximum, mean, sample standard deviation, median and arbitrary percentiles (computed using a simple mid-point method or interpolation). Note: This step has become a standard step in 3.2 and Pentaho Support is available. | 3.x
WekaScoring (Weka 3.6 or 3.7.0) | Weka Scoring | Appends predictions (labels or probability distributions) from a pre-built WEKA model (classifier or clusterer). Compatible with Weka 3.6.x and 3.7.0. Note: Pentaho Support is available for this plug-in as part of a Pentaho Subscription. | 3.x, 4.x
WekaScoring (Weka >=3.7.2) | Weka Scoring | Appends predictions (labels or probability distributions) from a pre-built WEKA model (classifier or clusterer). Compatible with Weka >=3.7.2. Note: Pentaho Support is available for this plug-in as part of a Pentaho Subscription. | 3.x, 4.x
- Bulk Loading Steps
TeraFastPlugin | Teradata bulk loader | Maximum performance for large data loads into Teradata, by Aschauer EDV. | 3.x
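
The Reservoir Sampling entry in the table above describes drawing a fixed number of rows, with uniform probability, from a stream whose length is not known in advance. As a rough illustration of that idea only, here is a minimal Python sketch of the classic "Algorithm R" approach. It is not the plugin's code and makes no claim about how the Kettle step is implemented; the function name reservoir_sample and its parameters are hypothetical.

```python
import random
from typing import Iterable, List, Optional, TypeVar

T = TypeVar("T")


def reservoir_sample(rows: Iterable[T], k: int,
                     rng: Optional[random.Random] = None) -> List[T]:
    """Return up to k rows chosen uniformly at random from a stream.

    Hypothetical helper for illustration; not part of any Pentaho API.
    """
    if k < 0:
        raise ValueError("k must be non-negative")
    rng = rng or random.Random()
    reservoir: List[T] = []
    for i, row in enumerate(rows):
        if i < k:
            # Fill the reservoir with the first k rows.
            reservoir.append(row)
        else:
            # Pick a slot in [0, i]; the new row replaces an existing
            # one only if the slot falls inside the reservoir.
            j = rng.randint(0, i)
            if j < k:
                reservoir[j] = row
    return reservoir
```

For example, reservoir_sample(range(1000), 10) returns ten distinct values from 0 to 999 while holding only ten rows in memory at any time. If the stream has fewer than k rows, all of them are returned.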



Here are the attachments to the plugin development page, including the sample dummy step and job entry plugins for versions 2.5.x and 3.0.x.