Using a Custom Input or Output Format in Pentaho MapReduce

Current version by Naveen Das on Apr 02, 2016 08:43.

h2. Deploy the JAR

# *Deploy the JAR to Pentaho:* Pentaho validates that the file and the input and output format classes exist before submitting the job to the cluster, so add the jar to Kettle's classpath by copying it to $KETTLE_HOME/libext/pentaho and restarting Spoon. If running from a Spoon client, copy the jar to the [shim|http://wiki.pentaho.com/display/BAD/Configuring+Pentaho+for+your+Hadoop+Distro+and+Version]'s lib/client folder.
# *Deploy the JAR to Hadoop:* Add the jar to Hadoop's distributed cache by running the following commands:
{code}hadoop fs -mkdir /distcache
{code}
|| Name || Value ||
| mapred.cache.files | Add ,/distcache/CustomFileFormats.jar to the existing value. |
| mapred.job.classpath.files | Add ,/distcache/CustomFileFormats.jar to the existing value. |
!ConfigureDistributedCache.PNG|width=626,height=356!
Click OK to close the window.
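
The table above asks you to add the jar path to a value that may already list other entries. As a rough sketch of that edit, assuming each property holds a plain comma-separated list, the helper below builds the new value and only inserts the separator when the existing value is non-empty. The function name append_entry and the path /distcache/other.jar are hypothetical and used only for illustration; they are not part of Hadoop or Pentaho.

```shell
#!/bin/sh
# Hypothetical helper (not a Hadoop or Pentaho command): append ENTRY to a
# comma-separated property value, adding the comma only when the existing
# value is non-empty.
append_entry() {
  existing="$1"
  entry="$2"
  if [ -z "$existing" ]; then
    printf '%s\n' "$entry"
  else
    printf '%s,%s\n' "$existing" "$entry"
  fi
}

# Existing value is empty: the result is just the jar path.
append_entry "" "/distcache/CustomFileFormats.jar"

# Existing value already lists a (hypothetical) jar: the path is appended after a comma.
append_entry "/distcache/other.jar" "/distcache/CustomFileFormats.jar"
```

The printed value is what you would paste into the Value column for the property; the sketch does not modify any Pentaho or Hadoop configuration itself.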
