A recipe for executing Weka in Hadoop.
This package for Weka >= 3.7.10 provides several jobs for executing learning tasks inside of Hadoop. These include:
A full-featured command line interface is available along with GUI Knowledge Flow components for job orchestration. Predictive models learned in Hadoop are fully compatible with Pentaho Data Integration's "Weka Scoring" transformation step.
More information on what is available in the distributed Weka package, and how it is implemented, can be found in a three part blog posting:
Open Weka's package manager (GUIChooser->Tools->Package manager) and install "distributedWekaHadoop".