Hitachi Vantara Pentaho Community Wiki

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Pentaho

...

Big

...

Data

...

Plugin

Image Added width=32px height=32px
The Pentaho Big Data Plugin Project provides support for an ever-expanding BigData community within the Pentaho ecosystem. It is a plugin for the Pentaho Kettle engine which can be used within Pentaho Data Integration (Kettle), Pentaho Reporting, and the Pentaho BI Platform.

<TODO: convert this into a list of currently supported items?>Highlights of the project are to provide support for interacting with Apache Hadoop, Apache Hive, Apache HBase, MongoDB, and Cassandra among other NoSQL data sources for the Pentaho ecosystem.

Pentaho Big Data Plugin Features

This project contains the implementations for:

  • Pentaho MapReduce: visually design MapReduce jobs as Kettle transformations
  • HDFS File Operations
  • Hive
  • HBase
  • Cassandra
  • MongoDB

Key Links

...

  • (GitHub

...

  • mirror:

...

  • <TODO>)

...

  • Documentation:

...

  • <TODO:

...

  • add

...

  • dev

...

  • doc

...

  • page

...

  • and

...

  • aggregate

...

  • links

...

  • to

...

  • wiki

...

  • pages

...

  • such

...

  • as

...

...

...

  • ,

...

...

...

  • ,

...

...

...

  • ,

...

...

...

...


  • Download:

...

  • <TODO>

...

Community

...

and

...

where

...

to

...

find

...

help

...

The

...

Big

...

Data

...

Forum

...

exists

...

for

...

both

...

users

...

and

...

developers.

...

The

...

community

...

also

...

manages

...

the

...

##pentaho

...

IRC

...

channel

...

on

...

irc.freenode.net.

...

Quick

...

Start:

...

Building

...

the

...

project

...

The

...

Pentaho

...

Big

...

Data

...

Plugin

...

is

...

built

...

with

...

Apache

...

Ant

...

and

...

uses

...

Apache

...

Ivy

...

for

...

dependency

...

management.

...

All

...

you'll

...

need

...

to

...

get

...

started

...

is

...

Ant

...

1.8.0

...

or

...

newer

...

to

...

build

...

the

...

project.

...

The

...

build

...

scripts

...

will

...

download

...

Ivy

...

if

...

you

...

do

...

not

...

already

...

have

...

it

...

installed.

...

}
Code Block
svn co svn://source.pentaho.org/svnkettleroot/pentaho-big-data-plugin/trunk pentaho-big-data-plugin
cd pentaho-big-data-plugin
ant{code}

h1. Developing with Eclipse

We recommend [Apache IvyDE|http://ant.apache.org/ivy/ivyde/] to manage your Ivy dependencies within Eclipse.

# Import 

Developing with Eclipse

We recommend Apache IvyDE to manage your Ivy dependencies within Eclipse.

  1. Import pentaho-big-data-plugin

...

  1. into

...

  1. Eclipse

...

  1. Resolve

...

  1. the

...

  1. project

...

  1. using

...

  1. IvyDE

...

If

...

IvyDE

...

is

...

not

...

an

...

option

...

then

...

you

...

can

...

manually

...

add

...

the

...

jars

...

from

...

lib/

...

and

...

libswt/

...

to

...

your

...

class

...

path.

...

This

...

project,

...

like

...

all

...

other

...

Pentaho

...

projects,

...

uses

...

the

...

open-source

...

Subfloor

...

Ant

...

build

...

framework.

...

Running

...

the

...

following

...

targets

...

will

...

configure

...

the

...

Eclipse

...

project

...

to

...

reference

...

the

...

required

...

libraries:

...

}
Code Block
ant resolve create-dot-classpath{code}

Then

...

import

...

or

...

refresh

...

the

...

project

...

in

...

Eclipse

...

and

...

add

...

the

...

SWT

...

libraries

...

for

...

your

...

architecture,

...

e.g.

...

for

...

Mac

...

OS

...

X

...

x64:

...


Image Added