Hitachi Vantara Pentaho Community Wiki

Versions Compared


  • This line was added.
  • This line was removed.
  • Formatting was changed.


This wiki space is the community home and collection point for all things "Big Data" within the Pentaho ecosystem. This is the place to find documentation, how-to's, best practices, use-cases and other information about employing Pentaho technology as part of your overall Big Data Strategy. It is also a place where you can share your own information and experiences using Pentaho Big Data technology. We look forward to your participation and contribution!


  • Very large data volumes measured in terabytes or petabytes
  • Variety of structured, unstructured and semi-structured data
  • High velocity rapidly changing data
  • Datasets that grow so large that they become awkward or uneconomic to work with using traditional database management and BI tools. Analyzing big data allows analysts, data scientists and now casual business users to do things not previously possible including identifying business trends, preventing diseases and combatting crime.

End of the Big Data philosophy discussion - for Pentaho marketing fluff, please visit: