How to create a report that sources data from Hive.
To follow along with this how-to guide, you will need the following:
- Pentaho Data Integration
- Report Designer
- RDBMS Database
The sample data file needed for this guide is:
Tab-delimited, aggregated weblog data for a Hive weblogs_agg table
NOTE: If you have already completed the Transforming Data within Hive guide, the required Hive table, weblogs_agg, has already been created and populated.
Start Hadoop if it is not already running.
Start Hive Server if it is not already running.
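Before continuing, it can help to confirm that something is actually listening on the Hive Server port. The following is a minimal sketch, not part of the original guide: it assumes Hive Server runs on localhost and uses port 10000 (Hive Server's default Thrift port), and the helper name is_port_open is made up for illustration. A successful connection only shows that a process is listening on that port, not that it is Hive.

```python
import socket


def is_port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name resolution failures.
        return False


if __name__ == "__main__":
    # Assumed values: localhost and 10000 (Hive Server's default port).
    # Adjust both if your installation differs.
    host, port = "localhost", 10000
    state = "reachable" if is_port_open(host, port) else "not reachable"
    print(f"Hive Server at {host}:{port} is {state}")
```

If the port is reported as not reachable, start Hive Server and run the check again.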
Create a Hive Table
NOTE: You can skip this task if you have completed the Transforming Data within Hive guide.