Hitachi Vantara Pentaho Community Wiki



This page will serve as the central location for organizing integration efforts between Pentaho Data Integration and a number of Data Quality platforms provided by our technology partners.

Integrating Data Quality Solutions with Pentaho

Data Quality as it relates to BI and Data Integration refers to the processes and technologies involved in ensuring the conformance of data values to business requirements and acceptance criteria.  Poor Data Quality can have a significant negative impact on business performance in many areas including:

  • Increased costs in direct mailing campaigns caused by bad address data
  • Back office implications in operational areas such as billing, accounting and credit management due to poor quality customer data
  • Compliance, security and/or privacy issues caused by duplicate or bad customer data
  • And the examples go on and on

Your ETL and Data Integration infrastructure is a natural place to integrate data quality best practices and technologies, both to validate that the data you currently have is clean and to act as a 'data quality firewall' ensuring that new data entering the system also conforms to the quality standards of the business.

Pentaho Data Integration provides a highly scalable and extensible platform for addressing your ETL and Data Integration needs, and its pluggable architecture is a natural fit for integrating with industry-leading Data Quality solutions from a variety of technology partners.  The sections that follow describe several active integration projects between Pentaho Data Integration and products from industry leaders in the Data Quality space.

Highlighted Technology Partners

EmergeIT develops tools and techniques that provide powerful Data Integration solutions for Businesses that want to take advantage of the "Information Universe" to drive competitive advantage.

Human Inference Data Quality solutions enable you to trust and find your data – at all times. The integrated HIquality software platform offers various kinds of data quality processes for all stages of the Data Quality Life Cycle in one suite.

MelissaData's Data Quality Suite ensures the integrity of your database, saves money on postage, and increases response rates with the power and flexibility of four Data Quality Objects. Add rapid lookup, verification and correction routines to your custom applications to achieve clean, accurate, usable contact data.

Uniserv is the largest pure-play supplier of data quality solutions in Europe with an internationally usable software portfolio and services for data management, i.e. for securing data quality including data integration. Uniserv's solutions enable it to support its customers in initiatives for data quality and projects for data integration, data migration and data consolidation.

Information for Technology Partners

Integrating with Pentaho Data Integration - If you are a software provider in the Data Quality space and are interested in integrating your solution with Pentaho Data Integration, this page provides information and links to resources that will help you get started.
