Data Vault loading automation using Pentaho Data Integration

It’s a completely different ball-game, but open-source BI/DWH is not new.
I found this full-size demo environment based on MySQL, Pentaho and Pentaho Data Integration (the ETL tool).

More info, click here.

PDI Data Vault framework

Description
A metadata-driven ‘tool’ that automates loading a designed Data Vault. It consists of a set of Pentaho Data Integration objects and database objects. At the moment, the MySQL version includes the latest developments.
The PostgreSQL and Oracle versions will be published later.
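
To give an idea of what "metadata driven" means here, below is a minimal Python sketch. It is not taken from the PDI Data Vault framework, which does this with PDI jobs and transformations. It only shows the general idea of generating a hub load statement from a metadata row instead of hand-writing one per table. The `HubMapping` class, the `build_hub_load_sql` function, all table and column names, and the `load_dts` and `record_source` columns are assumptions made up for illustration. It also assumes the hub's surrogate key is an auto-increment column.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HubMapping:
    """Hypothetical metadata row describing how one hub is loaded."""
    hub_table: str       # target hub table (assumed name)
    business_key: str    # business key column in the hub
    source_table: str    # staging table that feeds the hub
    source_column: str   # staging column holding the business key
    record_source: str   # label stored with every new hub row


def build_hub_load_sql(m: HubMapping) -> str:
    """Build an INSERT ... SELECT that adds only business keys not yet in the hub."""
    return (
        f"INSERT INTO {m.hub_table} ({m.business_key}, load_dts, record_source)\n"
        f"SELECT DISTINCT s.{m.source_column}, NOW(), '{m.record_source}'\n"
        f"FROM {m.source_table} s\n"
        f"LEFT JOIN {m.hub_table} h ON h.{m.business_key} = s.{m.source_column}\n"
        f"WHERE h.{m.business_key} IS NULL\n"
        f"  AND s.{m.source_column} IS NOT NULL"
    )


# Illustrative metadata; in a real setup these rows would live in a metadata table.
mappings = [
    HubMapping("hub_customer", "customer_nr", "stg_customer", "cust_no", "CRM"),
    HubMapping("hub_product", "product_code", "stg_product", "prod_code", "ERP"),
]

# One generated statement per metadata row.
statements = [build_hub_load_sql(m) for m in mappings]
print(statements[0])
```

Adding a new hub then means adding one metadata row rather than building a new load by hand, which is the point of a metadata-driven approach.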

The virtual machine (VMware) runs 64-bit Ubuntu Server 12.04, with MySQL (Percona Server) as the database and PDI version 4.4.0 CE.

Version management is handled with Git (for the PDI objects) and neXtep (for the database objects).

User/passwd : percona/percona
MySQL user/passwd : root/percona
neXtep user/passwd : nextep_user/nextep_user

A possible architecture:

[Figure: Data Vault Pentaho architecture]

Thanks to Kasper de Graaf and Aly Hollander:
http://www.bi-podium.nl/mediaFiles/upload/DWHgen/Pentaho_en_DV_-_KdG.pdf
