Before developing a course, we listen to each client's real needs and objectives so that we can adapt the training and make sure it delivers a high return. We adjust each course to your needs.

We also specialize in in-company training tailored to the needs of each organization, where the benefit for several participants from the same company is much greater. If this is your case, contact us.

Check our courses

ETL Kettle Pentaho


Goal

Data extraction, transformation, and loading (ETL) is key to the success of a BI system, because it is where data quality is properly managed.

This course covers some of the best practices we recommend for designing ETL processes, such as:

Centralizing procedures, so that data exchanged between different sources stays coherent and consistent.

Avoiding redundant calculations: if a value has already been calculated in the operational databases, it should not be calculated again during extraction. This premise aims to achieve a double objective.

Establishing "quality control" and validation checkpoints.

Implementing data loading processes that account for possible errors in the initial information.

Considering the use of intermediate tables that hold the information to be processed at its most atomic level.
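As a rough illustration of the practices above, the following minimal Python sketch stages rows in an intermediate table, runs a quality-control checkpoint, and loads only the valid rows. The table names (stg_orders, fact_orders, rejected_orders), the sample data, and the validation rules are hypothetical and chosen only for this example. The line_total column is assumed to be already calculated by the operational system, so it is reused rather than recomputed.

```python
import sqlite3

# Hypothetical source rows. "line_total" is assumed to be precalculated by the
# operational system, so the ETL reuses it instead of recomputing it.
SOURCE_ROWS = [
    {"order_id": 1, "customer": "ACME", "line_total": 120.50},
    {"order_id": 2, "customer": "", "line_total": 75.00},         # missing customer
    {"order_id": 3, "customer": "Globex", "line_total": -10.00},  # negative amount
    {"order_id": 4, "customer": "Initech", "line_total": 42.00},
]


def validate(customer, line_total):
    """Quality-control checkpoint: return the list of problems found in a row."""
    problems = []
    if not customer:
        problems.append("missing customer")
    if line_total < 0:
        problems.append("negative line_total")
    return problems


def run_etl(rows):
    """Stage rows at their atomic level, validate them, and load the valid ones."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE stg_orders (order_id INTEGER, customer TEXT, line_total REAL)")
    conn.execute("CREATE TABLE fact_orders (order_id INTEGER, customer TEXT, line_total REAL)")
    conn.execute("CREATE TABLE rejected_orders (order_id INTEGER, reason TEXT)")

    # 1. Extract into the intermediate (staging) table, one row per source record.
    conn.executemany(
        "INSERT INTO stg_orders VALUES (:order_id, :customer, :line_total)", rows
    )

    # 2. Validate every staged row before it reaches the target table.
    staged = conn.execute(
        "SELECT order_id, customer, line_total FROM stg_orders ORDER BY order_id"
    ).fetchall()
    for order_id, customer, line_total in staged:
        problems = validate(customer, line_total)
        if problems:
            # Keep rejected rows with a reason so they can be corrected and reloaded.
            conn.execute(
                "INSERT INTO rejected_orders VALUES (?, ?)",
                (order_id, "; ".join(problems)),
            )
        else:
            # 3. Load the valid row, reusing the precalculated total as-is.
            conn.execute(
                "INSERT INTO fact_orders VALUES (?, ?, ?)",
                (order_id, customer, line_total),
            )
    conn.commit()
    return conn


if __name__ == "__main__":
    connection = run_etl(SOURCE_ROWS)
    print(connection.execute("SELECT * FROM fact_orders").fetchall())
    print(connection.execute("SELECT * FROM rejected_orders").fetchall())
```

In a tool such as Pentaho Data Integration, a flow like this would normally be assembled from steps rather than written by hand. The sketch only shows the order of operations.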

In addition, we will review the most important and widely used elements of Pentaho's ETL tool: Pentaho Data Integration, also known as Kettle.

Target audiences

Information technology professionals, IT managers, business analysts, systems analysts, Java architects, system developers, database administrators, developers, and professionals working in technology, marketing, business, and finance.

Observations

Data Quality and Integration with Pentaho

Syllabus

Introduction to Open Source Business Intelligence Platforms
  • Architecture and features of Pentaho, SpagoBI, BIRT, Mondrian, Kettle, Talend, etc.
  • Development Tools.
ETL (Kettle)
  • Good practices for ETL process definition.
  • Functional Overview (Jobs, Transformations, flow control)
  • Parameterization
    • Environment Variables
    • Parameterization of connections to databases. Shared connections.
    • Parameterization of loads and load types
  • Jobs
    • Overview
    • Step types (Mail, File Management, etc.)
    • Description of Steps
    • Examples of the most useful and common steps
  • Transformations
    • Overview
    • Step types (Input, Output, Transform, etc.)
    • Description of Steps
    • Examples of the most useful and common steps
  • Practical examples

Contact

We adjust each course to your needs.

Our office in Madrid

Do you need training? We offer a wide training catalog based on platforms and software tools such as Pentaho, Talend, Mondrian, and CTools.