pygrametl: A Powerful Programming Framework for Extract–Transform–Load Programmers

Research output: Book/ReportReportResearch

34 Citations (Scopus)
6419 Downloads (Pure)

Abstract

    Extract-Transform-Load (ETL) processes are used for extracting data, transforming it and loading it into data
warehouses (DWs). Many tools for creating ETL processes exist. The dominating tools all use graphical user
interfaces (GUIs) where the developer visually defines the data flow and operations. In this paper, we challenge
this approach and propose to do ETL programming by writing code. To make the programming easy, we present
the (Python-based) framework pygrametl which offers commonly used functionality for ETL development.
By using the framework, the developer can efficiently create effective ETL solutions from which the full power
of programming can be exploited. Our experiments show that when pygrametl is used, both the development
time and running time are short when compared to an existing GUI-based tool
Original languageEnglish
Place of PublicationAalborg
PublisherDepartment of Computer Science, Aalborg University
Volume25
Number of pages18
Publication statusPublished - 2009
Series1DB Technical Report
Number25

Fingerprint

Dive into the research topics of 'pygrametl: A Powerful Programming Framework for Extract–Transform–Load Programmers'. Together they form a unique fingerprint.

Cite this