The development of DataCleaner is handled by the DataCleaner community, a meritocratic, communicative entity of professionals and volunteers whose shared purpose is to produce high-quality software and promote it to as many people as possible.

The community doesn't exist of a well-defined group of people, rather we emphasize that everyone who participates can be a community member and participation is sought to be as convenient and easy as possible.

Join the development

Most of the development infrastructure for DataCleaner is hosted at GitHub, which provides a social platform for building software among professionals and volunteering contributors.

Source code

You can find the source code here:

We encourage everyone with the idea of helping DataCleaner's development to fork the GitHub repository and submit your contributions using pull requests!


Issues are registered and tracked via GitHub's issue tracking system. Note that we previously used our own bugtracker and since then a lot of older issues have been purged from the list.

We invite everyone to submit issues if you find one. You can also start by taking up a discussion on the forums to verify your thoughts on the issue.

Extension development

Some resources to get started with DataCleaner extension development:

Connectivity layer

DataCleaner builds on top of Apache MetaModel for it's connectivity layer. In fact, many of the core developers of DataCleaner are also core developers on Apache MetaModel.

  • Apache MetaModel - datastore layer, allowing access to databases, CSV files and more.
  • MetaModel extras - extra modules for Apache MetaModel under the LGPL license.

You can find more information in the reference documentation!

The reference documentation of DataCleaner contains several chapters on important topics for developers such as:

  • Architecture
  • Building and running jobs through code
  • Embedding DataCleaner into other desktop applications
  • Invoking web services on DataCleaner monitor
  • Uploading and downloading content from DataCleaner monitor