Synthesizers: A Meta-Framework for Generating and Evaluating High-Fidelity Tabular Synthetic Data

Peter Schneider-Kamp*, Anton Danholt Lautrup, Tobias Hyrup

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

50 Downloads (Pure)

Abstract

Synthetic data is by many expected to have a significant impact on data science by enhancing data privacy, reducing biases in datasets, and enabling the scaling of datasets beyond their original size. However, the current landscape of tabular synthetic data generation is fragmented, with numerous frameworks available, only some of which have integrated evaluation modules. synthesizers is a meta-framework that simplifies the process of generating and evaluating tabular synthetic data. It provides a unified platform that allows users to select generative models and evaluation tools from open-source implementations in the research field and apply them to datasets of any format. The aim of synthesizers is to consolidate the diverse efforts in tabular synthetic data research, making it more accessible to researchers from different sub-domains, including those with less technical expertise such as health researchers. This could foster collaboration and increase the use of synthetic data tools, ultimately leading to more effective research outcomes.
Original languageEnglish
Title of host publication Proceedings of the 19th International Conference on Software Technologies ICSOFT
PublisherSCITEPRESS Digital Library
Publication date2024
Pages177-184
ISBN (Print)978-989-758-706-1
DOIs
Publication statusPublished - 2024
SeriesInternational Conference on Software Technologies
ISSN2184-2833

Fingerprint

Dive into the research topics of 'Synthesizers: A Meta-Framework for Generating and Evaluating High-Fidelity Tabular Synthetic Data'. Together they form a unique fingerprint.

Cite this