Synthetic data: leveraging the potential of sensitive data in SSH research
Title: Synthetic data: leveraging the potential of sensitive data in SSH
Challenge: Call 2023/2024
Project lead: Erik-Jan van Kesteren, University of Utrecht
Duration: 24 months
Start date: September 2025
Status: Approved by NWO
Public Summary
A synthetic dataset has (more or less) the same properties as an original dataset but contains no privacy-sensitive information. When synthetic data is made available instead of (or prior to) the actual dataset, scientists can start working with confidential data faster and more easily. In this project, two tools for creating synthetic data are used to unlock existing datasets, including datasets archived at DANS.
Project Team
- Tim Kok, SURF
- Giuseppe Gianquitto, SURF
- Freek Dijkstra, SURF
- Chang Sun, Maastricht University
- Ricarda Braukmann, DANS
- Wim Hugo, DANS