Synthetic data: leveraging the potential of sensitive data in SSH research
Title: Synthetic data: leveraging the potential of sensitive data in SSH
Challenge: Call 2023/2024
Project lead: Erik-Jan van Kesteren, University of Utrecht
Duration: 24 months
Start date: 27 August 2025
Status: Ongoing
Public Summary
Synthetic data is an artificially generated dataset that has (more or less) the same properties as an original dataset but contains no privacy-sensitive information. When synthetic data is made available instead of (or prior to) the actual dataset, scientists gain faster and easier access to data that would otherwise remain confidential. This project uses two tools for creating synthetic data to unlock existing datasets, including datasets archived at DANS.
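To make the idea concrete, here is a minimal sketch in Python of one very simple way to produce a synthetic table: summarise each column on its own and then draw new values from those summaries. It does not represent the tools used in this project; the function names (`fit_column`, `sample_column`, `synthesize`) and the example table are hypothetical, chosen for illustration only. Sampling each column independently keeps per-column properties roughly intact, discards relationships between columns, and does not by itself give any formal privacy guarantee.

```python
import random
import statistics
from collections import Counter


def fit_column(values):
    """Summarise one column: mean/stdev for numbers, frequencies for categories."""
    if all(isinstance(v, (int, float)) for v in values):
        return ("numeric", statistics.mean(values), statistics.stdev(values))
    counts = Counter(values)
    return ("categorical", list(counts.keys()), list(counts.values()))


def sample_column(model, n, rng):
    """Draw n new values from a fitted column summary."""
    if model[0] == "numeric":
        _, mean, sd = model
        # Assumption for this sketch: numeric columns are roughly normal.
        return [rng.gauss(mean, sd) for _ in range(n)]
    _, categories, weights = model
    return rng.choices(categories, weights=weights, k=n)


def synthesize(table, n, seed=0):
    """Build a synthetic table with n rows, one column at a time."""
    rng = random.Random(seed)
    return {name: sample_column(fit_column(col), n, rng)
            for name, col in table.items()}


# Hypothetical "original" data; no real records are used here.
original = {
    "age": [34, 51, 29, 62, 45, 38],
    "region": ["north", "south", "south", "east", "north", "south"],
}
synthetic = synthesize(original, n=1000)
```

The synthetic table contains none of the original rows, yet its column averages and category proportions stay close to those of the original, which is what allows researchers to prepare and test analyses before requesting access to the real data.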
Project Team
- Freek Dijkstra, SURF
- Chang Sun, Maastricht University
- Ricarda Braukmann, DANS
- Tim Kok, SURF
- Raoul Schram, UU
- Giuseppe Gianquitto, SURF
- Wim Hugo, DANS