Synthetic data: leveraging the potential of sensitive data in SSH research

Title: Synthetic data: leveraging the potential of sensitive data in SSH
Challenge: Call 2023/2024
Project lead: Erik-Jan van Kesteren, University of Utrecht
Duration: 24 months
Start date: August 2025
Status: Approved by NWO

Public Summary
A synthetic dataset has (more or less) the same properties as the original dataset but contains no privacy-sensitive information. By making synthetic data available instead of (or prior to) the actual dataset, scientists gain faster and easier access to confidential data. In this project, two tools for creating synthetic data are used to unlock existing datasets, including datasets archived at DANS.

Project Team
  • Tim Kok, SURF
  • Giuseppe Gianquitto, SURF
  • Freek Dijkstra, SURF
  • Chang Sun, Maastricht University
  • Ricarda Braukmann, DANS
  • Wim Hugo, DANS