Synthetic data: leveraging the potential of sensitive data in SSH research

Title
Challenge
Project lead
Duration

Start date
Status

Synthetic data: leveraging the potential of sensitive data in SSH Call 2023/2024
Erik-Jan van Kesteren, University of Utrecht
24 months
27 August 2025
Ongoing

Public Summary
Synthetic data is a dataset with (more or less) the same properties as an original dataset but without privacy-sensitive information. By making synthetic data available instead of (or prior to) the actual dataset, scientists gain faster and easier access to confidential data. In this project, two tools for creating synthetic data are used to unlock existing datasets, including datasets archived at DANS.

Project Team
  • Freek Dijkstra, SURF
  • Chang Sun, Maastricht University
  • Ricarda Braukmann, DANS
  • Tim Kok, SURF
  • Raoul Schram, UU
  • Giuseppe Gianquitto, SURF
  • Wim Hugo, DANS