1 code implementation • 15 Apr 2024 • Nian Ran, Bahrul Ilmi Nasution, Claire Little, Richard Allmendinger, Mark Elliot
However, there are unique challenges in tabular data compared to images, eg tabular data may contain both continuous and discrete variables and conditional sampling, and, critically, the data should possess high utility and low disclosure risk (the risk of re-identifying a population unit or learning something new about them), providing an opportunity for multi-objective (MO) optimization.
1 code implementation • 2 Jul 2022 • Claire Little, Mark Elliot, Richard Allmendinger
The paper presents a framework to measure the utility and disclosure risk of synthetic data by comparing it to samples of the original data of varying sample fractions, thereby identifying the sample fraction which has equivalent utility and risk to the synthetic data.
no code implementations • 3 Dec 2021 • Claire Little, Mark Elliot, Richard Allmendinger, Sahel Shariati Samani
Generative Adversarial Networks (GANs) are gaining increasing attention as a means for synthesising data.