The UK tech sector is increasingly turning to synthetic data to address current challenges around data access and availability, privacy, and sharing.
Advocates of synthetic data argue that it is better for privacy preservation than ‘anonymised’ datasets that despite best efforts can, from time to time, be linked back with identifiers. It also removes the reliance on data sharing for the formation of the datasets and often times is more scalable. It is no wonder, then, that Gartner predicts that by 2024, 60% of data used for the development of AI and analytics projects will be synthetically generated. In fact, it is possible that AI models simply won’t be possible without the use of synthetic data.
However, amongst the advocates and advantages of synthetic data, there are justifiable apprehensions about data generation techniques, transparency and explainability that could offset the potential benefits it could bring. Furthermore, if existing barriers to unlocking the full value of non-synthetic or ‘real data’ were removed, it could be possible that the need of synthetic data would be reduced. There are also many other long-standing data challenges that exist which risk persisting in the context of synthetic data if not addressed, such as the data skills shortage.
In this next iteration of techUK’s Future Vision Series, techUK will seek to answer some of the burning questions members have around the creation and use of synthetic data, where it could be revolutionary, and whether it will live up to its hype.
In particular, this webinar will address:
-
What are the common ways to create synthetic data, and what does the emerging synthetic data ecosystem look like?
-
What is the connection between synthetic data and PET’s?
-
Is synthetic data a plaster over larger data availability challenges, or a long term solution for businesses?
-
How do we ensure decisions made based off synthetic data are rooted in reality, rather than what a machine imagines that reality to be?
-
What are some of the key ethical considerations which should underpin the generation of synthetic data, and how important is public trust?
-
What does this mean for the future of data availability?
Speakers include:
- Dr. Martin O'reilly, Director of Research Engineering, The Alan Turing Institute
- Jeremy Poulter, Business Development & Solutions Director, Defence and National Security, Microsoft UK
- Alexandra Ebert, Chief Trust Officer, Mostly AI
- Emanuele Haerens, Commercial Director, Hazy
- Laura Foster, Head of Tech & Innovation, techUK

Laura Foster
Laura is techUK’s Head of Programme for Technology and Innovation.

Dani Dhiman
Dani joined techUK in October 2021 as Policy Manager for Data.