Synthetic data

What are synthetic data?

Synthetic data are artificially generated data that mimic the structure and statistical properties of real data. A synthetic data set is designed to reproduce the statistical properties of the real data on which it is based, without containing any actual information about individuals. Synthetic data are mainly used where too little valid data is available, or where the original data cannot be used directly for data protection reasons. They are created by algorithms or models that are fitted to existing data.
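As a rough illustration of "a model based on existing data", the following Python sketch fits a simple two-column Gaussian model to a small table and then samples new records from it. Everything here is an assumption made for illustration: the example records (age, income) are made-up values, the function names fit_gaussian_2d and sample_synthetic are hypothetical, and real synthetic data generators typically use far richer models. Note also that a sketch like this gives no formal privacy guarantee by itself.

```python
import random
import statistics

# Made-up "real" records for illustration only: (age, yearly income).
real_rows = [
    (23, 31000), (35, 42000), (41, 50500), (29, 36000),
    (52, 61000), (47, 58000), (38, 45500), (60, 67000),
]

def fit_gaussian_2d(rows):
    """Estimate means, standard deviations, and correlation of two numeric columns."""
    xs = [r[0] for r in rows]
    ys = [r[1] for r in rows]
    mean_x, mean_y = statistics.fmean(xs), statistics.fmean(ys)
    sd_x, sd_y = statistics.stdev(xs), statistics.stdev(ys)
    # Sample covariance, computed by hand so no third-party library is needed.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in rows) / (len(rows) - 1)
    rho = max(-1.0, min(1.0, cov / (sd_x * sd_y)))  # clamp against rounding error
    return mean_x, mean_y, sd_x, sd_y, rho

def sample_synthetic(params, n, seed=0):
    """Draw n new records from the fitted model; no original row is copied."""
    mean_x, mean_y, sd_x, sd_y, rho = params
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        z1, z2 = rng.gauss(0, 1), rng.gauss(0, 1)
        x = mean_x + sd_x * z1
        # Mix z1 and z2 so the second column keeps the fitted correlation.
        y = mean_y + sd_y * (rho * z1 + (1 - rho ** 2) ** 0.5 * z2)
        rows.append((x, y))
    return rows

params = fit_gaussian_2d(real_rows)
synthetic_rows = sample_synthetic(params, 5000)
synthetic_params = fit_gaussian_2d(synthetic_rows)
```

Comparing params with synthetic_params shows whether the means, spreads, and correlation of the generated records stay close to those of the original table, which is the sense in which synthetic data "mimic" the real data.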

Practical example

For statistical evaluations, personal data are anonymized in such a way that they can no longer be assigned to a specific person.
