Synthetic dataset of the Luxembourgish population

The dataset was created from scratch using the open-source code on GitLab [link] released by LNDS and the real statistics from the STATEC, the statistics portal of Luxembourg . It is meant to be an example of the dataset structure anyone can generate and personalize in terms of some parameters, including the sample size. The data are organized by individual profiles on the rows and their personal features on the columns. The age-structure distribution, the number of populations over municipalities, the number of different nationalities present in Luxembourg, the salary statistics per municipality are some of real information we have ingested into our synthetic generation model to generate the dataset. Some other features (Date_of_birth, Social_matricule, First_name and Surname of foreign nationalities) have been obtained by a logical relationship between the variables without exploiting any additional real information. We are in compliance with the law in reducing and putting close to zero, the reisk of identifying a real person completely by chance.

Data and Resources

Additional Info

Field Value
Title for URL of the dataset Synthetic dataset of the Luxembourgish population
Identifier 3c5065d1-fcca-442d-a095-ad9836a22b8e
Theme e84dbfc9-75fa-476c-8fec-4d20742801ae
Type Synthetic data
Contact Point
Contact Email
publisher Luxembourg National Data Service (LNDS)
Release Date 2023-11-27
Modification Date 2023-11-27
Frequency irregular
Access Right public
Dataset Relationships []
Dataset Dictionary