The dataset was created from scratch using the open-source code on GitLab [link] released by LNDS and the real statistics from the STATEC, the statistics portal of Luxembourg . It is meant to be an example of the dataset structure anyone can generate and personalize in terms of some parameters, including the sample size. The data are organized by individual profiles on the rows and their personal features on the columns. The age-structure distribution, the number of populations over municipalities, the number of different nationalities present in Luxembourg, the salary statistics per municipality are some of real information we have ingested into our synthetic generation model to generate the dataset. Some other features (Date_of_birth, Social_matricule, First_name and Surname of foreign nationalities) have been obtained by a logical relationship between the variables without exploiting any additional real information. We are in compliance with the law in reducing and putting close to zero, the reisk of identifying a real person completely by chance.