The danish demographic database-principles and methods for cleaning and standardisation of data

5Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Since 2001 seven Danish censuses dating from 1787 till 1880 have been completely transcribed by volunteers. Due to this effort the research community now has access to a large number of demographic data. The census data were digitised according to the principle of literal data transcription in order to leave all interpretations to the users. The disadvantage of this solution is that it induces problems when creating aggregated statistics as the spelling of, e.g. position in household and occupations was not standardised which leads to great variation in the description of the same entities. In order to overcome this obstacle the data were cleaned and standardised. Standardisation consists of adding numeric codes for the gender, civil status and position in household. For occupations, HISCO has been applied to secure that the data can be used in comparative research.

Cite

CITATION STYLE

APA

Clausen, N. F. (2015). The danish demographic database-principles and methods for cleaning and standardisation of data. In Population Reconstruction (pp. 1–22). Springer International Publishing. https://doi.org/10.1007/978-3-319-19884-2_1

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free