Our activities include the acquisition, documentation, anonymization, dissemination, and preservation of micro-data and related metadata.
Acquisition
The National Data Archive was primarily established to archive survey and census microdata produced by the National Statistics Office and other official data producers. On a selected basis, the National Data Archive also serves as a repository for non-official datasets. Data producers interested in depositing data in the National Data Archive are invited to contact us.
Documentation
Data documentation serves several important functions. It helps data producers build institutional memory, and helps researchers to:
The National Data Archive adopted the Data Documentation Initiative (DDI) and the Dublin Core (DCMI) international metadata standards.
Anonymization
Statistical agencies are charged with legal and ethical obligations to protect the confidentiality of survey respondents. The National Data Archive protects confidentiality of the data by:
Dissemination
Data dissemination increases the quality, use and potential impact of data, by:
Obviously, making microdata available also has down sides. It exposes data producers to criticism, it increases the risk of breach to confidentiality, and it can result in conflicting outputs being generated. Having faith in the ethical consuct of data users and in their willingness to contribute to the quality and usefulness of the data, the National Data Archive considers that the benefits outweigh the disadvantages. We insist however that access to microdata must not be seen as a right. Access will only be permitted to bona fide users, and for statistical and research purposes only.
Preservation
Micro-datasets can be damaged or lost because of human error, because of technical problems, or because of disasters such as fire or flood. New technologies can also render old data unreadable, because of either hardware or software advances. The National Data Archive is implementing standard procedures for ensuring the physical security and long-term usability of its resources, together with associated backup arrangements for minimizing the impact of adverse events.