Archive
The following generic information provides a broad overview of the importance of archiving; for more practical and detailed information on data and metadata archiving, see our Guidelines on Data Archiving
This phase manages the archiving of statistical data and metadata and consists of four sub-processes:
- 8.1.Define archive rules – Rules for archiving statistical data and metadata resulting from a statistical business process are determined by this sub-process. The requirement to archive intermediate outputs, such as a sample file, raw data from the collection phase, and results of various stages of the process, and to analyze phases, should also be considered. Archive rules for a specific statistical business process may be fully or partly dependent on the general archiving policy of the statistical organization or, for national organizations, on standards applied across the government sector. The rules should include consideration of the medium, location of the archive, and the requirement for keeping duplicate copies. They should also consider the conditions (if any) under which data and metadata should be disposed of. For at least the first iteration of a statistical business process, this sub-process is logically and strongly linked to Phase 2 – Design.
- 8.2.Manage archive repository – This sub-process manages one or more archive repositories, including databases or physical locations where copies of data or metadata are stored. It includes:
- Maintaining catalogues of data and metadata archives, with sufficient information to ensure that individual data or metadata sets can be easily retrieved;
- Testing retrieval processes
- Periodically checking the integrity of archived data and metadata; and
- Upgrading software-specific archive formats when software changes.
This sub-process may cover a specific statistical business process or a group of processes, depending on the degree of standardization within the organization. Ultimately it may be considered to be an over-arching process if organization-wide standards are put in place.
- 8.3.Preserve data and associated metadata – In this sub-process, data and metadata from a specific statistical business process are archived. It includes:
- Identifying data and metadata for archiving in accordance with the rules defined in 8.1;
- Formatting those data and metadata for the repository;
- Loading or transferring data and metadata to the repository;
- Cataloguing the archived data and metadata; and
- Verifying that the data and metadata have been successfully archived.
- 8.4.Dispose of data and associated metadata – In this sub-process, data and metadata from a specific statistical business process are disposed of. It includes:
- Identifying data and metadata for disposal, in accordance with the rules defined in 8.1;
- Disposing those data and metadata; and
- Recording the disposal of those data and metadata.
Related Resources
Documents
National Statistics Code of Practice - Protocol on Data Management, Documentation and Preservation (UK)
Author(s) | UK National Statistics |
Description | This protocol sets out how the producers of National Statistics will carry out their responsibilities for managing, documenting, retaining and preserving the statistical resources which they control. |
Date | 2004 |
URL |
Principles and Good Practice for Preserving Data
Author(s) | Interuniversity Consortium for Political and Social Research (ICPSR) |
Description | This document provides basic guidance for managers in statistical agencies who are responsible for preserving data using the principles and best practices defined by the digital preservation community. The document defines the rationale for preserving data and the principles and standards applied to data preservation; documents the development of a digital preservation policy; and uses digital archive audit principles to suggest best practices for data. |
Date | December 2009 |
URL |