Rda Recommendedfileformats
Rda Recommendedfileformats
Contents
Introduction: file format categories .............................................................................................................................. 1
Overview: formats for preservation and use .............................................................................................................. 1
Recommended file formats ............................................................................................................................................ 2
Acceptable file formats: general purpose ................................................................................................................... 4
Other file formats: specialist and rarely-used ............................................................................................................ 5
Further information ........................................................................................................................................................... 6
Guidance on file formats with examples of recommended and acceptable formats is provided in this
document.
Proprietary formats are more suitable for preservation if they are widely-used and can be
opened in different operating systems and with different programmes or applications:
examples include Microsoft Rich Text Format, Excel, and PDF.
The preferred preservation formats for representations of image, audio and video data are
those that encode the information with a lossless algorithm, such as TIFF (.tif), Free Lossless
Audio Codec (.flac), and MPEG-4 (.mp4). Unlike lossy formats (e.g. JPEG, .mp3, WMV), lossless
formats do not cause data loss when files are saved.
We recommend you deposit your files in the Archive using open, platform-independent or non-
proprietary file formats wherever possible.
Preservation formats may not always be the most accessible or usable:
Open formats may lack the formatting and functionality that allow data to be rendered,
manipulated and analysed more effectively.
Information-rich lossless formats, such as high-resolution image or video, may produce very
large files which are not suited for web access or fast processing.
In such cases you may wish to deposit files in more than one format, so that data are optimised for both
preservation and use: for example, a richly-formatted Excel file containing manipulated tabular data
might be made available alongside a CSV file of the raw numbers; or a large TIFF image file might be
deposited for preservation, and a JPEG or other compressed image file made available for web
download.
You may need to document your file formats, for example, recording the version of the software used
to generate them, and including, if relevant, details of the compression, codec and bit rate used. This
information can be entered in the Data processing and preparation activities field in the Metadata
Record; it can also be recorded in a README.txt file or other documentation file uploaded with your
data.
For general information on choosing appropriate formats for preservation and other uses, consult the
Further information references at the end of this document.
CSS .css
XML .xml
GIF .gif
PNG .png
Image/graphics
Postscript .ps
WAV .wav
R (ascii) .rdata
CAD .dwg
These are mostly general-purpose formats, and as many are widely used, it is likely that the Archive will
be able to preserve them. But preservation of data files in these formats cannot be guaranteed, and
they are deposited at your own risk.
If you wish to preserve information, formatting or functionality encoded in files in these formats, you
may deposit the original files alongside the same files converted into standard preservation formats.
For instance, you could deposit your SPSS files (.por or .sav) along with standard preservation files in
SPSS syntax (.sps) and Plain Text (.txt) data files.
TeX .tex
Photo CD .pcd
Tarball .tar
Further information
The National Archives. PRONOM. https://www.nationalarchives.gov.uk/PRONOM/Default.aspx
Online registry of file formats.
UK Data Service. Recommended formats. http://ukdataservice.ac.uk/manage-
data/format/recommended-formats
Recommended formats for depositing files in the UK DataService.
Wikipedia. List of file formats. https://en.wikipedia.org/wiki/List_of_file_formats#Tabulated_data
Including common scientific data formats.