0% found this document useful (0 votes)
21 views4 pages

Data Longevity and Accessibility

Uploaded by

Tesfaye
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
21 views4 pages

Data Longevity and Accessibility

Uploaded by

Tesfaye
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

Data longevity and accessibility

An important field in computer science, technology, and library science is the longevity of data.
Scientific research generates huge amounts of data, especially in genomics and astronomy, but
also in the medical sciences, e.g. in medical imaging. In the past, scientific data has been
published in papers and books, stored in libraries, but more recently practically all data is stored
on hard drives or optical discs. However, in contrast to paper, these storage devices may become
unreadable after a few decades. Scientific publishers and libraries have been struggling with this
problem for a few decades, and there is still no satisfactory solution for the long-term storage of
data over centuries or even for eternity.

Data accessibility. Another problem is that much scientific data is never published or deposited
in data repositories such as databases. In a recent survey, data was requested from 516 studies
that were published between 2 and 22 years earlier, but less than one out of five of these studies
were able or willing to provide the requested data. Overall, the likelihood of retrieving data
dropped by 17% each year after publication.[21] Similarly, a survey of 100 datasets in Dryad
found that more than half lacked the details to reproduce the research results from these studies.
[22]
This shows the dire situation of access to scientific data that is not published or does not have
enough details to be reproduced.

A solution to the problem of reproducibility is the attempt to require FAIR data, that is, data that
is Findable, Accessible, Interoperable, and Reusable. Data that fulfills these requirements can be
used in subsequent research and thus advances science and technology.[23]

In other fields
Although data is also increasingly used in other fields, it has been suggested that the highly
interpretive nature of them might be at odds with the ethos of data as "given". Peter Checkland
introduced the term capta (from the Latin capere, "to take") to distinguish between an immense
number of possible data and a sub-set of them, to which attention is oriented.[24] Johanna Drucker
has argued that since the humanities affirm knowledge production as "situated, partial, and
constitutive," using data may introduce assumptions that are counterproductive, for example that
phenomena are discrete or are observer-independent.[25] The term capta, which emphasizes the
act of observation as constitutive, is offered as an alternative to data for visual representations in
the humanities.

The term data-driven is a neologism applied to an activity which is primarily compelled by data
over all other factors.[citation needed] Data-driven applications include data-driven programming and
data-driven journalism.

See also
 Biological data
 Computer data processing
 Computer memory
 Dark data
 Data (computer science)
 Data acquisition
 Data analysis
 Data bank
 Data cable
 Data curation
 Data domain
 Data element
 Data farming
 Data governance
 Data integrity
 Data maintenance
 Data management
 Data mining
 Data modeling
 Data point
 Data preservation
 Data protection
 Data publication
 Data remanence
 Data science
 Data set
 Data structure
 Data visualization
 Data warehouse
 Database
 Datasheet
 Data-driven programming
 Data-driven journalism
 Data-driven testing
 Data-driven learning
 Data-driven science
 Data-driven control system
 Data-driven marketing
 Digital privacy
 Environmental data rescue
 Fieldwork
 Information engineering
 Machine learning
 Open data
 Scientific data archiving
 Secondary Data
 Statistics
 Digital data
 Data aggregation
References
1.

 OECD Glossary of Statistical Terms. OECD. 2008. p. 119. ISBN 978-92-64-025561.


  "Statistical Language - What are Data?". Australian Bureau of Statistics. 2013-07-13.
Archived from the original on 2019-04-19. Retrieved 2020-03-09.
  "Data vs Information - Difference and Comparison | Diffen". www.diffen.com. Retrieved
2018-12-11.
  Yonego, Joris Toonders (July 23, 2014). "Data Is the New Oil of the Digital Economy".
Wired – via www.wired.com.
  "Data is the new oil". July 16, 2018. Archived from the original on 2018-07-16.
  "data | Origin and meaning of data by Online Etymology Dictionary".
www.etymonline.com.
  American Psychological Association (2020). "6.11". Publication Manual of the American
Psychological Association: the official guide to APA style. American Psychological Association.
ISBN 9781433832161.
  "Joint Publication 2-0, Joint Intelligence" (PDF). Joint Chiefs of Staff, Joint Doctrine
Publications. Department of Defense. 23 October 2013. pp. I-1. Archived from the original
(PDF) on 18 July 2018. Retrieved July 17, 2018.
  Akash Mitra (2011). "Classifying data for successful modeling". Archived from the
original on 2017-11-07. Retrieved 2017-11-05.
  Tuomi, Ilkka (2000). "Data is more than knowledge". Journal of Management Information
Systems. 6 (3): 103–117. doi:10.1080/07421222.1999.11518258.
  P. Beynon-Davies (2002). Information Systems: An introduction to informatics in
organisations. Basingstoke, UK: Palgrave Macmillan. ISBN 0-333-96390-3.
  P. Beynon-Davies (2009). Business information systems. Basingstoke, UK: Palgrave.
ISBN 978-0-230-20368-6.
  Sharon Daniel. The Database: An Aesthetics of Dignity.
  Liu, Stephanie (2020-07-30). "Straight From The Source: Collecting Zero-Party Data
From Customers". Forrester. Retrieved 2025-01-14.
  Greenstein, Danielle (2019-08-19). "What is First-Party vs Third-Party Data: Definitions
& Strategies". Lotame. Retrieved 2025-01-14.
  Studio, AdExchanger Content (2025-01-02). "The Dawn Of First-Party Data: Navigating
The New Advertising Landscape". AdExchanger. Retrieved 2025-01-14.
  Bridgwater, Adrian. "Third-Party Data Is Now First-Class". Forbes. Retrieved 2025-01-
14.
  Fallows, Carley (2025-01-13). "Which Data Source Can You Trust for Better Marketing
ROI?". Littlegate Publishing. Retrieved 2025-01-14.
  Greenstein, Danielle (2024-03-15). "What is Second Party Data and How Can you Use
it?". Lotame. Retrieved 2025-01-14.
  Mesly, Olivier (2015), Creating Models in Psychological Research, Springer Psychology :
126 pages. ISBN 978-3-319-15752-8
  Vines, Timothy H.; Albert, Arianne Y. K.; Andrew, Rose L.; Débarre, Florence; Bock, Dan
G.; Franklin, Michelle T.; Gilbert, Kimberly J.; Moore, Jean-Sébastien; Renaut, Sébastien;
Rennison, Diana J. (2014-01-06). "The availability of research data declines rapidly with article
age". Current Biology. 24 (1): 94–97. arXiv:1312.5670. doi:10.1016/j.cub.2013.11.014.
ISSN 1879-0445. PMID 24361065. S2CID 7799662.
  Roche, Dominique G.; Kruuk, Loeske E. B.; Lanfear, Robert; Binning, Sandra A. (2015).
"Public Data Archiving in Ecology and Evolution: How Well Are We Doing?". PLOS Biology.
13 (11): e1002295. doi:10.1371/journal.pbio.1002295. ISSN 1545-7885. PMC 4640582.
PMID 26556502.
  Eisenstein, Michael (April 2022). "In pursuit of data immortality". Nature. 604 (7904):
207–208. Bibcode:2022Natur.604..207E. doi:10.1038/d41586-022-00929-3. ISSN 1476-4687.
PMID 35379989. S2CID 247954952.
  P. Checkland and S. Holwell (1998). Information, Systems, and Information Systems:
Making Sense of the Field. Chichester, West Sussex: John Wiley & Sons. pp. 86–89. ISBN 0-471-
95820-4.
 Johanna Drucker (2011). "Humanities Approaches to Graphical Display". Digital
Humanities Quarterly. 005 (1).

You might also like