Research On Raw Data
Research On Raw Data
Search
Donate
Create account
Log in
Personal tools
Contents
hide
(Top)
Generating data
Examples
See also
References
Further reading
Raw data
16 languages
Article
Talk
Read
Edit
View history
Tools
Appearance
hide
Text
Small
Standard
Large
Width
Standard
Wide
Color (beta)
Automatic
Light
Dark
"Primary data" redirects here. For data that has been created at the time
under study, see Primary source.
Raw data, also known as primary data, are data (e.g., numbers,
instrument readings, figures, etc.) collected from a source. In the context
of examinations, the raw data might be described as a raw
score (after test scores).
Generating data
[edit]
Data has two ways of being created or made. The first is what is called
'captured data',[1] and is found through purposeful investigation or
analysis. The second is called 'exhaust data', [1] and is gathered usually by
machines or terminals as a secondary function. For example, cash
registers, smartphones, and speedometers serve a main function but may
collect data as a secondary task. Exhaustive data is usually too large or of
little use to process and becomes 'transient' or thrown away. [1]
Examples
[edit]
In computing, raw data may have the following attributes: it may possibly
contain human, machine, or instrument errors, it may not be validated; it
might be in different area (colloquial) formats; uncoded or unformatted; or
some entries might be "suspect" (e.g., outliers),
requiring confirmation or citation. For example, a data input sheet might
contain dates as raw data in many forms: "31st January 1999",
"31/01/1999", "31/1/99", "31 Jan", or "today". Once captured, this raw
data may be processed stored as a normalized format, perhaps a Julian
date, to make it easier for computers and humans to interpret during later
processing. Raw data (sometimes colloquially called "sources" data or
"eggy" data, the latter a reference to the data being "uncooked", that is,
"unprocessed", like a raw egg) are the data input to processing. A
distinction is made between data and information, to the effect that
information is the end product of data processing. Raw data that has
undergone processing are sometimes referred to as "cooked" data in a
colloquial sense.[dubious – discuss] Although raw data has the potential to be
transformed into "information," extraction, organization, analysis, and
formatting for presentation are required before raw data can be
transformed into usable information.
Tim Berners-Lee (inventor of the World Wide Web) argues that sharing raw
data is important for society. Inspired by a post by Rufus Pollock of
the Open Knowledge Foundation his call to action is "Raw Data Now",
meaning that everyone should demand that governments and businesses
share the data they collect as raw data. He points out that "data drives a
huge amount of what happens in our lives… because somebody takes the
data and does something with it." To Berners-Lee, it is essentially from this
sharing of raw data, that advances in science will emerge. Advocates
of open data argue that once citizens and civil society organizations have
access to data from businesses and governments, it will enable citizens
and NGOs to do their own analysis of the data, which can empower people
and civil society. For example, a government may claim that its policies
are reducing the unemployment rate, but a poverty advocacy group may
be able to have its staff econometricians do their own analysis of the raw
data, which may lead this group to draw different conclusions about the
data set.
See also
[edit]
Standard score
References
[edit]
Further reading
[edit]
Give Us the Data Raw, and Give it to Us Now - the blog post from
Rufus Pollock that inspired Tim Berners-Lee
Categories:
Data types
Research
Information
Privacy policy
About Wikipedia
Disclaimers
Contact Wikipedia
Code of Conduct
Developers
Statistics
Cookie statement
Mobile view
Search
Raw data
16 languagesAdd topic