Digital Object Identifier
Digital Object Identifier
Explanation:
Abstract :
The DOI is the digital equivalent of the well
The Digital Object Identifier (DOI®) System is known Universal Product Code (UPC) bar code
for identifying content objects in the digital printed on virtually all physical products. The bar
environment. DOI® names are assigned to any code identifies the product, ties it to its brand
entity for use on digital networks. They are name, manufacturer and price, which can then be
used to provide current information, including integrated into the grocery store's inventory
where they (or information about them) can be control and revenue reporting systems, the
found on the Internet. The purpose of this distributor’s inventory control, delivery and
paper is to introduce the Digital Object revenue reporting systems, etc. All of these
Identifier (DOI) -- what it is, how it works and systems communicate with each other
why its adoption could facilitate e-commerce successfully, even between different companies,
and Digital Rights Management (DRM). because they all use the same universal identifier
for a given product. The DOI provides a stable,
persistent link between content and a directory on
Introduction : the Internet to which the content owner wishes
the DOI to point. As a persistent identifier, the
The Digital Object Identifier (DOI) is an DOI has advantages over a non-persistent
Internet based global naming and Uniform Resource Locator (URL). Unlike a URL,
resolution system that provides for the which points to a piece of content based on its
precise identification, retrieval, and location on a computer connected to the Internet,
trading of digital items in the form of the DOI identifies a piece of content by a
articles, books, images, bibliographies, permanent number that is independent and that
supporting data, videos, charts, tables, audio, and never changes once it is assigned to the content.
other electronic files. Development of the Instead of pointing to the location of the specified
DOI system began in 1996 when content (e.g., Web site address or URL), the DOI
content creators and technologists points to a directory on the Internet, which in turn
jointly recognized that information and redirects the user’s browser to the current location
entertainment objects could not be of the specified content. Instead of encountering
commercially distributed on the Error 404, which occurs when a web browser
Internet unless there was a common attempts to resolve to a URL that has changed or
system of unique identification for no longer exists, the DOI will persistently point to
those objects. These early stakeholders the content itself. As long as the URL to which
envisioned an unambiguous, machine- the DOI points is maintained in the central DOI
readable identifier that could be used directory, a DOI link survives when content is
for all electronic communications and moved to a different server or ownership of the
transactions involving content work is transferred from one party to another. The
throughout its life cycle, including its underlying technology for this central DOI
creation, editing, publication, Directory is called the Handle System3, and when
distribution, and archiving. it receives a DOI request from a user’s browser, it
translates or “resolves” that DOI to the specific
location which the publisher has specified for it, where:
and then automatically re-routes the user’s
browser to that location. 10.1234 is the prefix:
It is also possible for a DOI to reference
content that does not exist in static form, i.e., does 10 is the directory code. The directory code must
not exist as a specific file on an Internet server. A be numeric; valid directory codes are determined
DOI can point, for example, to a piece by the maintenance agency (the International DOI
of software that offers the user a subscription to Foundation). At present, the only valid directory
an information service, to a Digital Rights code is 10 and all valid DOI's begin with "10.".
Management function that offers the content in 1234 is the registrant's code or publisher ID,
exchange for registration and/or payment, content identifying the registrant. In this case, the number
is available by some other means. or to a notice identifies the International DOI Foundation.
that the requested The DOI’s syntax also creates 181 is the suffix, or item ID, identifying the
the opportunity for interoperability among media single object. (Typical suffixes are longer than
types, including text, music, film, video, this example.)
photograph, software, etc. The fact that the DOI
can incorporate other numbering systems like The prefix is assigned by a DOI Registration
ISBN, EAN or BICI will enable companies in Agency to a specific registrant. The suffix is
different media to work with each other’s content. assigned by the registrant and must be unique
For example, if a journalist embeds an MP3 file within a prefix. It can integrate existing standard
(the format for digitized music files) containing a identifiers such as an ISBN or ISSN, or SICI. The
DOI into a news story, the DOI identifies the DOI is case insensitive and is considered an
publisher of the work, provides a path to the "opaque string": nothing can be inferred from the
current owner and can authenticate the rights number with respect to its use in the DOI System.
holder. The DOI therefore facilitates the act of
attributing ownership and/or authorship in a The correct way to cite a DOI on a webpage or in
powerful manner. The usage rights the owner is a publication is doi:10.1234/181.
willing to grant pricing and sales transactions, for
example-can still be controlled on an individual The alpha numeric character set includes
basis, even if the MP3 file is re-sold or re-used in Universal Character Set (UCS-2), of ISO/IEC
another context. Alternatively, the DOI can 10646, which is the character set defined by
simply provide a mechanism for the user to Unicode v2.0. The UCS-2 character set
determine who actually holds the rights and to encompasses most characters used in every major
discover what other information is available about language written today.
the specified content and what other uses the
rights holder has permitted.
Intended Benefits:
Structure :
DOIs were developed with a few primary
intended benefits:
The DOI consists of a unique alphanumeric
character string divided into two parts: a prefix
• Persistent identification: each DOI
and a suffix.
unequivocally and permanently identifies the
object to which it is associated.
An example of a complete DOI is:
• Protection of intellectual property: the
assignment of a permanent DOI identifier to a
10.1234/181
work may assist its creator in protecting The ability to reference content of any type
copyrights to the material. and at any level of granularity. Many existing
• Network actionability: through Handle numbering systems only apply to specific units of
System technology, each DOI resolves to one or content, e.g., ISBN for books, ISSN, for serials,
more web pages assigned by the publisher SICI for journal articles, CUSIP for securities.
• Semantic interoperability: metadata allow to The ability to subsume existing numbering
unambiguously communicate – to any user, from schemes, so those publishers that use them may
any place, at any point of the continue to do so.
productive/distributive chain – all the pieces of The DOI provides this persistent link by
information about the related objects and their providing a resolution-and-routing system similar
hierarchical relationships in concept to the Internet’s Domain Name System
(DNS). Just as DNS recognizes and routes
Applications: domain names to the appropriate Internet address,
the DOI system resolves and routes the DOI to
FACILITATING THE E-COMMERCE MARKET the appropriate locations specified by the
publisher. This might be the location of the
DOIs are becoming a key to the creation of a content itself, or it might be the publisher’s
viable e-commerce market for digital content, response page confirming that the user has found
one of the main goals of the DOI creators. what s/he was looking for and offering further
Computers cannot easily communicate with each choices, or it might be any other destination the
other unless they can recognize a common, publisher wishes to specify. However, DOI goes
unique identifier for the item about which they beyond the DNS by allowing resolution of a
are communicating. In order for e-commerce to single DOI to multiple data or multiple DOIs. For
be fully and seamlessly integrated into instance, the DOI of an eBook might also take the
mainstream publishing processes, publishers will customer to related content such as press reviews,
be using the DOI throughout the life cycle of their author interviews and eBooks on similar topics.
content. A typical cycle can include preparation,
formatting, publication, syndication, distribution, PROTECTING COPYRIGHT, PREVENTING
aggregation, retail sales and/subscriptions, royalty PIRACY, AND MORE:
computation, rights/permissions grants, sales The development and deployment of the
tracking, financial reporting and use. Admittedly, DOI was also designed to enable automated
an e-commerce marketplace is still viable without copyright management, which should offer
the use of persistent unique identifiers for digital additional copyright protection, preserve data
content, just as some retail stores are able to sell integrity and help prevent piracy.
physical goods without the aid of the now- The minimum elements of metadata about
ubiquitous bar code that can include certain the identified objects that are registered in the
information about the product. The key benefit of DOI system will be publicly available so that the
the DOI is that information can be linked to identification of each entity will be unambiguous.
inventory and accounting processes. In the This data and the publisher’s addition of rights
publishing field, there is no other identification metadata5, would allow the DOI to lead the user
scheme that has a combination of attributes that to a variety of copyright and permissions
are necessary to enable e-commerce. The primary information (for instance, the copyright owner,
required attributes are4: how the content may be used, what format is
The ability to provide a guaranteed online allowed and a pathway to a licensing center).
reference to content. The infrastructure that Because the DOI can be persistently associated
supports legacy-numbering systems like ISBNs with this metadata, the rights holder can consider
would need significant modification to do this. conducting copyright and permissions clearance
across the Internet and can also utilize security The most effective application of the DOI,
applications that locate unauthorized postings of however, must start before the stage at which
proprietary content on the Web. Permission rights DRM swings into action. Not only must
in a robust DRM system must consistently and publishers’ internal content management systems
precisely define terms of access or use, such as, incorporate the DOI at the earliest stages of the
“I, the rights holder, grant you the right to creation and production process, it must also
reproduce in electronic format all illustrations in integrate with the rights management system.
the printed version of the book.” A Publishers must recognize quality assurance of
comprehensive DRM system also needs to metadata as “mission critical” to the organization.
identify the multiple “views” of one object (for (DRM) systems enable copyright protection,
instance, the written musical composition, the distribution, usage and payment for digital
performance, the printed publication, use in a content such as text, music, images or software
movie and so forth). The resolution of these rights via any electronic medium."
must point to the appropriate entity that controls
each one. Finally, an effective identifier and How the DOI Complements Digital Rights
copyright management system must be able to Management (DRM)
support all of these permission links even if one The SIIA defines "Digital Rights
small segment of the content is reassigned to Management" as follows: "Digital rights
another rights holder. The DOI system is management (DRM) systems enable copyright
structured to operate successfully within all of protection, distribution, usage and payment for
these identifier challenges. digital content such as text, music, images or
software via any electronic medium." The full
promise of Digital Rights Management (DRM)
goes well beyond preventing unauthorized use.
How the DOI Complements Digital Rights
DRM – especially when combined with the DOI
Management (DRM)
– is a means for content users to purchase and
share content in a number of new and different
The SIIA defines "Digital Rights Management" as
ways while being assured of the authenticity of
follows: "Digital rights management The DOI
the content and respecting the rights of the
provides a way for them to at least agree on how
content provider.
content items are identified. This is a necessary –
At heart, DRM systems track various types
though not sufficient – requirement for DRM
of information about content, who owns it, who is
interoperability.
using it, and so on. There are many DRM
These capabilities require successful
solutions on the market, but none of them can
integration of many different systems before cross
interoperate without agreement on how these
platform interoperability can be achieved. DRM
kinds of information are represented. The DOI
can therefore be utilized with the DOI, which
provides a way for them to at least agree on how
brings each application together by using a
content items are identified. This is a necessary –
common, universal identifier for the content being
though not sufficient – requirement for DRM
managed by a DRM tool. A DOI represents an
interoperability.
“actionable identifier,” that provides a reliable (or
These capabilities require successful
persistent) link back to the content owner’s
integration of many different systems before cross
server, a rights clearinghouse, an online sales
platform interoperability can be achieved. DRM
outlet, or some other provider of DRM and/or e-
can therefore be utilized with the DOI, which
commerce services. This ability of the DOI to
brings each application together by using a
“phone home” for access is a very powerful tool
common, universal identifier for the content being
for DRM.
managed by a DRM tool. A DOI represents an
“actionable identifier,” that provides a reliable (or the content resides in order to complete a
persistent) link back to the content owner’s transaction successfully. That place may be the
server, a rights clearinghouse, an online sales content owner’s web site, or another site that the
outlet, or some other provider of DRM and/or e- content owner may designate as its rights
commerce services. This ability of the DOI to clearinghouse or ecommerce vendor. Regardless
“phone home” for access is a very powerful tool of the content’s location, a DOI will always
for DRM. The most effective application of the successfully resolve (point) to the content, even if
DOI, however, must start before the stage at that content’s location changes over time,
which DRM swings into action. Not only must provided the metadata that is maintained in the
publishers’ internal content management systems DOI directory is changed to reflect the content’s
incorporate the DOI at the earliest stages of the new location. The persistent DOI allows any user
creation and production process, it must also of the content to access further information about
integrate with the rights management system. the content, purchase new editions of the content,
Publishers must recognize quality assurance of or see related content as long as the DOI directory
metadata as mission critical” to the organization. is properly maintained and updated.
Figures 2A-2B show how broken URLs occur
DRM PROCESS FLOW: today and show how the DOI solves this
problem.8
Figure 2A – URLs point to the location of the
content…