Gary Cleveland
IFLA UDT Core Programme
March, 1998
International Federation of Library Associations and Institutions
The IFLA Core Programme on Universal Dataflow and Telecommunications (UDT) seeks to facilitate the international and
national exchange of electronic data by providing the library community with pragmatic approaches to resource sharing. The
programme monitors and promotes the use of relevant standards, promotes the use of relevant technologies and monitors
relevant policy issues in an effort to overcome barriers to the electronic transfer of data in library fields.
Email: [email protected]
Definitions, Issues and Challenges
The idea of easy, finger-tip access to information, what we conceptualize as digital libraries today, began with Vannevar Bush’s Memex machine (Bush, 1945) and has continued to evolve with each advance in information technology. With the arrival of computers, the concept centered on large bibliographic databases, the now familiar online retrieval and public access systems that are part of any contemporary library. When computers were connected into large networks forming the Internet, the concept evolved again, and research turned to creating libraries of digital information that could be accessed by anyone from anywhere in the world. Phrases like “virtual library,” “electronic library,” “library without walls” and, most recently, “digital library,” have all been used interchangeably to describe this broad concept.

But what does this phrase mean? What is a digital library? And what are the issues and challenges in creating one? Moreover, what are the issues involved in creating a coordinated scheme of digital libraries? It has been suggested that digital libraries will only be viable within such a scheme (Chapman and Kenny, 1996). This paper provides a very high-level overview of digital libraries and briefly addresses each of these questions in turn.

1. WHAT IS A DIGITAL LIBRARY?

What is a digital library? There is much confusion surrounding this phrase, stemming from three factors. First, the library community has used several different phrases over the years to denote this concept (electronic library, virtual library, library without walls), and it was never quite clear what each of these different phrases meant. “Digital library” is simply the most current and most widely accepted term and is now used almost exclusively at conferences, online, and in the literature.

A second factor adding to the confusion is that digital libraries are at the focal point of many different areas of research, and what constitutes a digital library differs depending upon the research community that is describing it (Nurnberg et al., 1995). For example:

• from an information retrieval point of view, it is a large database
• for people who work on hypertext technology, it is one particular application of hypertext methods
• for those working in wide-area information delivery, it is an application of the Web
• and for library science, it is another step in the continuing automation of libraries that began over 25 years ago

In fact, a digital library is all of these things. These different research approaches will all add to the development of digital libraries.

Third, confusion arises from the fact that there are many things on the Internet that people are calling “digital libraries,” which from a librarian’s point of view are not. For example:

• for computer scientists and software developers, collections of computer algorithms or software programs are digital libraries.
• for database vendors or commercial document
suppliers, their databases and electronic document delivery services are digital libraries.
• for large corporations, a digital library is the document management systems that control their business documents in electronic form.
• for a publisher, it may be an online version of a catalogue.
• and for at least one very large software company, a digital library is the collection of whatever it can buy the rights to, and then charge people for using.

A fairly spectacular example of what many people consider to be a digital library today is the World Wide Web. The Web is a gathering of thousands and thousands of documents. Many would call this huge collection a digital library because they can find information, just as they can do banking in a “digital bank” or buy compact discs in a “digital record store.” Yet, is the Web a digital library? According to Clifford Lynch, one of the leading scholars in the area of digital library research, it is not. Lynch (1997:52) states:

    One sometimes hears the Internet characterized as the world's library for the digital age. This description does not stand up under even casual examination. The Internet and particularly its collection of multimedia resources known as the World Wide Web was not designed to support the organized publication and retrieval of information as libraries are. It has evolved into what might be thought of as a chaotic repository for the collective output of the world's digital "printing presses." ... In short, the Net is not a digital library.

Thus, in examining the various examples of what are called digital libraries, it appears that librarians have been confused about what a digital library is, and that the word “library” has been appropriated by many different groups either to describe their areas of research or to signify a simple collection of digital objects.

So what is a working definition of “digital library” that makes sense to librarians? As a starting point, we should assume that digital libraries are libraries with the same purposes, functions, and goals as traditional libraries: collection development and management, subject analysis, index creation, provision of access, reference work, and preservation. A narrow focus on digital formats alone hides the extensive behind-the-scenes work that libraries do to develop and organize collections and to help users find information.

The institutions involved in the American Digital Library Federation came up with a similar notion of “digital library.” It also emphasizes the traditional underpinnings of libraries (selection, access, and preservation) as well as the fact that digital libraries will necessarily be constructed to serve particular communities (Waters, 1998):

    Digital libraries are organizations that provide the resources, including the specialized staff, to select, structure, offer intellectual access to, interpret, distribute, preserve the integrity of, and ensure the persistence over time of collections of digital works so that they are readily and economically available for use by a defined community or set of communities.

With the assumption that digital libraries are libraries first and foremost, we can list some characteristics. These characteristics have been gleaned from various discussions about digital libraries, both online and in print (see Arms, 1995; Graham, 1995a; Chepesuik, 1997; Lynch and Garcia-Molina, 1995):

• digital libraries are the digital face of traditional libraries that include both digital collections and traditional, fixed media collections. So they encompass both electronic and paper materials.

• digital libraries will also include digital materials that exist outside the physical and administrative bounds of any one digital library

• digital libraries will include all the processes and services that are the backbone and nervous system of libraries. However, such traditional processes, though forming the basis of digital library work, will have to be revised and enhanced to accommodate the differences between new digital media and traditional fixed media.

• digital libraries ideally provide a coherent view of all of the information contained within a library, no matter its form or format

• digital libraries will serve particular communities or constituencies, as traditional libraries do now, though those communities may be widely dispersed throughout the network.
1. digitization, converting paper and other media in existing collections to digital form (discussed in more detail below).
2. acquisition of original digital works created by publishers and scholars. Example items would be electronic books, journals, and datasets.
3. access to external materials not held in-house by providing pointers to Web sites, other library collections, or publishers’ servers.

While the third method may not exactly constitute part of a local collection, it is still a method of increasing the materials available to local users. One of the main issues here is the degree to which libraries will digitize existing materials and acquire original digital works, as opposed to simply pointing to them externally. This is a reprise of the old access versus ownership issue, but in the digital realm, with many of the same concerns, such as:

• local control of collections
• long-term access and preservation

What about digital collection building in a coordinated scheme? There are many reasons why building digital collections is a good candidate for coordinated activity. First, acquiring digital works and doing in-house digitization are expensive, especially to undertake alone. By working together, institutions with common goals can gain greater efficiencies and reduce the overall costs involved in these activities, as was the case with retrospective conversion of bibliographic records. Second, working together reduces the redundancy and waste of acquiring or converting materials more than once. Third, coordinated digital collection building enhances resource sharing and increases the richness of collections to which users have access.

• the priorities of user communities. Such priorities will justify holding the materials locally, for example, because of the demands of a curriculum

• manageable portions of collections. When there are no other overriding criteria, then material can be divided up among institutions simply according to what is reasonable for any one institution to collect or digitize

• technical architecture. The state of a library’s technical architecture will also be a factor in selecting who digitizes what. A library must have a technical architecture up to the task of supporting a particular digital collection.

• skills of staff. Institutions whose staff don’t have the necessary skills can’t become a major node in a national scheme.

Yet, no matter how a collection is built (of materials digitized in-house, of original digital works, or by providing access to materials by pointing to other external resources), libraries in a collective must ensure it is preserved and made available in perpetuity. For example, if the only copies of digital works reside on a particular publisher’s server, then what happens if the publisher goes bankrupt? Or if the market value of a particular work approaches zero? What if all or part of a digital collection of a library were lost, such as through some catastrophic event? Ensuring long-term preservation and access will require policies and a scheme by which redundant permanent copies are stored at designated institutions. Preservation issues will be discussed further later in the paper.
• their potential for long-term use
• their intellectual or cultural value
• whether they provide greater access than

The fifth issue is related to metadata. It is the problem of naming in a digital library. Names are strings that uniquely identify digital objects and are part of any document’s metadata. Names are as
