CILIP - AI Report - Final High Res
CILIP - AI Report - Final High Res
Monograph:
Cox, A. orcid.org/0000-0002-2587-245X (2021) The impact of AI, machine learning,
automation and robotics on the information professions: a report for CILIP. Report. CILIP
Reuse
Items deposited in White Rose Research Online are protected by copyright, with all rights reserved unless
indicated otherwise. They may be downloaded and/or printed for private study, or other acts as permitted by
national copyright laws. The publisher or other rights holders may allow further reproduction and re-use of
the full text version. This is indicated by the licence information on the White Rose Research Online record
for the item.
Takedown
If you consider content in White Rose Research Online to be in breach of UK law, please notify us by
emailing [email protected] including the URL of the record and the reason for the withdrawal request.
[email protected]
https://eprints.whiterose.ac.uk/
The impact of AI, machine learning,
automation and robotics
on the information professions
A report for CILIP
Research report: The impact of AI, machine learning,
automation and robotics on the information professions
About CILIP
CILIP is the UK library and information association. We are
the only chartered body in the world dedicated to uniting,
supporting and advocating for information professionals
and librarians – the people who help the world make better
decisions. Our membership is open to everyone working in
libraries, information or knowledge management, data science
and analytics or a related professional role. We work with
employers, learning providers and suppliers across the library
and information sector in the UK and internationally to develop
talent, promote innovation, encourage workforce diversity
and ultimately to secure the long-term future of our profession.
Reference group
The author would like to extend this thanks to the following
colleagues who generously took time to share their insights with
the project:
Stephen Ayre (William Harvey Library, George Eliot Hospital),
Ken Chad (Ken Chad Consulting), Alex Fenlon (Library Services,
The University of Birmingham), Julie Glanville (Independent
consultant in Information Retrieval), Felix Greaves (Director,
Science, Evidence and Analytics at National Institute for
Health and Care Excellence (NICE)), Jake Hearn, Timothy Jacobs
(The Christie NHS Foundation Trust), Indra Joshi (Director AI, NHS),
Niel Kempson, Tony Lewis (TmL Consultancy), Stephen Phillips
(vice chair CILIP K&IM committee), Alexandra Pooley (Chair SLSIG),
Mia Ridge (Digital Curator, Western Heritage Collections, Digital
Research team, British Library), Hélène Russell (The Knowledge
Business), Tony Russell-Rose (Goldsmiths, University of London),
Edward Saperia (Newspeak House), Philip Scott (Reader in Health
Informatics, School of Computing, University of Portsmouth),
Matthew Soare (Makerspace Development Manager, Hull Central
Library), Alex Smith (Global RAVN Product Lead), Marion Spring
(Associate Director, Information Services, National Institute for
Health and Care Excellence (NICE)), Martin White (Managing
Director, Intranet Focus Ltd).
Contents
03 Executive summary
The context
The brief
3.1 Intelligent web and mobile search: the need for algorithmic literacy
3.2 AI interfaces to existing knowledge discovery systems
3.3 AI in knowledge discovery: the need for socio-technical infrastructures
CASE STUDY Living systematic reviews
CASE STUDY Library research collections and the creation of a community of data scientists
3.8 4th Industrial Revolution automation: the need for AI, robotics and data literacy
CASE STUDY Create Crates
4.1 The hype challenge: the need for understanding and a vision
4.2 The ownership challenge: the need for collaboration
4.3 The procurement challenge: the need for expertise in licensing, copyright and IPR
4.4 The data challenges: the need for data stewardship
4.5 The technical infrastructure and workflow challenges: the need for information
architecting
4.6 The post-implementation challenges: the need for support, training and promotion
4.7 The ethical use challenge: the need for responsible use rooted in professional
values and ethics
42 Recommendations
For CILIP
For information services and libraries
For individual information professionals
For educational institutions at all levels
For other training providers
For research
44 References
48 Appendixes
Executive summary
The primary function of the outputs of link between AI and the wider digital
the research will be to drive the refresh transformation, and its interrelations with
of CILIP’s Professional Knowledge and other social changes. The complexity of
Skills Base, and from this the “ecosystem” the issues is summarised in chapter 1.
of teaching, learning and professional
Chapter 2 provides an overview of how
development for current and future
AI and robots are relevant to the work of
information, knowledge management
the profession. This arises particularly from
and library professionals.
how it might change how text and other
The data for the project was an extensive forms of content are described, searched
literature review and conversations with and used. This could constitute a shift from
twenty-one experts from across the UK the paradigm of searching for items to
profession conducted by the author in read manually to the model of extracting
November and December 2020. knowledge from collections of text, images
and other data through algorithms. But
In the interests of direct expression the
there are changes happening across
phrase “AI and robots” is used in the
the whole information value chain: in its
report to define the scope of technologies
production, organisation and consumption.
under consideration, although there was
The technologies open up exciting
little consensus about terminology in the
opportunities for a more information rich
literature or among interviewees.
world. However, they also pose a large
For many reasons this is a complex topic: number of ethical concerns. They will also
partly because of the lack of agreed have a potential impact on jobs, including
terminology, but also because of the on the work of information professionals.
emotive responses it evokes, the sectoral
patchiness of its impact, the complex
The most obviously relevant change Many of the competencies needed to take
AI has effected is in web and mobile these opportunities already exist at some
search, so chapter 3, which identifies level in the profession and are reflected
the applications of AI and robots in in the CILIP Professional Knowledge and
information, knowledge management and Skills Base (PKSB). Chapter 5 presents For many
library work, starts with this. Changes to these. A longer discussion is offered about reasons this
web and mobile search imply a need for the repositioning of the profession in
what has been called algorithmic literacy, relation to computational thinking, data
is a complex
as an aspect of information literacy. AI analysis and data science, the fusion skills topic: partly
will also increasingly appear in interfaces to manage the relations between humans because
to knowledge discovery systems licensed and AI, and the soft skills that AI are unlikely of the lack
or run by information services. However, to develop in the near future. A concept of
it is in its impact on knowledge discovery data stewardship is also defined.
of agreed
that AI will have the greatest impact. What terminology,
The information, knowledge management
types of socio-technical infrastructure
and library professional is in a strong but also
are needed to support this and how because of
position to help organisations move into
information professionals might be
involved varies a lot between contexts.
the 4th Industrial Revolution deploying AI the emotive
and robots, which is reiterated at the start responses
Conversational agents or chatbots offer
of chapter 6. Yet some vulnerabilities
a new means to communicate with
remain, particularly in relation to adjacent it evokes.
users. AI might also be cautiously used
professions. These vulnerabilities can be
in managing people, but the ethical
partly addressed through the thirteen
issues here are significant. Automation
recommendations the report makes
of routine administrative tasks including
to six different stakeholder groups.
within AI pipelines is facilitated through
Of particular importance are:
robotic process automation. Sensor data
could be used to make buildings such as o For CILIP to articulate the relevance
libraries smarter to improve design and of the profession’s skills, values and
for wayfinding. Four potential uses of ethical principles, to identify pathfinder
robots in libraries are in handling books, organisations and people who encapsulate
acting as an information desk, for learning, these possibilities, and to promote
and for generic tasks such as cleaning or knowledge sharing within the profession.
through robotic furniture. Finally, there is
a potential library role in promoting public
o For individual information professionals
and information organisations to explore
understanding of AI and robots, through
the use of AI tools and share what they
AI and data literacy. Case studies provide
learn with others across the profession.
more in-depth narratives around some of
the key applications and how information o For the profession to use the
professionals can be involved. technologies in their own work and to
support information users in engaging
Chapter 4 suggests how opportunities
productively and safely with them for
for information professionals are
social good.
created through the need to overcome
organisational challenges to reaping the
benefits of AI and robots, especially in
knowledge discovery. These include a
vision; collaboration; expertise in licensing,
copyright and IPR; data stewardship;
information architecture; support, training
and promotion; and responsible use rooted
in professional values and ethics.
o The global market for industrial and non- With these opportunities come
industrial robotics is predicted to grow from considerable risks– for example, a
£22bn in 2018 to £178bn in 2025; specialist UCL research unit has already
found examples of Internet-connected
o The number of ‘Internet of Things’ smart smart home devices being used for The scale
connected devices is predicted to grow
coercive control in the home by abusive and pace
from 27bn in 2017 to more than 125bn
partners. The literature and news are
in 2030.
full of many examples of unaccountable
of the 4th
Every day, the technologies associated with algorithms and AI-driven decisions which Industrial
the 4th Industrial Revolution – AI, machine exacerbate human failings such as sexism Revolution
learning, automation and robotics – are and prejudice. will
finding new applications. Examples include: The role of information professionals inevitably
o Fraud/risk detection and management has always been to enable our users and disrupt
in financial services communities to profit from new advances
in technology and to make better use of the library,
o Regulatory compliance at scale (for information for their own advancement. information
example with Data Protection legislation) This was as true of the cuneiform & knowledge
o Decision support collections of antiquity as it was of the management
great book and print collections of the past
o Automation of inventory across century as it will be of the vast collections workforce.
supply-chain, logistics and stock
of data and personal information being
management
generated and shared online.
o Use of chatbots to automate customer The opportunity and the challenge for
service enquiries
CILIP as a professional association is to
o Use of self-help apps, for example for ensure that we are able to use our ‘pipeline
self-management of long-term conditions of talent’ to prepare the information
professional workforce to play this role
once again in the 4th Industrial Revolution.
The brief:
Specifically, there are three key questions
The role of the information, which we will need to be able to answer in
knowledge management and order to ensure that our profession is ready
library professions. and able to fulfil this role:
The tremendous scale and pace of the o How do we ensure that today’s
4th Industrial Revolution and its associated workforce has the skills and understanding
technologies will inevitably disrupt the they need in order to enable them to
library, information and knowledge support their users in participating safely
management workforce. In 2017, McKinsey and successfully in a modern world that
estimated that 50% of current work is increasingly powered by AI, machine
activities were technically automatable, learning, process automation and robotics?
and that 60% of jobs have at least 30%
of their responsibilities that are capable o What are the ethical implications of our
of being automated. approach to these technologies – how can
we deploy the existing ethical framework
The opportunities are potentially for librarians and ensure that it aligns
immense – people will lead increasingly to emerging work on data ethics and
information-rich lives in which they can responsible technology?
use data and analytics to improve their
health, be more productive and release o What should the skillset of the future
more time for family, friends and leisure workforce look like and what is the
activities. In so doing, they will need expert, curriculum by which we will ensure
ethical and accountable information that the next generation of information
professionals to help them select and use professionals has the skills to keep pace
the right technologies, to maximise their with future developments in technology?
information-handling skills and to keep
them and their families safe from harm.
context, Cox (2021) identifies a wide range because of its connotations or lack of
of applications in teaching (Intelligent agreed definition. Yet other terms, such
Tutoring Systems, conversational agents as machine learning, although an important
in MOOCs, plagiarism detection), research reference point, are too narrow. For the
(text and data mining, robot scientists, purposes of the report we will prefer the
automated peer review) and university term AI because it usefully captures the
administration and estates management range of technologies supporting 4th
(the smart campus). This means that Industrial Revolution automation. The
information professionals embedded in term AI is often used in the literature to
their sector may see very different patterns encompass robots, but sometimes it will
of change happening around them. be useful to differentiate AI and robots.
Information professionals work in every
There is also a blurred boundary between
sector so a comprehensive perspective
the bundle of technologies addressed
is impossible.
in the report and others, such as virtual
reality (VR) and augmented reality (AR) that
AI is an umbrella term, are also making their own way through
and journal articles, grey literature, internal to search across whole collections as
know-how or even text harvested from datasets rather than pages, as Ridge (2019)
conversations online. Difficult to digitise puts it. Publishers are increasingly using AI
parts of texts such as tables may be easier for content enrichment, which automates
to process (Cordell, 2020). AI is improving bringing together content from different Having some
the digitisation of manuscripts through sources for the reader (67 bricks, 2017). grasp of AI
recognition of hand-written manuscripts.
Sentiment analysis even seeks to identify
As well as processing unstructured texts, techniques
other forms of information are handled will be
different types of feeling in texts.
better than before through AI technologies,
The ability of AI to analyse the content of thus computers are able to: important
texts may shift search away from being to many
1. Process inputs in the form of human
primarily through structured bibliographic
voice (e.g. voice search) and also to output information
databases to search of full text items or professional
in voice form.
whole collections using multiple potential
algorithms. The paradigm could be seen as 2. Identify the subject of images and jobs.
shifting from searching for an item (like a categorise them better, as in a search
book or online report) to read manually to engine being able to identify what an image
mining for knowledge in whole collections is of or enable searching using an image
of text data: from close reading to distant rather than a text term, or as in facial
reading. If this is the case, having some recognition, where it is able to match a
grasp of AI techniques will be important human face to a photograph. Computer
to many information professional jobs. vision is the area of computing which deals
Potentially, work information professionals with the extraction of information from
do through creating descriptive metadata images and video.
might become less necessary as
This means that computers can accept
computers perform this task, though this
a wider range of inputs as data, including
is debatable. The change will not reduce
images, sound recordings and videos.
the need to understand the ecologies of
Natural Language Processing (NLP) is
the production of information in order to
the term for turning text or audio into
conduct an effective search or the need
structured information. They can also
to critically evaluate results based on this
interact with people to receive and give
understanding. Yet as Tony Russell-Rose
data through voice or images. Chatbots
suggested in the interview for this report,
can draw on a knowledge base to interact
AI will impact on the production of texts
with a human in a more or less human
throughout the information lifecycle or
interactional style.
value chain:
Machine learning means that computers
o In producing texts. The first scholarly can be given training data from which they
text composed entirely by a computer
can identify useful patterns, without those
synthesising existing literature was
patterns necessarily being determined by
published by Springer in 2019. Publishers
a human in advance. Supervised machine
are widely using AI to manage the peer
learning is where training data that has
review processes (Thelwall, 2018).
been manually labelled is used to help the
o In acquiring texts into collections, computer to learn to recognise patterns.
e.g. in crawling for data or processing text In unsupervised machine learning, the
algorithm searches for patterns without
o In categorising texts and knowledge being given training data.
discovery. New tools will exist to
describe the content of a single text or There is nothing new to automating
a large collection. business processes, but what robotic
process automation (RPA) offers is relatively
o In curating and reuse of text. user-friendly tools to stitch together
o In consuming text. Users will be processes across different platforms.
supported to access text through AI This promises to make it easier to automate
tools such as summarisation which daily processes.
provides them quick access to content These abilities widen the range of data
or recommendation and personalisation that computers can accept, processes
whereby appropriate content is pushed they can carry through and outputs they
towards them. They may also be enabled
can generate. It allows them to do things Equally the need for a critical awareness
in some sense akin to human thinking, about how knowledge is generated
learning, decision making and interacting. grows, and the information, knowledge
management and library profession has
The ability to train AI rests on the availability
Biases in of data. AI is premised on “big data”. As
an important part to play in this.
training data the points above illustrate, the range of Nevertheless, it is important to recognise
(and in the inputs and outputs that computers can that in many contexts these developments
computing handle has expanded to unstructured remain potential. The admittedly
text, voice and images. In this sense the fragmentary evidence suggests a great
industry definition of “data” has expanded. What interest in the technologies in all sorts
itself) librarians might call collections and more of organisations in the UK (AIIM Industry
systema- latterly might have been referred to as Watch, 2018; British Computing Society,
tically “content” can now be seen as data (Padilla 2020; Carter, 2019). But as section 3 made
et al., 2018). Books and journal articles clear, different sectors are at different
reinforce are data, as are grey literature, know-how, stages of acceptance of the technologies.
existing photo collections and archives of email or
unfair social online forums. This is not just an important
shift in language. This is a critical change
2.2 The ethics of AI
structures.
because many professional distinctions In addition to questions about the maturity
rest on differentiating types of content, of the technology, there is intense concern
such as between published books and at the societal level around the ethics of
articles in libraries, unique manuscripts in AI. Jobin et al (2019) report on no less
archives and historical objects in museums. than 84 statements attempting to define
Increasingly the boundaries blur and what constitutes ethical AI, including by
issues converge. the House of Lords, the European Union
and the OECD; the Royal Society and IEEE;
In addition, the digital shift has led to a
as well as Google, IBM and Microsoft. The
growing range of other forms of computer-
timeline in Fjeld et al. (2020) suggests that
accessible data. Online activities leave data
these concerns emerged in 2018 and 2019.
traces. Sensors embedded in objects and
So the impact of AI is surrounded by a
buildings gather more data about the world,
storm of controversy in society as a whole
including ourselves. We carry devices such
(Centre for Data Ethics and Innovation, 2020;
as smartphones that track our own activity,
Equality Task Force, 2020; Floridi & Cowls,
location and movements. Indeed, even
2019) as well as within the information
human feeling is claimed to be data for
profession itself (Association of Research
affective computing. This is associated with
Libraries, 2019; Cordell, 2020; IFLA, 2020;
a societal re-evaluation of data, with claims
Padilla, 2019). Despite this evidence of
that data is the “new oil”. Actually, this claim
concern, there is more to say because
was first made as early as 2005 by the
most of these statements originate from
mathematician involved in developing
the global North and questions remain
Tesco’s supermarket reward scheme, the
about the implementation of these codes
Clubcard. His point was that oil is valuable,
in practice (AINow, 2019). The following
but needs to be refined. Similarly, issues
paragraphs summarise some of the
of data quality, of data description, data
main concerns.
management and governance apply in the
AI context. If the data used to train an algorithm
is based on a biased sample then it will
This is in many ways very exciting from an
learn a misleading pattern. If the data we
information, knowledge management and
train computers on only reflects our past
library perspective, because an AI-driven
faulty decisions, then it will just repeat our
and data-intensive society is one based on
past biases. Computers trained on white
information and knowledge. The amount
faces do not necessarily identify black
of data available to humanity is growing
faces. This raises issues of fairness and
and in many contexts AI is making it is
non-discrimination. Much of the literature
easier to find information. The potential
points to the way that these biases in
to search across different formats of
training data (and in the computing industry
information is increasing. Computers are
itself) systematically reinforce existing unfair
better at supporting our understanding
social structures.
through interacting with us orally and
presenting new visualisations of data.
Privacy is another concern. Because these A further dimension of the societal impact
technologies are embedded in other tools of AI and robots, is their potential impact
and sites as general purpose technologies, on work, including professional work.
the user may not be aware that they Section 3 already discussed the hard to
are gathering data about them, trying predict impact of the technologies on
to influence or nudge their behaviour or employment. The next section considers the
making decisions based on it. This raises implications for information and knowledge
issues of consent, the right to control over management and library profession.
how one’s data is used and the right to
be forgotten.
2.3 The threats to the profession
from AI and robots
In so far as we do have a sense that data
is being collected, AI can be creating
surveillance. Quite apart from our right to In addition to the wider issues and social
privacy, this awareness of surveillance can risks, AI and robots pose a number of
in itself change our behaviour, e.g. make us threats from a professional perspective.
reluctant to search for particular forms of One arises if AI can genuinely perform tasks
information. Surveillance impacts the right currently carried out by an information
to free expression. professional. For example, if it produces
Gathering data about us creates a summaries or indexing that are good
security risk, if it is compromised or stolen enough for their context of use, and
and appropriated. much more quickly than is possible to do
manually. Or if it can efficiently sort books
The operation of AI in personalising content and put them back on a library shelf. This
and supporting moderation has direct appears to be highly likely with very routine,
impact both on the freedom to form an predictable cognitive or manual work. It
opinion and on the right to free expression may do away with certain more routine
(Kay, 2018). work, especially those involving sifting vast
Because the technologies “learn” the amounts of data.
patterns it may not be easy for anyone, Currently most AI and robot work is
including system designers, to fully narrow. It requires a “human in the loop”.
understand and explain the outputs or Over time it may also prove to be possible
decisions they make. This opacity leads to to use AI and robots on more complex,
problems of explainability and transparency. unpredictable tasks, including those
How do we explain how the outputs of AI seemingly based on expert judgement. At
were arrived at? Who is accountable for a this point more skilled tasks may be lost in
mistake if a computer makes it? How can human jobs. For information professionals
an appeal be lodged? The shifting nature this represents another wave of fear of
of machine learning may also reduce the
The
“disintermediation”.
possibility of verifiability and replicability. information
There is also a risk that even though they profession
There are also fundamental issues are less fit for purpose, cheap to produce
around the human control over technology, and easy to use results from AI satisfy busy, have an
professional responsibility and the promotion unskilled users. There is even the risk that important
of human values (Fjeld et al., 2020). that information services are replaced by AI role in
These issues are particularly acute where even if the latter do not work so well at all, if weighing up
AI is used to make decisions that directly ill-informed decision makers fall for some of
the hype. Thus, AI implies the restructuring how ethical
shape people’s lives, influence or “nudge”
human behaviour, or wherever personal of work, in unpredicted and perhaps issues
data is involved. Indeed, some critics of AI disruptive ways. around AI
have gone much further in seeing AI as part However, because there are many apply in
of a wider movement towards surveillance challenges around AI in practice, there contexts
and the metrification of society, where only remain opportunities for information
things that can be counted are valued.
where
and knowledge professionals to
contribute. The opportunities can be more they are
Ultimately, it is vital that AI and robots
are used for social good. The information clearly understood if we examine the operating.
profession have an important role in applications of AI and robots relevant to
weighing up how these ethical issues apply information work.
in contexts where they are operating.
3.
3.1 Intelligent web and mobile Some ranking is determined by
organisations paying for a better placing.
search: the need for algorithmic 4. Search engine optimisation is used to
literacy promote results in search results.
The first domain where both users and 5. Search engine results are affected by
information professionals have already data voids, keyword appropriation, filter
encountered AI is in search engines, bubbles, echo chambers, deep fakes,
through the many novel features of automated content moderation, etc.
search that have appeared there first, 6. Confirmation bias and unconscious
such as auto-suggestion, auto-correction, bias may impact our own information-
recommendation and personalisation, and seeking behaviours.
most recently voice search (Fernandez,
2016). Some of these changes have already This is an area where most information
happened or are continuing to happen. professionals are well informed and already
In many ways they are welcomed by the engaged with AI, though they might not be
information, knowledge management and used to calling it that. It is a positive finding
library profession as increasing easy and of this report that information professionals
ubiquitous access to information. are at the forefront of promoting better
understanding of AI in mobile and web
Yet with our focus on Information Literacy search and many of the same issues apply
and as Google sceptics we immediately across the range of AI applications.
understand the limits of some of these
applications of AI: encapsulated in terms
such as keyword appropriation, filter 3.2 AI interfaces to existing
bubbles, echo chambers, and deep fakes. knowledge discovery systems
A critical evaluation of search results Just as the first applications of AI that we
requires an understanding of the hidden have probably experienced are in web
biases. The term algorithmic literacy and mobile search, so the most obvious
has been coined to refer to the deeper application is in information service
understanding of the operations of search discovery systems. Such tools have existed
engines needed by users to critically for some time in third party subscription
evaluate search results (Bakke, 2020; databases. They may now be offered for
Sander, 2020). library discovery systems. One example
To have algorithmic literacy would be to application that is being publicised at the
have an awareness that: time of writing is Yewno, which is being
integrated with ExLibris. This offers a new
1. Search engines are driven by commercial way to explore library content through
motives, not simply by a desire to offer visualised concept maps (Gramatica
access to information in an unbiased way, & Pickering, 2017).
so that search results are not simply ranked
by popularity. AI also has applications in supporting the
creation of descriptive metadata within
2. There are layers of network bias conventional knowledge discovery systems
(Johnson, 2020) in how search results are (Flannery, 2020) or to identify items to be
produced and presented. weeded because of lack of use (Wagstaff
& Liu, 2018).
A projects-pecific assembly of digital Temporary, researcher chosen. Temporary, ad-hoc. Background support.
1 content and tools.
A local content collection, such as a Locally developed, Permanent. Assembling content, training
2 know-how collection in a law firm. unpublished content. tools, supporting usage.
Digitised or born digital content in the Local unpublished content, Temporary for each project. Providing content,
3 special collections of a research library. often digitized. supporting usage.
A tool/ content agnostic infrastructure Any. For storage and tools. Building and promoting
5 within which bespoke collections of content infrastructure.
for different types of analysis are stored.
A support service that focuses on advice on None. Human rather than technical. Assembling expertise
8 tools, training and supporting collaboration. and building community.
5. Evaluating the items found, including assessing risk of bias 8. A support service that focuses on
in trial results advice on tools, training to use the tools
and supporting collaboration, but does not
6. Writing the synthesis provide a technical infrastructure. The focus
The concept of the living systematic review (a self-driving SR) here is on people.
encapsulates the ambition to automate the whole process, so that Thus the model could be a temporary
the review can always be up to date at the level of each guideline. project (1); an infrastructure licensed
Many of the individual steps in the process have been approached from an aggregator (6); a locally created
with a view to automating them, with some, such as collecting full infrastructure to support projects on valued
text or supporting human screening, being easiest to achieve. Tools content (3); or an infrastructure to support
have also been developed to assess risk of bias in trials or to identify content collection and analysis (5); an
sample sizes in reviewed papers. Views differ on time frame within ongoing integrated infrastructure (7); or
which the various steps in the process can be automated. It is likely something more akin to a community (8).
that there will be a human in the loop for the foreseeable future, It could focus on local content (2,3) or
and so they might be better referred to as computer assisted reviews published or licensed content (4,6,7) or
(O’Connor et al., 2019). There remain barriers to acceptance (O’Connor be content agnostic (5,8). The collection
et al., 2019), in funding of development, managing crowd sourcing of content could be made by an
and licensing issues. There are also critical ethical issues in terms of information service such as a library (2,3,7)
ensuring transparency and considering the issue of liability if an error by a third party (4,6) or by researchers (1,5).
is made based on a computer decision. It could focus on temporary (1) or long-term
In the conventional systematic review, the information professional purposes (7).
plays a critical role at the stage of understanding the topic, choosing It could focus on content; technical
databases to search, defining a precise search to ensure all relevant infrastructure or tools; on skills and
papers are found, and perhaps in retrieving results. One of the most collaboration; or purposes. Or it could seek
achievable applications of AI is to help screen hits, by allowing a user to to provide all these.
train it to sift a large number of potential hits rapidly. This means that
techniques of precise searching become less relevant. This is exactly In reality there might be a mixing of
where many information professionals have tended to fit into the these options, e.g. where some elements
process, so the effect could be to diminish their role significantly. Yet of support (8) are supplied alongside
they may be able to contribute to areas such as licensing, creating a content platform (2,3) or licensing a
workflows, training and support. publisher platform (6).
Some of the forces shaping how this service questions, saving time for more complex
might evolve would be: queries. For certain purposes the way that
the user can remain anonymous may be an
o The purposes of analysis advantage. Data from interactions can be
o Cost or type of cost mined for further information. They might
o Skills base of the information service be delivered through apps or on web sites.
and the users
Ethical issues with chatbots include how
far to make it explicit that the chatbot is not
3.4 AI interacting with users: human and also how to design them not to
It could offer services such as to allow Thus, it is easy to imagine robots being
users to book spaces or for wayfinding. It employed in library buildings for:
could be used to nudge behaviour, such as o Cleaning
to encourage users who have not moved
for a long time to take a break, for their
o Delivery, including by drones (Nath, 2018)
well-being. The further one moves into the o Building maintenance
realms of seeking to influence behaviour, o Robotic furniture (such as furniture that
the more smart technologies raise the types automatically adjusts to the height of the
of ethical concerns discussed in section 5.5. user or shelves that move down to the user)
The smart library might be part of a smart
or intelligent campus and ultimately a smart
city, with data being integrated across these
3.8 4Th industrial revolution
different levels (JISC, 2019a). Many scenarios automation: the need for AI,
of use are imagined in JISC (2019) and the
Intelligent Campus blog (intelligentcampus. robotics and data literacy
jiscinvolve.org/wp). Some are as simple as AI and robots are being applied in a very wide
providing integrated travel information. range of social and organisational settings:
Others are more futuristic. from supermarkets to governments.
3.7.2 Robots and libraries One important implication for all information
Vlachos et al. (2020) propose that there are professionals is that a basic form of literacy
three main uses of robots in a library context: that they need is understanding of AI and
for navigation, book location and placement; robots: they need AI and robotics literacy.
as information desks; and for learning (see Algorithmic literacy could be seen as a subset
also Tella, 2020; Zheng, 2019). Reflecting on of this. Given its dependence on big data, AI
the first usage, we can acknowledge that literacy also implies data literacy. As citizens
book returns sorters have existed for some and employees, information professionals
time. Other uses relating to traditional book need this understanding themselves. More
stock, are to: specifically, AI, robotics and data literacy
is something that libraries can play a role
1. Retrieve books within large book in promoting.
storage facilities (automated storage
retrieval systems)
2. Locate books on standard shelves
3. Scan shelves to identify misplaced
and lost books (Liau, 2019)
4. Move large quantities of books
(Liau, 2019)
An embodied robot could also be used to Create Crates
welcome or inform users. A telepresence Wakefield public libraries run a service called Create Crates, circulating
robot enables a remote user to navigate maker materials to libraries that have neither the space or budget to
around a space (Guth & Vander Meer, 2017). accommodate permanent makerspaces. Each of the crates is broadly
An example of a learning use of robots based on a Science, Technology, Engineering or Maths (STEM) subject
would be to build a robot, perhaps from and allows branches to deliver fun, creative, STEM-based activities
a kit, as an educational exercise within an that support young people’s educational and social development while
information service or makerspace. Robots encouraging innovation, critical thinking, imagination and inventive
such as drones can be made available as a problem solving without the need for permanently dedicated space or
borrowable item. a budget for materials. Each crate includes the core materials necessary
to deliver STEM-based activities and ranges from low-tech items such
It seems likely that robots developed for
as modelling clay and Lego to high-tech items such as Raspberry Pis,
very generic tasks in other sectors, rather
electronic kits and a 3D printer. There are four types of positive impact:
than robots for library specific tasks, will
firstly, in enthusing children and their parents in STEM, including,
be the first to reach a scale of production
potentially, robots and AI; secondl,y in presenting the library service
where they become cheap enough for
as a place where people are encouraged to create, experiment and
widespread use.
explore ideas; thirdly, in boosting library staff skills and confidence in
dealing with topics such as AI; and fourthly in creating collaborations
with new partners.
This is an obvious role for public libraries Appendices 1 and 2 summarise respectively
since their constituency is the whole Long and Magerko’s (2020) definition of the
population. Makerspaces are the ideal dimensions of AI literacy (which includes
place to promote in-depth exploration, understanding of robotics) and Prado
but less intensive introductions would also and Marzo’s (2013) data literacy definition
be relevant. But given the wide and deep (others are available).
impacts of AI and data, all information
services could arguably integrate AI
and data literacy into their information
literacy training.
The challenges
and opportunities
licensing, copyright and IPR data disappear”. Even in the public sector
there also remain challenges around the
AI solutions (and data) often carry a openness of data.
significant premium (Carter, 2019).
In all sectors data that is available is often
Cost is certainly an issue in the public fragmented and poorly described and
sector. The vendor solutions are perceived organised. But for it to be used by AI it does
to be very expensive and there is an need to be managed better. This implies
interest in open source and do-it-yourself a need for organisations to improve their
solutions, though of course these have data management in order to leverage AI.
different types of resource implications.
In the academic sector, AI could be applied
Data processing and infrastructure costs
to historic special collections, but these
of AI could be high.
are very heterogenous (Cordell, 2020).
The algorithms for image recognition AI. But this implies understanding how to
trained on contemporary images do not motivate potential participants. So it is far
necessarily work for historical images. As from being purely an IT infrastructure that
a result, off-the-shelf algorithms cannot be is needed.
AI has only easily applied. Training data is needed and
The workflow and infrastructure challenges
been adopted in fact, in all contexts, new types of analysis
imply the need for information architects.
on particular genre of text will require a
in a few lengthy process of training.
specific areas
In the research library context the
4.6 The post-implementation
at any scale. provenance of the collections being challenges: the need for support,
The unex- used as data has to be understood for
plainability the results of analysis through AI to be training and promotion
and “opacity” understandable. “Biases” in how historic Gaining acceptance for AI remains an issue
collections were built up and also in what to be tackled. AI has only been adopted in a
of AI is has been chosen to be digitised in the few specific areas at any scale.
problematic past need to be taken into account in
In many sectors, there are strong cultural
for its interpreting analysis.
barriers to adopting AI because many
acceptance AI also creates derivative data. For example, professional groups feel threatened by it
and use. in a research library context, AI use on and motivated to be doubtful that it can
digitised collections creates new descriptive deliver promises to automate tasks that
data. Reintegrating this into discovery have always before been seen as requiring
systems poses issues of quality. human skills to undertake.
The data challenges create an opportunity The unexplainability and “opacity” of AI
for information professionals in terms of is problematic for its acceptance and use
data governance and management and (Burrell, 2016). If it is hard to understand
long term preservation that might be how a result was produced, there will be
summarised in the term ‘data stewardship’. scepticism about how far to trust it. Many
This builds on existing expertise in the information uses require replication: this
information profession in information can be hard where machine learning is
governance and information description involved. In other contexts, there may be
(taxonomy building, use of standards). a temptation to rely too heavily on the
Since the data is often information content, outputs of AI without real understanding
collection skills are also highly relevant. of how it works and its limitations (Baker,
2018). Critical information literacy
4.5 The technical infrastructure remains relevant.
and workflow challenges: the There are many training needs (Carter,
2018). Domain experts need basic training
need for information architecting in how to use AI. AI is not advanced enough
to work on its own: there remains a need
In many industries current processes are
for there to be a human in the loop. There
complex and have strong local adaptations,
is also a need for trusted intermediaries
involve lots of human judgement and
who can interpret results. In the academic
remain somewhat undocumented. They
sector, while there appears to be widening
have not been looked at with an eye to
interest in using AI across all disciplines,
routinise them. In the commercial sector,
the skills are lacking to do the analysis and
offshored activities may be more routinised
interpret the results.
and so more open to AI applications, such
as RPA. There is much to be learned about So, there are emerging needs for support,
the implementation challenges (Pelz-Shapre training and the promotion of AI. These
& Kompella, 2019). post-implementation challenges create an
opportunity for information professionals
In the research library context, building
to facilitate the development of AI and its
workflows for AI systems is also a challenge.
implementation, e.g. through support,
It may be that libraries themselves have to
training and promotion. This builds
create an infrastructure for text and data
on existing roles of the same sort in
mining. In the special collection context, it
supporting information use and promoting
is also recognised that there is a value in
information services.
integrating crowdsourcing, e.g. to help train
4.7 The ethical use challenge: There is a great potential for bias in
training data which can lead to algorithms
the need for responsible use simply repeating past errors. The way that
Competencies needed
to take the opportunities
presented by AI and robots
Increasing understanding Organisations maximise Teaching and training skills, Information Lack of current
of AI, robots and data. the effective and safe use supporting users (Literacies literacy and understanding
of AI and robots. and learning). digital skills amongst users.
training.
Leadership – having The organisation Leadership skills, strategic Leadership. Hard to predict what
a vision for how the positions itself to benefit thinking and evaluation the technologies
information service/ strategically from the (Leadership and advocacy), will really do and in
organisation / society technologies, it identifies Business planning (Strategy, complex environment
can best use AI and where the technology planning and management) where main benefit
be involved in wider can have an impact in and communicating with will be; need a clear
AI use. either transforming stakeholders (Customer understanding of
the business or simply focus, service design the potential.
through improved and marketing).
efficiency.
Collaboration
Partnership creation Rich and supportive Partnership development Collaboration. Differing expectations;
for multi-disciplinary collaborations (Leadership and advocacy) internal politics.
teams of domain experts, come together to and communicating with
IT and other services reap the benefits stakeholders (Customer
to support AI. of the technologies. focus, service design
and marketing).
Procurement
Copyright – The issues around IPR Information rights Copyright role. The law may lag behind
understanding what are managed. (Information governance technologies.
is legally allowed. and compliance).
Licensing of content/ Appropriate licenses are Copyright, intellectual Licensing New layer of concerns
data – negotiating/ negotiated and their property and licensing of content. in examining licences.
checking licences and terms are understood (Information governance
publicising licence terms. by users. and compliance).
Licensing of tools The process of choosing Contract management Procurement Novel types of system
or algorithms. and procuring tools (Strategy, planning and of systems. (or data?) require new
successfully identifies management). understanding; wide
effective and safe range of tools available;
technologies. complex issues in
vendor relations
(Carter, 2019).
Data stewardship
Data governance. The risk and benefit Information governance Information Ensuring compliance.
from data are effectively and compliance. governance.
managed.
Curation and Long term benefits Digitisation, curation and Digital Technological
preservation of content are derived from data preservation (Records preservation. uncertainty; lack of
created by AI such as produced by AI. management and archiving). a digital preservation
derivatives. culture.
Data analysis, including Data is used for useful Data analytics (Using Domain experts, data
visualisation. analytic descriptions and and exploiting knowledge scientists and AI experts
predictions. and information). more likely to lead on
this but some level of
understanding useful.
Technical infrastructure
Infrastructure Effective, reliable, secure Information architecture Management IT team more obvious
building/ procurement, and safe technologies are (Organising knowledge and of information to manage.
e.g. storage for scale assembled for AI to work information), information content specific
and heterogeneity within. retrieval (Using and infrastructure.
of data. exploiting knowledge
and information), data
management (Knowledge
and information
management), ICT skills (IT
and communication).
Workflow design, Workable workflows Information architecture Procurement IT team more obvious
including reintegration are created. (Organising knowledge of systems. to manage.
of derived data into and information), data
discovery systems. management (Knowledge
and information
management), ICT skills
(IT and communication).
Skills development Users are supported Teaching and training Training. Diverse ways machine
for users. to acquire the skills skills, supporting users learning used makes
they need to manage (Literacies and learning). this challenging.
data and use AI tools.
Users such as
researchers may not
expect to come to the
library for this training.
Curation and Long term benefits Digitisation, curation and Digital Technological
preservation of content are derived from data preservation (Records preservation. uncertainty; lack of
created by AI such as produced by AI. management and archiving) a digital preservation
derivatives. culture.
Marketing new tools Users are aware of the Strategic marketing Marketing of Finding bandwidth.
to users. changing technology (Customer focus, service new information
options available. design and marketing). services and
products.
Responsible use
Ethics and values – Professional values Access to knowledge, Access to The way familiar issues
ensuring that AI uses and ethics are respected intellectual freedom, knowledge. are manifested may
are ethical and in in every aspect of information skills and be shifting.
tune with human and technology use. information literacy
professional values.
Research ethics
(Research skills).
will be those which computers remain many tools available will be useful, be that
poor at. WEF (Gray, 2016; World Economic to support users or to apply them directly
Forum, 2020) suggest that the following are to their own professional work.
areas needed:
In terms of exploiting data science methods
o Complex problem solving for information service work, there will be
o Critical thinking strong competition for people with these
o Creativity competencies, so it may be hard to recruit
or even retain talent in this area (Markow
o People management et al., 2017).
o Coordinating with others
The Edison project (Demchenko et al.,
o Emotional intelligence 2017) explored skillsets in the broad
o Judgement and decision making data science area and drew a useful
o Service orientation distinction between data analytics (see
o Negotiation Appendix 4: Data Science Data Analysis
skills), data management skills (Appendix
o Cognitive flexibility 5: Data Science Data Management skills)
This listing reflects the ability to apply and a number of other skillsets, such as
intelligence in a broad way (in contrast engineering for data science. In reality, as
to the narrow AI achieved by computers) they recognize, roles might sit across these
and also to have the soft, human skills divides: data science is a spectrum (Burton
that computers are unlikely to develop in et al., 2017). Information professionals need
the near future. In this context the skills to be more “data savvy” in general and
many information professionals possess some information professionals will acquire
in collaboration, influencing and negotiating data analytics skills. However, it seems
seem to be very valuable. Arguably the reasonable to position most information
PKSB does not currently fully represent professionals as most likely to work more
some of these higher order cognitive skills; in the data management than data analysis
for example, complex problem solving, area, as defined by the Edison project.
critical thinking and creativity are not
As has already been established, the
presently identified as professional skills.
current wave of excitement around AI was
The information profession is so diverse preceded by a wave of interest in “big data”,
and positioned so differently that it is because AI requires large volumes of data.
hard to generalise about what skills will As has also been explained, the range of
be needed, but computational sense, data what is called data has expanded. In AI
science skills, fusion skills and these soft there is often training data, input data
skills suggest directions of travel. and derived data to manage. As a result,
a core aspect of most AI applications is
5.3 Data science and data data governance. Since data is closely
related to information, data governance
stewardship and management feels very much within
the scope of information professionals.
Use of AI tools to support data analysis has
They can certainly contribute a unique
created a new form of analyst role, the data
perspective in the context of multi-
scientist. Data science uses computational
stakeholder data governance (Wendehorst,
methods (such as those discussed in 4.1
2020). Articulating this clearly is important
above) to derive new knowledge from data.
to taking the opportunities presented by
Data scientists tend to have a combination
AI. Developments in some information
of statistical skills, computational skills and
sectors, such as around research data
domain knowledge. Some might sit more
management in the academic sector, give
at the analysis end, some more at the end
us some sense of what this might look
of business need, and some bridge or
like. A lot here may turn on how involved
translate between the two (Henke et al.,
information professionals are already in
2018; Open Data Institute, 2020a). Data
the data management in their organisation.
visualisation is also part of what they do.
In some cases the data is library collections
This might be an area where information
or know-how managed by an information
professionals need to reposition their skills
service. In other cases there may be other
base. Some understanding of the principles
stakeholders who currently manage data.
involved and hands-on experience of the
Recommendations
Recommendation 3
CILIP with other professional bodies
can foster knowledge sharing across the
profession, and with adjacent professions,
through events and curating relevant
open learning resources. CILIP special
interest groups offer a locus for developing
communities of practice to support learning
about the new technologies.
Recommendation 11
There is a need for hands-on experiences
with AI applications in practice.
References
67 bricks. (2017). Content Enrichment : An essential Burton, M., Lyon, L., Erdmann, C., & Tijerina, B.
strategic capability for every publisher. www.67bricks. (2017). Shifting to Data Savvy: The Future of Data
com/content-enrichment-an-essential-strategic- Science In Libraries. d-scholarship.pitt.edu/33891.
capability-for-every-publisher. Callister, P. D. (2020). Law, artificial intelligence,
AIIM. (n.d.). Robotic Process Automation ( RPA ) for and natural language processing: A funny thing
Information Management Professionals: Is RPA Right happened on the way to my search results.
for My Business? info.aiim.org/is-rpa-right-for-my- Law Library Journal, 112(2), 161–212. ssrn.com/
business. abstract=3712306.
AIIM. (2019). 10 Things You Need to Know About Carter, D. (2018). How real is the impact of artificial
Process Services and RPA. info.aiim.org/10- intelligence? The business information survey
things-you-need-to-know-about-process- 2018. Business Information Review, 35(3), 99–115.
automation-and-rpa. doi.org/10.1177/0266382118790150.
AIIM Industry Watch. (2018). Leveraging Deep Carter, D. (2019). New technologies and new
Learning and Machine Learning Capabilities. data sources: The business information survey
info.aiim.org/leveraging-deep-learning-and- 2019. Business Information Review, 36(3), 95–105.
machine-learning-capabilities. doi.org/10.1177/0266382119871848.
AINow. (2019). 2019 report. ainowinstitute.org/ Carter, D. (2020). Regulation and ethics in
AI_Now_2019_Report.pdf. artificial intelligence and machine learning
Anderson, C. B., & Craiglow, H. A. (2017). Text technologies: Where are we now? Who is
mining in business libraries. Journal of Business and responsible? Can the information professional play
Finance Librarianship, 22(2), 149–165. doi.org/10.108 a role? Business Information Review, 37(2), 60–68.
0/08963568.2017.1285749. doi.org/10.1177/0266382120923962.
Anton, E., Behne, A., & Teuteberg, F. (2020). Caspers, M., Guibault, L., McNeice, K., Piperidis,
The humans behind artificial intelligence – an S., Pouli, K., Eskevich, M., & Gavrilidou, M. (2015).
operationalisation of AI competencies. Twenty-Eighth Reducing barriers and increasing uptake of text and
European Conference on Information Systems data mining for research Baseline report of policies
(ECIS2020), June. aisel.aisnet.org/ecis2020_rp/141. and barriers of TDM in Europe ( extended version
of D3 . 3). project.futuretdm.eu/wp-content/
Association of Research Libraries. (2019). Research uploads/2017/05/FutureTDM_D3.3-Baseline-Report-
Library Issues RLI 299: Ethics of Artificial Intelligence. of-Policies-and-Barriers-of-TDM-in-Europe.pdf.
publications.arl.org/rli299.
Centre for Data Ethics and Innovation. (2020).
Baker, J. J. (2018). 2018: A legal research odyssey: Review into bias in algorithmic decision-making.
Artificial intelligence as disruptor. Law Library assets.publishing.service.gov.uk/government/
Journal, 110(1), 5–30. doi.org/10.2139/ssrn.2978703. uploads/system/uploads/attachment_data/
Bakke, A. (2020). Everyday Googling: Results file/938850/Main_Report_-_CDEI_Review_into_Bias_
of an Observational Study and Applications for in_Algorithmic_Decision-Making.pdf.
Teaching Algorithmic Literacy. Computers and CILIP. (2018). Ethical Framework. www.cilip.org.uk/
Composition, 57, 102577. doi.org/10.1016/j. general/custom.asp?page=ethics.
compcom.2020.102577.
Cook, M., & Van der Veer Martens, B. (2019).
Barr, D., Harrison, J., & Conery, L. (2011). Managing Exploratory Units in Academic Libraries.
Computational Thinking: A Digital Age Skill for Journal of Library Administration, 59(6), 606–628.
Everyone. Learning and Leading with Technology, doi.org/10.1080/01930826.2019.1626647.
38(6), 20–23. eric.ed.gov/?id=EJ918910.
Coombs, C., Hislop, D., Taneva, S. K., & Barnard,
Beller, E., Clark, J., Tsafnat, G., Adams, C., Diehl, S. (2020). The strategic impacts of Intelligent
H., Lund, H., Ouzzani, M., Thayer, K., Thomas, J., Automation for knowledge and service work:
Turner, T., Xia, J., Robinson, K., Glasziou, P., Ahtirschi, An interdisciplinary review. Journal of Strategic
O., Christensen, R., Elliott, J., Graziosi, S., Kuiper, J., Information Systems, July 2017, 101600.
Moustgaard, R., … Wedel-Heinen, I. (2018). Making doi.org/10.1016/j.jsis.2020.101600.
progress with the automation of systematic reviews:
Principles of the International Collaboration for Cordell, R. (2020). Machine Learning + Libraries:
the Automation of Systematic Reviews (ICASR). A report on the state of the field. labs.loc.gov/static/
Systematic Reviews, 7(1), 1–7. doi.org/10.1186/ labs/work/reports/Cordell-LOC-ML-report.pdf?.
s13643-018-0740-7. Cox, A. M. (2021). Exploring the impact of Artificial
British Computing Society. (2020). AI and your job. Intelligence and robots on Higher Education
doi.org/10.1093/itnow/bwaa063. through literature-based design fictions.
International Journal of Educational Technology in
Brooke Robertshaw, M., & Asher, A. (2019). Higher Education. doi.org/doi.org/10.1186/s41239-
Unethical numbers? A meta-analysis of library 020-00237-8.
learning analytics studies. Library Trends, 68(1),
76–101. doi.org/10.1353/lib.2019.0031. Cox, A. M., Kennan, M. A., Lyon, L., Pinfield, S.,
& Sbaffi, L. (2019). Maturing research data services
Burrell, J. (2016). How the machine ‘thinks’: and the transformation of academic libraries.
Understanding opacity in machine learning Journal of Documentation, 75(6). doi.org/10.1108/JD-
algorithms. Big Data and Society, 3(1), 1–12. 12-2018-0211.
doi.org/10.1177/2053951715622512.
Cox, A. M., Pinfield, S., & Rutter, S. (2019). The
intelligent library. Library Hi Tech, 37(3), 418–435.
doi.org/doi.org/10.1108/LHT-08-2018-0105.
Jones, K. M. L., Briney, K. A., Goben, A., Salo, D., McNeal, M., & Newyear, D. (2013). Introducing
Asher, A., & Perry, M. R. (2020). A comprehensive chatbots in libraries. Library Technology Reports,
primer to library learning analytics practices, 49(8), 5–10.
initiatives, and privacy issues. College and Research McNeice, K. (2017a). Skills and education: The need
Libraries, 81(3), 570–591. doi.org/10.5860/ for strategic thinking. FutureTDM Symposium.
crl.81.3.570.
McNeice, K. (2017b). Text and Data
Jones, K. M. L., & Salo, D. (2018). Learning Mining: The Changing Nature of Research.
analytics and the academic library: Professional FutureTDM Symposium.
ethics commitments at a crossroads. College and
Research Libraries, 79(3), 304–323. doi.org/10.5860/ Millard, T., Synnot, A., Elliott, J., & Turner, T.
crl.79.3.304. (2018). Results from the evaluation of the pilot living
systematic reviews. community.cochrane.org/sites/
Jonnalagadda, S. R., Goyal, P., & Huffman, M. D. default/files/uploads/inline-files/Transform/201905
(2015). Automating data extraction in systematic LSR_pilot_evaluation_report.pdf.
reviews: A systematic review. Systematic Reviews,
4(78). doi.org/10.1186/s13643-015-0066-7. Min-Allah, N., & Alrashed, S. (2020). Smart
campus—A sketch. Sustainable Cities and Society, 59.
Kay, D. (2018). Report on Artificial Intelligence doi.org/10.1016/j.scs.2020.102231.
technologies and implications for freedom of
expression and the information environment. Moore, M. T. (2017). Constructing a sentiment
www.ohchr.org/EN/Issues/FreedomOpinion/Pages/ analysis model for LibQUAL+ comments.
ReportGA73.aspx. Performance Measurement and Metrics, 18(1), 78–87.
doi.org/10.1108/PMM-07-2016-0031.
Kelly, J. (2016). THE TEXT AND DATA MINING
COPYRIGHT EXCEPTION: BENEFITS AND Mordell, D. (2019). Critical questions for archives as
IMPLICATIONS FOR UK HIGHER EDUCATION. (big) data. Archivaria, 87(5), 140–161. archivaria.ca/
www.jisc.ac.uk/guides/text-and-data-mining- index.php/archivaria/article/view/13673.
copyright-exception. Murphy, O., & Villaespesa, E. (2020). The Museums +
Kroski, E. (2020). Law librarianship in the age of AI. Ai Network. themuseumsainetwork.files.wordpress.
American Library Association. com/2020/02/20190317_museums-and-ai-toolkit_
rl_web.pdf.
Lewis, V., Spiro, L., Wang, X., & Cawthorne, J.
E. (2015). Building Expertise to Support Digital Nath, F. (2018). Library drone delivery programme:
Scholarship: A Global Perspective. In CLIR Publication A study. DESIDOC Journal of Library and Information
(Issue 168). www.clir.org/pubs/reports/pub168/ Technology, 38(5), 350–354. doi.org/10.14429/
pub168. djlit.38.5.12892.
Liau, Y. chin. (2019). Transforming library operation Neary, M. A., & Chen, S. X. (2017). Artificial
with robotics. August, 1–5. creativecommons.org/ intelligence: Legal research and law librarians.
licenses/by/4.0. AALL Spectrum 21, 5(6), 16–20. lawdigitalcommons.
bc.edu/lsfp/1073.
LIBER. (2020). Research Libraries: How You Can
Support Text and Data Mining. NHSX. (2019). Artificial Intelligence: How to get it
right. Putting policy into practice for safe data-driven
Long, D., & Magerko, B. (2020). What is innovation in health and care. www.nhsx.nhs.uk/
AI literacy? Competencies and design media/documents/NHSX_AI_report.pdf.
considerations. Conference on Human Factors
in Computing Systems - Proceedings, 1–16. O’Connor, A. M., Tsafnat, G., Gilbert, S. B., Thayer,
doi.org/10.1145/3313831.3376727. K. A., Shemilt, I., Thomas, J., Glasziou, P., & Wolfe,
M. S. (2019). Still moving toward automation of
Lowendahl, J.-M., & Calhoun Williams, K. the systematic review process: A summary of
(2018). 5 best practices for artificial intelligence discussions at the third meeting of the International
in higher education. Collaboration for Automation of Systematic
Ma, L., & Sun, B. (2020). Machine learning and AI Reviews (ICASR). Systematic Reviews, 8(1), 1–5.
in marketing – Connecting computing power to doi.org/10.1186/s13643-019-0975-y.
human insights. International Journal of Research Oakleaf, M. (2018). The problems and promise of
in Marketing, 37(3), 481–504. doi.org/10.1016/j. learning analytics for increasing and demonstrating
ijresmar.2020.04.005. library value and impact. Information and Learning
Markow, W., Braganza, S., Taska, B., Miller, S. M., Science, 119(1–2), 16–24. doi.org/10.1108/ILS-08-
& Hughes, D. (2017). The Quant Crunch: How the 2017-0080.
Demand for Data Science Skills is Disrupting the Job OECD. (2019). Artificial intelligence in society.
Market. www.burning-glass.com/research-project/ www.oecd-ilibrary.org/science-and-technology/
quant-crunch-data-science-job-market. artificial-intelligence-in-society_eedfee77-en.
Maxwell, D., Norton, H., & Wu, J. (2018). The Data Oliver, J. C., Kollen, C., Hickson, B., & Rios, F. (2019).
Science Opportunity: Crafting a Holistic Strategy. Data Science Support at the Academic Library.
Journal of Library Administration, 58(2), 111–127. Journal of Library Administration, 59(3), 241–257.
doi.org/10.1080/01930826.2017.1412704. doi.org/10.1080/01930826.2019.1583015.
Mckinsey Analytics. (2019). Global AI Survey: AI Open Data Institute. (2020a). Data Skills Framework.
proves its worth, but few scale impact. In McKinsey theodi.org/article/data-skills-framework.
&Company (Issue November). www.mckinsey.com/
featured-insights/artificial-intelligence/global-ai- Open Data Institute. (2020b). Getting data right:
survey-ai-proves-its-worth-but-few-scale-impact. Perspectives on the UK National Data Strategy 2020.
library1.nida.ac.th/termpaper6/sd/2554/19755.pdf.
McKinsey Global Institute. (2018). Skill Shift:
Automation and the Future of the Workforce Padilla, T. (2019). Responsible operations: Data
(Discussion Paper, May 2018). In McKinsey science, machine learning, and AI in libraries. In
&Company (Issue May). www.mckinsey.com/~/ OCLC. doi.org/10.25333/xk7z-9g97.
media/McKinsey/Featured Insights/Future of Pelz-Shapre, A., & Kompella, K. (2019). Practical
Organizations/Skill shift Automation and the future artificial intelligence: An enterprise playbook.
of the workforce/MGI-Skill-Shift-Automation-and- Deep Publishing.
future-of-the-workforce-May-2018.ashx.
Prado, J. C., & Marzal, M. Á. (2013). Incorporating Tammaro, A. M., Matusiak, K. K., Sposito, F. A.,
data literacy into information literacy programs: & Casarosa, V. (2019). Data curator’s roles and
Core competencies and contents. Libri, 63(2), responsibilities: An international perspective. Libri,
123–134. doi.org/10.1515/libri-2013-0010. 69(2), 89–104. doi.org/10.1515/libri-2018-0090.
Ridge, M. (2019). Operationalising AI at a national Tella, A. (2020). Robots are coming to the libraries:
library. Museums + AI Network. www.slideshare.net/ Are librarians ready to accommodate them? Library
miaridge/operationalising-ai-at-a-national-library. Hi Tech News, 37(8), 13–17. doi.org/10.1108/LHTN-
Rowley, J. (2011). Innovation for survival: From 05-2020-0047.
cooperation to collaboration. In Advances in Thelwall, M. (2018). Artificial Intelligence,
Librarianship (Vol. 34). Emerald Group Publishing Automation and Peer Review. In Jisc. link.springer.
Ltd. doi.org/10.1108/S0065-2830(2011)0000034013. com/article/10.1007/BF02016664.
Sander, I. (2020). Critical big data literacy tools— Tsai, Y. S., & Gasevic, D. (2017). Learning analytics
Engaging citizens and promoting empowered in higher education - Challenges and policies: A
internet usage. Data & Policy, 2. doi.org/10.1017/ review of eight learning analytics policies. ACM
dap.2020.5. International Conference Proceeding Series, 233–242.
Schlegel, D., & Kraus, P. (2020). Robotic process doi.org/10.1145/3027385.3027400.
automation as an emerging career opportunity: Twidale, M. B., & Nichols, D. M. (2006).
An analysis of required qualifications and skills. Computational sense: The role of technology in the
13th Annual Conference of the EuroMed Academy education of digital librarians. researchcommons.
of Business, 1083–1097. publikationen.reutlingen- waikato.ac.nz/handle/10289/40.
university.de/frontdoor/deliver/index/docId/2815/ Vincze, J. (2017). Virtual reference librarians
file/2815.pdf. (Chatbots). Library Hi Tech News, 34(4), 5–8.
Semeler, A. R., Pinto, A. L., & Rozados, H. B. doi.org/10.1108/LHTN-03-2017-0016.
F. (2019). Data science in data librarianship: Vlachos, E., Faber Hansen, A., & Povl Holck, J.
Core competencies of a data librarian. Journal (2020). A robot in the library. In M. Rauterberg
of Librarianship and Information Science, 51(3), (Ed.), Culture and Computing (pp. 312-). Springer.
771–780. doi.org/10.1177/0961000617742465. doi.org/10.1007/978-3-030-50267-6.
Shih, W. (2020). Voice revolution. Library Wagstaff, K. L., & Liu, G. Z. (2018). Automated
Technology Reports, 56(4), 5–13. classification to improve the efficiency of
Smith, A. (2016). Big Data technology, evolving weeding library collections. Journal of Academic
knowledge skills and emerging roles. Legal Librarianship, 44(2), 238–247. doi.org/10.1016/j.
Information Management, 16(4), 219–224. doi.org/ acalib.2018.02.001.
doi.org/10.1017/S1472669616000499. Wang, M. (2013). Supporting the research process
Stanfill, M. H., & Marc, D. T. (2019). Health through expanded library data services. Program,
information management: Implications of artificial 47(3), 282–303. doi.org/10.1108/PROG-04-2012-
intelligence on healthcare data and information 0010.
management. Yearbook of Medical Informatics, 28(1), Wendehorst, C. (2020). Framework paper for GPAI
56–64. doi.org/10.1055/s-0039-1677913. work on data governance.
Stuart, D. (2020). Data science for information Wing, J. M. (2006). Computational thinking.
professionals. Facet Publishing. Communications of the ACM, 49(3), 33–35.
Syed, R., Suriadi, S., Adams, M., Bandara, W., doi.org/10.1145/1999747.1999811.
Leemans, S. J. J., Ouyang, C., ter Hofstede, A. H. M., World Economic Forum. (2020). The 10 skills you
van de Weerd, I., Wynn, M. T., & Reijers, H. A. (2020). need to thrive in the Fourth Industrial Revolution.
Robotic Process Automation: Contemporary themes www.weforum.org/agenda/2016/01/the-10-
and challenges. Computers in Industry, 115, 103162. skills-you-need-to-thrive-in-the-fourth-industrial-
doi.org/10.1016/j.compind.2019.103162. revolution.
Tam, W. (2020). The rise of the robots: Using robotic Zheng, W. (2019). How do library staff view librarian
process automation in bibliometrics. tv.theiet. robotics? Librarian staff’s ignored humanistic views
org/?videoid=14133. on the impact and threat of robotics adoption. IFLA
Tambe, P., Cappelli, P., & Yakubovich, V. (2019). WLIC 2019, 1–17. library.ifla.org/2751/1/s02-2019-
Artificial intelligence in human resources wang-en.pdf.
management: Challenges and a path forward.
California Management Review, 61(4), 15–42.
doi.org/10.1177/0008125619867910.
Appendixes
APPENDIX 3 APPENDIX 4
Fusion Skills Edison project
(Daugherty & Wilson, 2018) (Demchenko et al., 2017):
Rehumanizing time Data science data analytics
The ability to increase the time available for DSDA01 Effectively use variety of data
distinctly human tasks like interpersonal analytics techniques, such as Machine
interactions, creativity, and decision making Learning (including supervised,
in a reimagined business process. unsupervised, semisupervised learning),
Data Mining, Prescriptive and Predictive
Responsible normalizing
Analytics, for complex data analysis through
The act of responsibly shaping the purpose
the whole data lifecycle.
and perception of human-machine
interaction as it relates to individuals, DSDA02 Apply designated quantitative
businesses, and society. techniques, including statistics, time series
analysis, optimization, and simulation to
Judgment integration deploy appropriate models for analysis
The judgment-based ability to decide and prediction.
a course of action when a machine is
DSDA03 Identify, extract, and pull together
uncertain about what to do.
available and pertinent heterogeneous
Intelligent interrogation data, including modern data sources
Knowing how best to ask questions such as social media data, open data,
of AI, across levels of abstraction, to get governmental data.
the insights we need. DSDA04 Understand and use different
performance and accuracy metrics
Bot-based empowerment for model validation in analytics
Working well with AI agents to projects, hypothesis testing, and
extend your capabilities, and create information retrieval.
superpowers in business processes and
professional careers. DSDA05 Develop required data analytics
for organizational tasks, integrate data
Holistic melding analytics and processing applications
The ability to develop robust mental into organization workflow and business
models of AI agents to improve business processes to enable agile decision making.
process outcomes.
DSDA06 Visualise results of data
Reciprocal apprenticing analysis, design dashboard and use
1. Performing tasks alongside AI agents storytelling methods.
so they can learn new skills;
2. On-the-job training for people
so they can work well within
AI-enhanced processes.
Relentless reimagining
The rigorous discipline of creating new
processes and business models from
scratch, rather than simply automating
old processes.
APPENDIX 5 APPENDIX 6
Edison project DAMA DMBok2
(Demchenko et al., 2017): (DAMA International, 2012)
Data science data management Data governance
DSDM Develop and implement data planning, oversight, and control over
management strategy for data collection, management of data and the use of data and
storage, preservation, and availability for data-related resources. While we understand
further processing. that governance covers ‘processes’,
not ‘things’, the common term is Data
DSDM01 Develop and implement data Governance, and so we will use this term.
management strategy for data collection,
storage, preservation, and availability for Data architecture
further processing. the overall structure of data and
DSDM02 Develop and implement relevant data-related resources as an integral part
data models, define metadata using of the enterprise architecture.
common standards and practices, for Data modeling & design
different data sources in variety of scientific
analysis, design, building, testing,
and industry domains.
and maintenance.
DSDM03 Integrate heterogeneous data
from multiple source and provide them Data storage & operations
for further analysis and use. structured physical data assets storage
deployment and management.
DSDM04 Maintain historical information
on data handling, including reference to Data security
published data and corresponding data ensuring privacy, confidentiality and
sources (data provenance). appropriate access.
DSDM05 Ensure data quality, accessibility, Data Integration & Interoperability –
interoperability, compliance to standards, acquisition, extraction, transformation,
and publication (data curation). movement, delivery, replication, federation,
DSDM06 Develop and manage/supervise virtualization and operational support.
policies on data protection, privacy, IPR
Documents & content
and ethical issues in data management.
storing, protecting, indexing, and enabling
access to data found in unstructured
sources (electronic files and physical
records), and making this data available
for integration and interoperability with
structured (database) data.
Metadata
collecting, categorizing, maintaining,
integrating, controlling, managing,
and delivering metadata.
Data quality
defining, monitoring, maintaining data
integrity, and improving data quality.
Resources
In addition to the extensive references quoted
in the report, of particular value for information
professionals interested in this area, the following
introductory learning resources are recommended:
Stuart, D. (2020). Practical Data Science for
Information Professionals. Facet Publishing.
Foster project, www.fosteropenscience.eu.
The carpentries, carpentries.org.
AI4K12,
github.com/touretzkyds/ai4k12/wiki.
Alan Turning Institute, www.turing.ac.uk.
ADA Lovelace Institute,
www.adalovelaceinstitute.org.