Algorithms in Government
Algorithms in Government
spotlight
Algorithms in Government
A Magic Formula or a Divisive Force?
Prasanna Lal Das
November 2022
Key Insights
1. Algorithmic decision-making is becoming for fairness, transparency, and
prevalent in the public sector worldwide, accountability in algorithmic decision-
and governments in developing countries are making.
increasingly beginning to deploy algorithms to e. Developing countries are dependent
deliver citizen and business services as part of on international data infrastructure to
their digital transformation agenda. develop and manage their algorithms.
f. Developing countries deploying
2. Many algorithmic decision-making initiatives in algorithmic decision-making are
developing countries are still at an early stage, dependent on big tech companies but
as the case studies in this issue brief suggest. have little leverage over them.
The examples featured in this brief are local
and carefully designed, with data governance 4. Opportunities to address these specific data
challenges such as privacy and data security in governance challenges are emerging, including:
mind.
a. Create regional or other data alliances
3. Developing countries face several distinct data to tackle relevant data governance
governance challenges related to the design and challenges.
implementation of algorithmic decision-making b. Focus on cases that don’t depend on
services. personal data to deliver relevant services
to citizens and businesses.
a. Institutions in developing countries have c. Keep the emphasis on people, both as
an extreme legitimacy, accountability, and designers and supervisors of algorithms
transparency problem. and as consumers of algorithmic services.
b. Poor local data means that people in
developing countries are inadequately 5. Many additional data governance challenges
represented in training data. posed by algorithmic decision-making can be
c. People in developing countries have less addressed as part of a country’s overall digital
experience in interacting with machines and transformation agenda. These are not the focus
algorithms, and there’s a scarcity of data in of this issue brief but include themes such as the
local languages to close the cultural gap. overall legal/regulatory/enabling environment,
d. Developing countries have had limited infrastructure development, financing, capacity/
involvement in developing standards skills development, and institutional support.
1
DIGITAL IMPACT ALLIANCE | 2022
SECTION 1
Introduction
It’s easy to be seduced by the power of algorithms
to deliver public services. Do you want to target
beneficiaries of government programs and
services precisely and accurately? Well, there’s
an algorithm for that.1 This is also true for real-
time monitoring of resources,2 personalization of
government interactions, fraud and corruption
prevention,3 anticipation (if not outright
prediction) of events and behavior,4 and more.
In such instances, algorithms seem like a magic
formula that can crack some of government’s
most persistent problems.
2
DIGITAL IMPACT ALLIANCE | 2022
3
DIGITAL IMPACT ALLIANCE | 2022
SECTION 2
An algorithm is a step-by-step procedure to turn The terms algorithm and AI are often used
any given inputs into useful outputs. A computer interchangeably. In policy terms, it’s useful to think
algorithm follows a series of instructions to of algorithms—a form of automated instruction—
transform inputs (data) into outputs that can be as a subset of AI, which encompasses larger
used for making decisions, either by the computer socio-political and economic issues and a variety
system or a human. Many machine-learning of technical/scientific disciplines.
algorithms learn directly from data by identifying
patterns and relationships, without rules-based How do computer algorithms and humans
instructions from humans. interact?
The algorithms discussed in this paper focus on Humans interact with algorithms as designers and
systems that either augment or replace humans creators, embedding their socio-political value
for decision-making in the public sector. One systems into code; consumers and users who gain
basic example is an algorithm to determine value from the code and use services/products;
customs duty at an international border. If the and as sources of data whose actions serve
value of a shipment exceeds a certain threshold, as new data points or inputs for the algorithm.
apply a duty unless exporting to a neighboring Humans can also provide a point of control for
country. Determining eligibility for COVID vaccines the algorithm, either as testers or validators of
when they are scarce is an example of a more the decisions made by the algorithm. Algorithmic
complex algorithmic decision-making process, systems that act independently, without control
as it involves a greater number of variables with or supervision from humans, are considered
intricate, sometimes dynamic interrelationships. autonomous.
4
DIGITAL IMPACT ALLIANCE | 2022
SECTION 3
5
DIGITAL IMPACT ALLIANCE | 2022
6
DIGITAL IMPACT ALLIANCE | 2022
solution. The ministry selected chest X-rays for the design phase included health care experts,
the POC because they are typically the first tool lawyers, technical experts, representatives from the
for many diagnostics involving the heart, lungs, Ministry of Health, and other government officials
blood vessels, airways, and even bones of the from the Office of the Prime Minister. No separate
chest and spine. financing was provided by the government for the
POC.
The system works by having X-rays taken at
different clinics sent to the Central Radiology No technical or regulatory audits are planned for
Information System. Deep learning algorithms the POC. However, in line with the requirements
analyze and triage the X-rays, typically within expected to be established by the upcoming
one minute. All readings are then validated guidelines of trustworthy use of AI, the government
by a radiologist and compared to the original plans to conduct a questionnaire-based
radiologist reports. At the time of writing, more assessment of trustworthy AI that will also include
than 200 images had been read by algorithm and the stakeholders of the current POC.
cross-checked by radiologists. The accuracy rate
of one of the algorithms being tested was 71.4%, Policy/regulations/institutions
which was lower than the vendor had claimed and
lower than the solutions offered by other service Health care has been identified as a priority sector
providers. Results for the other algorithm were not in both the Strategy for the Development of Artificial
available at the time of writing. Intelligence in the Republic of Serbia for the period
2020-2025, as well as the accompanying action plan
Data and algorithms for 2020-2022. The strategy envisions a new Agency
for the National AI Program, which is expected to be
The data for the algorithms is sourced from established shortly. Serbia is also developing a Law
multiple health care institutions. It’s stored in on Ethical Use of Trustworthy AI that will provide
Serbia’s central healthcare information system. the legal framework for the current initiative as
A third-party vendor is responsible for the it’s scaled beyond the POC stage. Meanwhile, the
management of the system and must meet government is developing separate guidelines for
contractual obligations for the security and quality trustworthy AI.
of the data. The data and information about the
algorithms are currently not available publicly. The government has deemed the existing policy/
regulatory/institutional environment adequate for
Algorithms are used in multiple ways, such as: the POC. Among the issues considered were:
∙ Processing images to provide initial diagnosis ∙ Data protection and security – The contractual
∙ Triaging patients responsibility of the vendor managing the
central healthcare information system
Human interaction, oversight, redressal, and ∙ Privacy – All data is anonymized
stakeholder engagement
Opportunities and challenges
The system is not autonomous since radiologists
verify and validate all readings during the POC The system is not applicable in all patient contexts.
stage. The expectation is that the need for human For example, the system can’t analyze data from
verification will decline substantially after the patients who are unable to lie on their right side (PA)
system is put into production and there’s greater and left side (AP) during radiography. The project
confidence in the accuracy rates. team has also identified compliance with the law and
ethics as potential issues.
The POC has been implemented by two vendors
that offer slightly different algorithms, under the The project team considers data quality, financing,
supervision and operational responsibility of and an underdeveloped regulatory and institutional
the Ministry of Health and the Office for IT and apparatus to be the main challenges going forward.
eGovernment. Stakeholder consultations during It’s also important to ensure that all AI applications
within the country conform to EU standards.
7
DIGITAL IMPACT ALLIANCE | 2022
8
DIGITAL IMPACT ALLIANCE | 2022
SECTION 4
∙ Issue #1: Institutions in developing countries Issue #1: Institutions in developing countries
have an extreme legitimacy, accountability, have an extreme legitimacy, accountability,
and transparency problem. and transparency problem.
∙ Issue #2: Poor local data means that Governments, even in advanced economies,
the people of developing countries are recognize and grapple with the challenges of
inadequately represented in training data. legitimacy, accountability, and transparency of
algorithms. Part of the challenge is technical.
∙ Issue #3: People in developing countries have Algorithms, given their utilization of vast
less experience interacting with machines and computing power and their self-directed learning
algorithms, and there’s limited data in local abilities, are inherently difficult to audit, making
languages to close the cultural gap. it hard to trace their biases.25 Other challenges
are organizational and social. A recent paper26
∙ Issue #4: Developing countries have had provides a useful summary of the universal trust
limited involvement in developing standards questions that inevitably accompany algorithmic
for fairness, transparency, and accountability decision-making (e.g., disenfranchisement,
in algorithmic decision-making. disconnection, low traceability and explainability,
bias, poor quality, and reinforcement of power
∙ Issue #5: Developing countries are dependent inequalities) and proposes a helpful trust
on international data infrastructure to develop framework that outlines legal mandates and
and manage their algorithms. guidelines that governments should consider.
∙ Issue #6: Developing countries deploying The trust and legitimacy issues in developing
algorithmic decision-making are dependent countries cut deeper than in advanced
on big tech companies but have little leverage economies, which tend to have a longer tradition
over them. of accountability in government and a civil society
with greater power to interrogate government
decisions. For example, a recent study27 of Kenya,
India, Nigeria, South Africa, and the Philippines
found that existing institutions in these countries,
despite formal powers, routinely fail to protect
9
DIGITAL IMPACT ALLIANCE | 2022
against discrimination. Another recent report28 very narrow pool, often just a single hospital. A
found that in South Africa, algorithmic scoring recent Korean study32 found that only 6% of 516
technologies have “deep historical roots in racist reported studies tested their algorithm at more
social control” and “contemporary South Africa… than one hospital. Very few of these studies were
presents an especially stark illustration of …the conducted in developing countries or considered
‘New Jim Code.’” While the specific findings of the characteristics of their population.
the studies may be contestable, they do echo
many prevalent views. Many developing countries The accuracy rate of the algorithms can drop
have similar colonial legacies, and it’s probable significantly in different medical settings,
that their algorithmic decision-making apparatus depending on the characteristics of patients
is, knowingly or not, informed by discriminatory as well as extraneous factors like the brand of
power systems (e.g., male, gendered, white, equipment used.33 Implementers in developing
heteronormative, powerful, and Western).29 countries must be extremely cautious when
adopting off-the-shelf algorithms that may not
Policymakers in developing countries should have taken their local population characteristics
ensure that their algorithmic decision-making into account. The state-owned Institute for
is done by governmental and civil institutions Artificial Intelligence Research and Development
that are well-rooted in a culture of transparency of Serbia is developing its own algorithm to
and statistical analysis of the disparities replace those currently deployed by vendors.
faced by protected groups; include vigilant
nongovernmental actors attentive to algorithmic Issue #3: People in developing countries
decision-making; and support a reasonably have less experience interacting with
robust and proactive executive branch or an machines and algorithms, and there’s limited
independent office to police discrimination.30 data in local languages to close the cultural
The proposal to establish a new Agency for gap.
the National AI Program in Serbia is a welcome
step, as are plans in both Serbia and Turkey to The science and art of human-machine interaction
conduct regulatory and technical audits of their is evolving, and humans are still learning to work
implementations, but the challenges described with machines. In the Izmir case study, machines
above cannot be underestimated. and human operators form a team, sharing
workflows to achieve a common goal. Nontech
Local participation is another antidote to trust people, particularly in advanced economies,
and legitimacy challenges, but it’s unfortunately are gradually becoming more accustomed to
a known blind spot in the implementation of interacting with machines in their daily lives (e.g.,
many algorithms. For example, this study31 found robot vacuums, semi-autonomous vehicles,
little evidence of affected populations playing a
significant role in the design or management of
algorithms in the humanitarian sector.
10
DIGITAL IMPACT ALLIANCE | 2022
robots on factory floors, and digital assistants like at a “Stop” sign with specific visual features) or
Siri and Alexa) mostly without understanding any “procedural/conventional” (i.e., do not go to the
aspect of the black box algorithms behind the back of the store to pick up your package, wait for
machines. This inscrutability cuts both ways, and the store staff to bring it to you). Typical sources
machines, whose problem-solving techniques of such knowledge include written, video, and
are fundamentally different from humans, can audio material (e.g., books, articles, movies, and
struggle to understand the socio-cultural and cartoons36), ideally online in digital format.
ritual aspects of working with humans.34
The quantity and quality of available explicit
In developing countries, algorithms that don’t knowledge about developing countries is
account for local cultural nuances or are deployed relatively low, and even lower in local languages.
in populations unused to algorithmic decision- For example, according to one estimate,37 60%
making can be particularly harmful. Algorithms of the 10 million most popular websites on the
that rely on machines that can’t converse in local internet are in English. Hindi, spoken by more than
languages can make the divide even greater. At 600 million people worldwide, is the top South
the time of writing, Google Home didn’t support Asian language but accounts for only 0.1% of
Zulu,35 which is widely spoken in South Africa, one online content. Other languages like Bengali and
of the more developed markets in Africa. Urdu, which are spoken by hundreds of millions of
people, don’t even appear on the list. Content in
One way to train machines and algorithms to the African language of Igbo, spoken by at least 30
work better with humans is to expose them to million people, makes up less than 0.1% of all online
a sufficiently large corpus of commonsense material. Initiatives such as Masakhane,38 Zindi,39
knowledge informed by cultural practices. This and No Language Left Behind40 are steps toward
knowledge can be either “declarative” (i.e., stop addressing the issue, but the chasm remains wide.
11
DIGITAL IMPACT ALLIANCE | 2022
12
DIGITAL IMPACT ALLIANCE | 2022
Issue #5: Developing countries are Issue #6: Developing countries deploying
dependent on international data algorithmic decision-making are dependent
infrastructure to develop and manage their on big tech companies but have little
algorithms. leverage over them.
The deployment of algorithms at scale is resource As described above, large international firms still
intensive, requiring large amounts of data and a control access to the computing infrastructure
highly sophisticated and expensive computing and data required to develop, manage, and
infrastructure. According to one estimate,52 it can implement algorithmic decision-making in
cost upward of $150,000 to train a contemporary most countries. The impact of this dependence
neural network for an English to German is worse for developing countries, whose
translation engine, and that network would generally low-per capita income and, in many
release emissions equivalent to a trans-America cases, small size mean they have little leverage
flight. The costs alone make the introduction over these large firms.59 Contrast this with the
of algorithmic decision-making a daunting situation when Europe implemented its General
proposition in most developing countries. Data Protection Regulation (GDPR). When that
happened, companies throughout the entire
In Izmir, the implementation team had to make market scrambled, often at great cost, to update
many design decisions to account for the local their digital products, services, and conditions to
network not being able to support the bandwidth meet the requirements of the GDPR because they
required by the city’s algorithm. The city also did not want to lose access to some of the world’s
lacked the financing required to install additional largest economies.
thermal cameras that would increase the
effectiveness of the algorithms. Very little work has been done to rigorously
examine the effects of the power imbalance
Developing countries typically don’t have the between developing countries and large
complex infrastructure of data storage and international digital firms and platforms. As noted
modern computing hardware required to test above, some countries have responded with
and run algorithms, so they are dependent on a pastiche of disjointed approaches, such as
infrastructure provided by large firms based in bans, social media taxes, and data localization
foreign countries. Compounding the problem is requirements, but there’s limited agreement on
the fact that the global data storage infrastructure more positive responses, such as regional data
is unevenly distributed. One study estimated pools and shared computing infrastructure.
that the United States accounts for almost 40%
of all global data storage sites,53 with another
five countries accounting for an additional 30%.
California alone has more data centers than all
of sub-Saharan Africa.54 Amazon, Google, and
Microsoft manage more than 50% of the world’s
data centers, while Chinese firms operate the
world’s largest ones. This leaves developing
countries in a tenuous position, especially
as concerns about data localization and
sovereignty55 mount and regulations around
the transfer of personal data across national
boundaries become restrictive.
13
DIGITAL IMPACT ALLIANCE | 2022
SECTION 5
14
DIGITAL IMPACT ALLIANCE | 2022
Endnotes
1 “Machine learning and phone data can improve targeting of humanitarian aid,” https://www.nature.
com/articles/s41586-022-04484-9.
2 “Real-Time asset tracking; A starting point for Digital Twin implementation in Manufacturing,” shorturl.
at/CEKOX.
5 “’The algorithm has primacy over media … over each of us, and it controls what we do,’” https://hls.
harvard.edu/today/the-algorithm-has-primacy-over-media-over-each-of-us-and-it-controls-what-
we-do/.
11 “‘F**k the algorithm?: What the world can learn from the UK’s A-level grading fiasco,” https://blogs.lse.
ac.uk/impactofsocialsciences/2020/08/26/fk-the-algorithm-what-the-world-can-learn-from-the-
uks-a-level-grading-fiasco/.
12 “Social Welfare, Risk Profiling and Fundamental Rights: The Case of SyRI in the Netherlands,”
https://www.jipitec.eu/issues/jipitec-12-4-2021/5407#:~:text=The%20Court%20held%20that%20
SyRI,basis%20of%20Article%208%20ECHR.
13 Please note that the case study descriptions are based on material provided by the teams working on
the projects. They have not been analyzed critically. A future report from this author will include the
perspective of additional stakeholders and deeper analysis of the accompanying systems.
14 “AI governance in the public sector: Three tales from the frontiers of automated decision-making in
democratic settings,” https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7164913/.
15 Based on inputs provided by the Government of the Izmir Metropolitan Municipality IT Department
and the Fire Emergency Department.
15
DIGITAL IMPACT ALLIANCE | 2022
24 Apart from the questions noted in the main body of the report, it is important to reiterate that
developing countries seeking to introduce or expand the use of algorithms in government face many
of the same data governance challenges that governments in advanced economies do, and that
inevitably accompany digital transformation everywhere. The following is an illustrative list, but is not
the focus of this brief: the data and computing infrastructure required to run sophisticated algorithms
tends to be inadequate; the quality of data—its completeness, biases, frequency, coverage, and
access—are common problems; the legitimacy, accountability, and transparency questions about
algorithms are difficult to resolve; the regulatory and institutional environment to tackle the issues
raised by the proliferation of algorithms have generally not kept pace with some complaining that
regulations stifle innovation, entrepreneurship, and competition and others grousing that institutions
don’t sufficiently protect the interests of the weak, the marginalized, and the vulnerable; skills,
capacity, and participation gaps are recurring themes within the government, civil society, private
sector, and academia; and financing is never adequate or timely.
26 “A trust framework for government use of artificial intelligence and automated decision making,”
https://arxiv.org/pdf/2208.10087.pdf.
27 “What does automated decision-making portend for the fight against discrimination in developing
countries?” https://digi-con.org/what-does-automated-decision-making-portend-for-the-fight-
against-discrimination-in-developing-countries/.
29 “Machine ethics and African identities: Perspectives of artificial intelligence in Africa,” https://www.
researchgate.net/publication/361644515_Machine_ethics_and_African_identities_Perspectives_of_
artificial_intelligence_in_Africa.
32 “Design Characteristics of Studies Reporting the Performance of Artificial Intelligence Algorithms for
Diagnostic Analysis of Medical Images: Results from Recently Published Papers,” https://pc.kjronline.
org/DOIx.php?id=10.3348/kjr.2019.0025.
16
DIGITAL IMPACT ALLIANCE | 2022
33 “Artificial intelligence could revolutionize medical care. But don’t trust it to read your x-ray just yet,”
https://www.science.org/content/article/artificial-intelligence-could-revolutionize-medical-care-
don-t-trust-it-read-your-x-ray.
41 “He is a doctor, she a nurse: How language carries gender bias into algorithms, perpetuates status
quo,” https://www.outlookindia.com/culture-society/he-is-a-doctor-she-a-nurse-how-language-
carries-gender-bias-into-algorithms-perpetuates-status-quo-news-195387.
43 “The Ethics, Transparency and Accountability Framework for Automated Decision-Making,” https://
www.gov.uk/government/publications/ethics-transparency-and-accountability-framework-for-
automated-decision-making.
48 “Excavating AI: The politics of images in machine learning training sets” https://excavating.ai.
53 “Microsoft, Amazon and Google Account for Over Half of Today’s 600 Hyperscale Data Centers,”
https://www.srgresearch.com/articles/microsoft-amazon-and-google-account-for-over-half-of-
todays-600-hyperscale-data-centers.
17
DIGITAL IMPACT ALLIANCE | 2022
54 “Improving data infrastructure helps ensure equitable access for poor people in poor countries,”
https://blogs.worldbank.org/opendata/improving-data-infrastructure-helps-ensure-equitable-
access-poor-people-poor-countries.
56 “National Digital Economy Policy and Strategy (2020-2030): For a Digital Nigeria,” https://www.ncc.
gov.ng/docman-main/industry-statistics/policies-reports/883-national-digital-economy-policy-
and-strategy/file.
57 “National strategy for development of digital economy and digital society to 2025, orientation to 2030,”
http://www.asemconnectvietnam.gov.vn/default.aspx?ZID1=3&ID1=2&ID8=118296.
58 “Developing countries are being left behind in the AI race—and that’s a problem for all of us,” https://
theconversation.com/developing-countries-are-being-left-behind-in-the-ai-race-and-thats-a-
problem-for-all-of-us-180218.
18