A Shift in Artistic Practices Through Ar
A Shift in Artistic Practices Through Ar
Artificial Intelligence
Authors
Abstract
AI technologies for artistic image and video generation [1] have had to contend
with building an entirely new vocabulary for describing images or frames in
video. Whilst image generation with autonomous painting was established decades
ago (such as the Aaron system by Harold Cohen), it is only relatively recently
that AI in visual domain has become accessible to vast communities thanks to open
source, open-access, and browser-based tools that require minimal local
computation resources, such as StableDiffusion, and Generative Adversarial
Networks such as BigGAN and StyleGAN.
In music and sound arts, four main tracks of AI applications appear [2] and
interactive music systems [3]: approaches working with symbolic music; systems
using audio signals directly; systems that map symbolic music domain to musical
audio or vice versa, and approaches that connect musical domains to other domains
such as bodily movement.
In performing arts, movement generation [4] has been one research track of AI.
The advancements in this domain have created a ripple effect in generative dance
and animation where AI systems have been used as a tool for assistance,
choreography support, performance collaboration, and movement generation.
Text generation and conversational bots [5] have initiated recent discussions on
how omnipresent Large Language Models (LLMs) such as ChatGPT can change all
aspects of society. Additionally, LLMs have been incorporated into
interdisciplinary domains, interactive artworks with chatbots, as well as tool
generation for other artistic domains such as audio synthesizer code or creative
coding script generation examples.
“I think it is only fair to people looking at this work that they should
know what the prompt was and also what software was used. I think the
artists should be allowed to opt-in or opt-out of having their work that
they worked so hard on by hand be used as a dataset for creating this other
artwork.”
Gurney’s comment highlights the current status quo where AI technology developers
overlook the explicit consent from artists and content owners, and how current
regulations fall short on empowering artists in decisions on the use of their
content. Furthermore, massive content generation in the style of a particular
artist could devalue it, as in the case of James Gurney, or increase it by social
networking, as in the case of Holly Herndon [10] who is an artist shared their
voice as a Machine Learning model (vocal deepfake) for other artist to perform
with her voice. This shift in capital value in cultural practices calls for a new
take in enacted ethics of artistic practices in the era of ubiquitous AI.
The case of James Gurney is one example of how the creators of these widely used,
easily accessible AI models for content generation have been gathering data for
their training datasets: by overlooking the consent of data creators. This
directly exploits the artist’s labor that is put into creating the original
content [11]. The data that is publicly shared on the internet is approached as
“free to take” and transformed into a capital commodity through the process of
training Machine Learning models, exhibiting similarities to historical practises
of exploitatively ignoring cultural context and consent, within the structures of
colonialism and its enclosures. This current status quo in data gathering can be
addressed through empowering the artist by incorporating their explicit consent
into current structures. The consent can be manifested by the artist, as in the
case of Stability.ai, where artists can choose an option so that their work is
not used in the training of Stable Diffusion models. However, it is not clear to
what extent this pledge can be honored, considering the multitude of remixes and
format shifts of popular artists' works on the internet. Other licenses in that
discussion, such as Creative Commons, emphasize the importance of fair-use for
public good [12]. While both proposals highlight valuable aspects, we envisage
that a future of copyright in artistic practices requires platforms and
structures for artistic data sovereignty.
An artwork is in close relation with its medium. Along with ubiquitous AI, the
evolution of new art markets such as social media, streaming platforms, and NFTs
initiated a cultural and monetary value reassignation by aggregating status as
major channels of artwork “storage” and distribution. The value of an artwork is
typically affected by the ranking or curation mechanisms of the medium. Those
mechanisms are directly related to the artwork’s visibility and thereby the value
of the artwork. These platforms now have substantial influence in reshaping
current social conceptions around economic and cultural capital. For example, the
notion of added value through engagement appears in Holly Herndon’s AI system for
voice synthesis, titled Holly+. Herndon trained the AI architecture on their own
voice recordings, and Holly+ is the AI model where users can synthesize recorded
audio in the style of Herndon’s personal voice. Herndon deploys a Decentralized
Autonomous Organization (DAO) blockchain to vote on the minting of artworks made
using Holly+, and they distribute tokens [13] to members of the DAO (and the
creator of minted artworks) to share in any profits from the usage of Holly+
recorded on the relevant blockchain. Here, Herndon acknowledges the value put
into Holly+ by each user when they engage with the AI system. By giving away DAO
tokens, Herndon creates a platform for users to acquire both a voting stake in
the Holly+DAO and to fiscally benefit from the capital value produced by artistic
work made with Holly+. This semi-decentralized governing structure for Holly+ is
centred on connecting value to engagement processes: the more people who ‘engage’
with art, the more valuable the art itself is perceived to be.
A world with more widely available AI art creation tools and globalized
platforms, giving more people than ever the ability to create and share art,
could lead to a greater chance to find a wider audience for previously unknown
and/or marginalized creators, styles, and art forms. Currently, however, the same
world and those same platforms are more often built to entrench the dominance of
the hegemonic culture and existing artists. There is still more to be done for
improving inclusion [15]. Geographical borders still matter in accessibility of
the internet, and protection of rights in AI applications (Fig 1). Therefore, the
community must take action to address the active participation of
underrepresented cultures beyond the passive appropriation of their imaginaries.
Cultivating diversity requires ethical considerations in all processes of all
stakeholders and actors of AI; while reducing bias, and analyzing stereotypes and
representation of individuals and their creations.
Acknowledgement
The first, second, and third authors share equal contributions as the first
authors. This work was partially supported by the Wallenberg AI, Autonomous
Systems and Software Program – Humanities and Socie-ty (WASP-HS) funded by the
Marianne and Marcus Wallenberg Foundation and the Marcus and Amalia Wallenberg
Foundation.
References
[3] K. Tatar and P. Pasquier, “Musical agents: A typology and state of the art
towards Musical Metacreation,” Journal of New Music Research 48, no. 1, pp.
56–105, (2019), doi: 10.1080/09298215.2018.1511736.
[5] T. Iqbal and S. Qureshi, “The survey: Text generation models in deep
learning,” Journal of King Saud University - Computer and Information Sciences,
34, No. 6, Part A, pp. 2515–2528, (2022), doi: 10.1016/j.jksuci.2020.04.001.
[6] M. Haataja and J. J. Bryson, “Reflections on the EU’s AI act and how we could
make it even better,” TechREG™ Chronicle, March 2022.
[8] Boden, M. A., and Edmonds, E. A., “What is generative art?” Digital
Creativity 20, no. 1, pp. 21–46, (2009), doi:10.1080/14626260902867915.
[9] Joss Fong, “The text-to-image revolution, explained,” Vox, 1 June 2022, <
https://www.youtube.com/watch?v=SVcsDDABEkM >, accessed 16 April 2023.
[12] Stephen Wolfson, “Fair Use: Training Generative AI,” Creative Commons, 17
February 2023,
<https://creativecommons.org/2023/02/17/fair-use-training-generative-ai/>,
accessed 12 April 2023.
[13] Scott Fitsimones, “The DAO Handbook: How Internet Strangers Are Building
Collective Movements,” The DAO Handbook, 19 January 2023,
<https://www.daohandbook.xyz/>, accessed 12 April 2023.
[14] Jobin, A., Ienca, M., & Vayena, E., “The global landscape of AI ethics
guidelines,” Nature Machine Intelligence, Volume 1, Issue 9, Article 9, (2019),
doi: 10.1038/s42256-019-0088-2
[15] Birhane, A., Isaac, W., Prabhakaran, V., Diaz, M., Elish, M. C., Gabriel,
I., & Mohamed, S., “Power to the People? Opportunities and Challenges for
Participatory AI,” in Equity and Access in Algorithms, Mechanisms, and
Optimization, Article No. 6, 1–8, Arlington VA USA: ACM, (2022) doi:
10.1145/3551624.3555290.
[16] Luccioni, A. S., Akiki, C., Mitchell, M., & Jernite, Y., “Stable Bias:
Analyzing Societal Representations in Diffusion Models”, arXiv:2303.11408. (2023)
doi: 10.48550/arXiv.2303.11408
[17] Lorena O’Neil, “These Women Tried to Warn Us About AI”, Rolling Stone. 12
August, 2023.
<https://www.rollingstone.com/culture/culture-features/women-warnings-ai-danger-r
isk-before-chatgpt-1234804367/>.
[18] Elvira Pollina, “ChatGPT can resume in Italy if meets data watchdog’s
demands.”, Reuters, 13 April 2023,
<https://www.reuters.com/technology/italy-lift-curbs-chatgpt-if-openai-meets-dema
nds-by-end-april-data-protection-2023-04-12/>, accessed 16 April 2023.
[19] Yomiuri Y. Shimbun. “Intellectual Property Plan Signals Reversal on AI
Policy.” June 10 2023,
<https://japannews.yomiuri.co.jp/politics/politics-government/20230610-115423/>,
accessed 14 August 2023.
Bios
Paola Torres Núñez del Prado is pursuing doctoral studies at the Stockholm
University of the Arts focusing on researching and developing hybrid interactive
textile sound interfaces that include the use of A.I. systems.
Roser Batlle-Roca is pursuing doctoral studies in Transparent AI & Music
Generation at Universitat Pompeu Fabra in Spain, in collaboration with JRC-EC and
Sony.