From vocal deepfakes to artificial voice actors and pop star avatars, data-driven machine learning has intensified embodied, musical, and social complexities of voice. While disembodiment and decontextualisation of voice have been musical concerns since the invention of sound recording, AI voice synthesis accelerates these processes and adds new perceptual, cognitive, and social layers. Many ontologies from voice studies imagine voice as resisting fixity, yet in today’s technological climate this resistance may be losing its ontological imperative. Voice is in transformation - possibly crisis - requiring both curiosity and care in paradoxical tension. These changes also unfold within a technological arms race for innovation, profit, and global AI supremacy. Artists are not only early adopters, but experimentalists and bards who participate in the narratives around AI and vocality. This thesis evaluates the changing vocal condition through first-person artistic research with AI voice technologies, exploring their poetics and potentials in three artworks created between 2021–2025. In Search of Good Ancestors / Ahnen in Arbeit was a year-long generative radio broadcast exploring machine learning as a intergenerational vocal memory. iː ɡoʊ weɪ is a hybrid extended voice performance practice using real-time voice transfer to unravel vocal identity on stage. DadaSets investigates the invisibilized vocal labour of AI voice through collaborations with artists, new scoring systems, the absurdist dataset-making performance Bla Blavatar vs Jaap Blonk, and the invention of the voice synthesis instrument Tungnaá. These works are analyzed through an interdisciplinary lens: experimental vocal traditions and the embodied musical-technological ethos of STEIM, alongside philosophies of voice, cognitive neuroscience, and material anthropology; while predictive coding theory frames compositional notions of uncanny, pathological and convivial technologisations of voice. Voice data emerges as paradoxical - both disembodied and relational, material and emergent, gift and commodity - functioning as the basis for musical animacy and collaboration within a rapidly changing socio-technical landscape.

References

Audiovisual reference material, citing audio examples and relevant artistic work of other artists.

< Home

ElevenLabs Anomaly

2024

Generated in 2024, Using ElevenLabs Zero-Shot Voice Cloning Tool as part of research at VocoLab, University College London, see:

Rosi, V., Soopramanien, E. and McGettigan, C. (2025) “Perception and social evaluation of cloned and recorded voices: Effects of familiarity and self-relevance,” Computers in Human Behavior: Artificial Humans, 4, p. 100143. Available at: https://doi.org/10.1016/j.chbah.2025.100143.

Sinewave Speech

2024

Generated in 2024, Using ElevenLabs Zero-Shot Voice Cloning Tool as part of research at VocoLab, University College London, see:

WaveNet Demo

8 September, 2016

Full Demo with Audio Samples at: https://deepmind.google/discover/blog/wavenet-a-generative-model-for-raw-audio/

expected synthesis

unconditional voice synthesis

speaker #1

unconditional voice synthesis

speaker #2

anomaly

unconditional piano synthesis

Chris Darwin, University of Sussex

Sinewave Speech Demonstration: https://users.sussex.ac.uk/~cjd/SWS/index.html

"WaveNets open up a lot of possibilities for TTS, music generation and audio modelling in general. The fact that directly generating timestep per timestep with deep neural networks works at all for 16kHz audio is really surprising, let alone that it outperforms state-of-the-art TTS systems." - Aaron van den Oord, Sander Dieleman, Primary Engineers behind the WaveNet paper.

Oord, A. van den, Dieleman, S. et al. (2016) “WaveNet: A Generative Model for Raw Audio.” arXiv. Available at: https://doi.org/10.48550/arXiv.1609.03499.

Cited Artists

ElevenLabs Anomaly

2024

Generated in 2024, Using ElevenLabs Zero-Shot Voice Cloning Tool as part of research at VocoLab, University College London, see:

Onyx Ashanti

Exo-Voice, Busking in Berlin Park

click to play

Tip

Tip

References