TY - JOUR
T1 - Algorithmic Ventriloquism
T2 - The Contested State of Voice in AI Speech Generators
AU - Ramati, Ido
N1 - Publisher Copyright: © The Author(s) 2024.
PY - 2024/1/1
Y1 - 2024/1/1
AB - This article explores the vocal human–machine relations embedded in text-to-speech (TTS) generators. Retracing the human sources behind the synthetic speech and tracking the remediation of the voice by the machine-learning algorithm, it argues that artificial intelligence (AI) speaking agents such as Siri and Alexa, as well as other TTS acts such as TikTok’s, are performing algorithmic ventriloquism. Speaking mechanically with the voices of professional voiceover artists, AI speech technologies algorithmically manipulate these voices, thus generating personas that hold an interconnected chain of tensions between the embodied and the virtual, the particular and the general, the human and the non-human, as well as between speech and writing. Algorithmic ventriloquism serves as an analytical framework to tie the techno-vocalic operation of the TTS system with its cultural, economic, philosophical, and sociolinguistic predicaments. The last section discusses the implications of algorithmic ventriloquism beyond the realm of the voice.
KW - TikTok
KW - human–machine relation
KW - media ventriloquism
KW - speech generators
KW - text-to-speech
UR - http://www.scopus.com/inward/record.url?scp=85182152955&partnerID=8YFLogxK
U2 - 10.1177/20563051231224401
DO - 10.1177/20563051231224401
M3 - Article
AN - SCOPUS:85182152955
SN - 2056-3051
VL - 10
JO - Social Media and Society
JF - Social Media and Society
IS - 1
ER -