15.ai

15.ai
	Screenshot of the 15.ai web interface on May 23, 2025
Type of site	Artificial intelligence, speech synthesis, generative artificial intelligence
Available in	English
Owner	15
Founder(s)	15
URL	15.ai (redirects to 15.dev)
Commercial	No
Registration	None
Launched	March 2020; 5 years ago
Current status	Active

15.ai is a free non-commercial web application and research project that uses artificial intelligence to generate text-to-speech voices of fictional characters from popular media. Created by a pseudonymous artificial intelligence researcher known as 15, who began developing the technology as a freshman during their undergraduate research at the Massachusetts Institute of Technology, the application allows users to make characters from video games, television shows, and movies speak custom text with emotional inflections. The platform is able to generate convincing voice output using minimal training data; the name "15.ai" references the creator's statement that a voice can be cloned with just 15 seconds of audio. It was an early example of an application of generative artificial intelligence during the initial stages of the AI boom.

Launched in March 2020, 15.ai became an Internet phenomenon in early 2021 when content utilizing it went viral on social media and quickly gained widespread use among Internet fandoms, such as the My Little Pony: Friendship Is Magic, Team Fortress 2, and SpongeBob SquarePants fandoms. The service featured emotional context through emojis, precise pronunciation control, and multi-speaker capabilities. Critics praised 15.ai's accessibility and emotional control but criticized its technical limitations in prosody options and non-English language support, with mixed results depending on character complexity. 15.ai is credited as the first platform to popularize AI voice cloning in memes and content creation.^[a]

Voice actors and industry professionals debated 15.ai's implications, raising concerns about employment impacts, voice-related fraud, and potential misuse. In January 2022, it was discovered that a company called Voiceverse had generated voice lines using 15.ai without attribution, promoted them as the byproduct of their own technology, and sold them as non-fungible tokens (NFTs) without permission.^[b] News publications universally characterized this incident as the company having "stolen" from 15.ai.^[c] The service went offline in September 2022 due to legal issues surrounding artificial intelligence and copyright. Its shutdown was followed by the emergence of commercial alternatives whose founders have acknowledged 15.ai's pioneering influence in the field of deep learning speech synthesis. On May 18, 2025, 15 launched 15.dev as the sequel to 15.ai.

History

[...] The website has multiple purposes. It serves as a proof of concept of a platform that allows anyone to create content, even if they can't hire someone to voice their projects.
It also demonstrates the progress of my research in a far more engaging manner – by being able to use the actual model, you can discover things about it that even I wasn't aware of (such as getting characters to make gasping noises or moans by placing commas in between certain phonemes).
It also doesn't let me get away with picking and choosing the best results and showing off only the ones that work [...] Being able to interact with the model with no filter allows the user to judge exactly how good the current work is at face value.

15, Hacker News^[38]

Background

The field of speech synthesis underwent a significant transformation with the introduction of deep learning approaches. In 2016, DeepMind's publication of the WaveNet paper marked a shift toward neural network-based speech synthesis, which enabled higher audio quality via causal convolutional neural networks. Previously, concatenative synthesis, which worked by stitching together pre-recorded segments of human speech, was the predominant method for generating artificial speech, but it often produced robotic-sounding results at the boundaries of sentences.^[39] In 2018, Google AI's Tacotron 2 showed that neural networks could produce highly natural speech synthesis but required substantial training data (typically tens of hours of audio) to achieve acceptable quality. When trained on two hours of data, output quality degraded but speech remained intelligible; with 24 minutes of training data, Tacotron 2 failed to produce intelligible speech.^[40] The year 2020 saw the emergence of HiFi-GAN, a generative adversarial network (GAN)-based vocoder that improved the efficiency of waveform generation while producing high-fidelity speech,^[41] and Glow-TTS, which introduced a flow-based approach that allowed for both fast inference and voice style transfer capabilities.^[42] Chinese tech companies like Baidu and ByteDance also contributed advances to the field.^[43]

2016–2020: Conception and development

Audio samples (three variations): Derpy Hooves reciting the FitnessGram PACER test introduction in a neutral emotion^[44]

15.ai was conceived in 2016 as a research project in deep learning speech synthesis by a developer known as 15 (at the age of 18^[45]) during their freshman year at the Massachusetts Institute of Technology (MIT) as part of its Undergraduate Research Opportunities Program (UROP).^[46] 15 was inspired by DeepMind's WaveNet paper, with development continuing through their studies as Google AI released Tacotron 2 the following year. By 2019, they had demonstrated at MIT their ability to replicate WaveNet and Tacotron 2's results using 75% less training data than previously required.^[43] The name "15.ai" is a reference to the developer's statement that a voice can be cloned with as little as 15 seconds of data.^[47]

15 had originally planned to pursue a PhD based on their undergraduate research, but opted to work in the tech industry instead after their startup was accepted into the Y Combinator accelerator in 2019. After their departure in early 2020, 15 returned to their voice synthesis research and began implementing it as a web application. According to a post on X from 15, instead of using conventional voice datasets like LJSpeech that contained simple, monotone recordings, they sought out more challenging voice samples that could demonstrate the model's ability to handle complex speech patterns and emotional undertones.^{[tweet 1]} During this phase, 15 discovered the Pony Preservation Project, a collaborative project started by /mlp/, the My Little Pony board on 4chan. Contributors of the project had manually trimmed, denoised, transcribed, and emotion-tagged thousands of voice lines from My Little Pony: Friendship Is Magic and had compiled them into a dataset that provided ideal training material for 15.ai.^[43]

2020–2022: Release and operation

15.ai was released in March 2020^[45] as a free and non-commercial web application that did not require user registration to use, but did require the user to accept its terms of service before proceeding.^[12] At the time of its launch, the platform had a limited selection of available characters, including those from My Little Pony: Friendship Is Magic and Team Fortress 2.^[48] Users were permitted to create any content with the synthesized voices under two conditions: they had to properly credit 15.ai by including "15.ai" in any posts, videos, or projects using the generated audio;^[49] and they were prohibited from mixing 15.ai outputs with other text-to-speech outputs in the same work to prevent misrepresentation of the technology's capabilities.^[50]

More voices were added to the website in the following months. In late 2020, 15 implemented a multi-speaker embedding in the deep neural network, which enabled the simultaneous training of multiple voices.^[43] Following this, the website's roster expanded from eight to over fifty characters.^[45] In addition, this implementation allowed the deep learning model to recognize common emotional patterns across different characters, even when certain emotions were missing from the characters' training data.^[51]

By May 2020, the site had served over 4.2 million audio files to users.^[52] In early 2021, the application gained popularity after skits, memes, and fan content created using 15.ai went viral on Twitter, TikTok, Reddit, Twitch, Facebook, and YouTube.^[53] At its peak, the platform incurred operational costs of US$12,000^[54] per month from AWS infrastructure needed to handle millions of daily voice generations; despite receiving offers from companies to acquire 15.ai and its underlying technology, the website remained independent and was funded out of the developer's personal earnings from their previous startup.^[43]

2022: Voiceverse NFT controversy

A satirical meme representing the "right-click, save as" criticism of NFTs. Critics of Voiceverse pointed out the irony of selling ownership rights to AI voices when they themselves had stolen 15.ai's technology.

On January 14, 2022, 15 discovered that a blockchain-based company called Voiceverse had generated voice lines using 15.ai, falsely showcased them on Twitter as a demonstration of their own voice technology without permission or attribution,^[c] and sold them as NFTs.^[b] This came shortly after 15 had stated in December 2021 that they had no interest in incorporating NFTs into their work.^[55] A screenshot of the log files posted by 15 showed that Voiceverse had generated audio of characters from My Little Pony: Friendship Is Magic using 15.ai and pitched them up to make them sound unrecognizable,^[56] a violation of 15.ai's terms of service, which explicitly prohibited commercial use and required proper attribution.^[57]

When confronted with evidence, Voiceverse stated that their marketing team had used 15.ai without proper attribution while rushing to create a demo.^[58] In response, 15 tweeted "Go fuck yourself,"^[59] which went viral, amassing hundreds of thousands of retweets and likes on Twitter in support of the developer.^[43] The tweets showcasing the stolen voices were subsequently deleted.^[14]

Aftermath

The controversy raised concerns about NFT projects, which, according to critics, were frequently associated with intellectual property theft and questionable business practices.^[61] The incident was documented in the AI Incident Database (AIID)^[23] and the AI, Algorithmic, and Automation Incidents and Controversies (AIAAIC) repository,^[24] and was also featured in Molly White's Web3 Is Going Just Great website.^[17] Pavel Khibchenko of Skillbox listed the incident as an example of fraud in NFTs.^[62] Voice actor and YouTuber Yong Yea criticized voice NFTs for their potential impact on the voice acting industry^[25] and stated in a YouTube video that Voiceverse deliberately plagiarized 15.ai's superior technology to falsely market voice NFTs.^{[video 1]}^{: 15:54–16:13} In a 2024 class action lawsuit filed against LOVO, Inc., the parent company of Voiceverse, court documents cited the company's prior theft of 15.ai's technology as part of the case.^[37]

Voice actor Troy Baker, who had announced his partnership with Voiceverse alongside their promotion of the stolen AI voices, faced mounting criticism for supporting an NFT project and for his confrontational announcement tone.^[63] Following continued backlash and the plagiarism revelation, Baker acknowledged that his original tweet was "antagonistic"^[64] and on January 31, announced that he would discontinue his partnership with Voiceverse.^[65]

2022–present: Inactivity and revival

In September 2022, 15.ai was taken offline due to legal issues surrounding artificial intelligence and copyright.^[66] In a post on Twitter, 15 suggested a future version that would better address copyright concerns from the outset.^[43] During this time, voice AI startups continued to cite 15.ai as a major influence to the field.^[67]

On May 18, 2025, 15 launched 15.dev as the official sequel to 15.ai.^[68]^{[tweet 2]} Fandom news site Equestria Daily reported that the website included "almost every voiced pony in the show" with "a dropdown for various emotions you want to generate."^[69]

Features

Three AI-generated voice line variations from 15.ai showing their waveforms and respective alignment confidence scores

15.ai is non-commercial, has no advertisements, generates no revenue, and operates without requiring user registration or accounts.^[70] Users are able to generate speech by inputting text and selecting a character voice, with optional parameters for emotional contextualizers and phonetic transcriptions. Each request produces three audio variations with distinct emotional deliveries.^[71] Characters available included multiple characters from Team Fortress 2 and My Little Pony: Friendship Is Magic, including the Mane Six and Derpy Hooves; GLaDOS, Wheatley, and the Sentry Turret from the Portal series; SpongeBob SquarePants; Kyu Sugardust from HuniePop; Rise Kujikawa from Persona 4; Daria Morgendorffer and Jane Lane from Daria; Carl Brutananadilewski from Aqua Teen Hunger Force; Steven Universe from Steven Universe; Sans from Undertale; Madeline and multiple characters from Celeste; the Tenth Doctor from Doctor Who; the Narrator from The Stanley Parable; and HAL 9000 from 2001: A Space Odyssey.^[72] Silent characters like Chell and Gordon Freeman were able to be selected and would emit silent audio files when any text was submitted.^[73] Characters from Undertale and Celeste did not produce spoken words but instead generated their games' distinctive beeps when text was entered.^[74]

Sample emoji probability distributions generated by the DeepMoji model. These emoji distributions were displayed on 15.ai as part of its technical metrics and graphs.^[75]

From 2020, 15.ai has generated audio at a 44.1 kHz sampling rate, higher than the 16 kHz standard used by most deep learning text-to-speech systems of that period. This higher fidelity creates more detailed audio spectrograms and greater audio resolution with the tradeoff that imperfections in the synthesis are more noticeable.^[76] 15.ai processes speech using customized deep neural networks and specialized audio synthesis algorithms.^[77] While its underlying technology could produce 10 seconds of audio in less than 10 seconds of processing time (i.e. faster-than-real-time), the user experience often involves longer waits as the servers manage thousands of simultaneous requests, sometimes taking more than a minute to deliver results.^[78]
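
The figures in the paragraph above can be checked with a little arithmetic. The following Python sketch is illustrative only: the function names are hypothetical and are not taken from 15.ai, whose implementation is not public. It computes the highest representable frequency (the Nyquist frequency) for a sampling rate, the number of samples in a clip, and the real-time factor, where a value below 1 means faster-than-real-time synthesis.

```python
def nyquist_hz(sample_rate_hz: int) -> float:
    """Highest frequency that can be represented at this sampling rate."""
    return sample_rate_hz / 2


def samples_for(duration_s: float, sample_rate_hz: int) -> int:
    """Number of audio samples needed for a clip of the given length."""
    return round(duration_s * sample_rate_hz)


def real_time_factor(processing_s: float, audio_s: float) -> float:
    """Processing time divided by audio length; below 1 is faster than real time."""
    if audio_s <= 0:
        raise ValueError("audio_s must be positive")
    return processing_s / audio_s


if __name__ == "__main__":
    for rate in (44_100, 16_000):
        print(rate, "Hz ->", nyquist_hz(rate), "Hz Nyquist,",
              samples_for(10, rate), "samples per 10 s")
    # Hypothetical timings: 8 s of compute versus 60 s including queueing.
    print(real_time_factor(8.0, 10.0), real_time_factor(60.0, 10.0))
```

At 44.1 kHz the model has to produce roughly 2.76 times as many samples per second as at 16 kHz, which is one way to read the tradeoff between fidelity and visible imperfections described above.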

Due to its nondeterministic design, 15.ai produces variations in its speech output. 15.ai introduced the concept of emotional contextualizers, which allowed users to specify the emotional tone of generated speech through guiding phrases.^[79] The emotional contextualizer functionality utilized DeepMoji, a sentiment analysis neural network developed at the MIT Media Lab that processed emoji embeddings from 1.2 billion Twitter posts to analyze their emotional content.^[49] If an input into 15.ai contained additional context (specified by a vertical bar), the additional context following the bar would be used as the emotional contextualizer.^[80] For example, if the input was Today is a great day!|I'm very sad., the selected character would speak the sentence "Today is a great day!" in the emotion one would expect from someone saying the sentence "I'm very sad."^[81]
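
The vertical-bar convention described above can be sketched as a small input parser. This is a minimal Python illustration, not 15.ai's actual code: the `Prompt` type and `split_contextualizer` function are hypothetical names, and the handling of edge cases (more than one bar, or an empty context) is an assumption, since the source does not document it.

```python
from typing import NamedTuple, Optional


class Prompt(NamedTuple):
    text: str                # the sentence the character will speak
    context: Optional[str]   # the emotional contextualizer, if one was given


def split_contextualizer(raw: str) -> Prompt:
    """Split 'spoken text|emotional context' at the first vertical bar.

    Assumption: anything after the first bar is treated as context, and an
    empty context is reported as None.
    """
    text, sep, context = raw.partition("|")
    context = context.strip()
    return Prompt(text.strip(), context if sep and context else None)


if __name__ == "__main__":
    print(split_contextualizer("Today is a great day!|I'm very sad."))
    print(split_contextualizer("Today is a great day!"))
```

In a full system the `context` string, rather than the spoken text, would be the input passed to the sentiment model to choose the emotional delivery.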

An example of a conversion of the text "daisy bell" into speech, starting from English orthography. English words are parsed as a string of ARPABET phonemes, which is then passed through a pitch predictor and a mel-spectrogram generator to generate audio.

15.ai uses pronunciation data from the Oxford Dictionaries API, Wiktionary, and the CMU Pronouncing Dictionary, the last of which uses ARPABET phonetic transcriptions. Users can input ARPABET transcriptions by enclosing phoneme strings in curly braces to correct mispronunciations.^[49] 15.ai's interface uses color-coding to indicate pronunciation certainty^[49] and also displays technical metrics, graphs, and comprehensive model analytics, which have included sentiment analysis and automatic improvements to the vocoder.^[45] The platform limits its prompt to 200 characters; users can combine multiple generations for longer speech sequences.^[82]
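
The curly-brace convention and the 200-character limit can both be illustrated with a short Python sketch. It rests on assumptions and is not 15.ai's implementation: `parse_segments` and `chunk_text` are hypothetical helpers, unmatched braces are simply skipped, and the chunker splits on whitespace without regard to brace groups.

```python
import re

MAX_PROMPT_CHARS = 200  # prompt limit stated in the text above

# Either a {...} group of ARPABET symbols or a run of ordinary text.
_SEGMENT = re.compile(r"\{([^{}]*)\}|([^{}]+)")


def parse_segments(text):
    """Return a list of ("text", str) and ("phonemes", [symbols]) segments."""
    segments = []
    for match in _SEGMENT.finditer(text):
        if match.group(1) is not None:
            segments.append(("phonemes", match.group(1).split()))
        elif match.group(2).strip():
            segments.append(("text", match.group(2).strip()))
    return segments


def chunk_text(text, limit=MAX_PROMPT_CHARS):
    """Greedily pack whole words into chunks of at most `limit` characters."""
    chunks, current = [], ""
    for word in text.split():
        if len(word) > limit:
            raise ValueError("a single word exceeds the character limit")
        candidate = f"{current} {word}" if current else word
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = word
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    # "D EY1 Z IY0" is the CMU Pronouncing Dictionary entry for "daisy".
    print(parse_segments("I love {D EY1 Z IY0} Bell"))
    print(len(chunk_text("word " * 100)))
```

Each chunk would be submitted as a separate generation and the resulting audio files joined afterwards, which is the manual workflow the text describes for longer passages.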

Later versions of 15.ai introduced multi-speaker capabilities. Rather than training separate models for each voice, 15.ai uses a unified model that learned multiple voices simultaneously through speaker embeddings: numerical representations that capture each character's unique vocal characteristics.^[43] Along with the emotional context conferred by DeepMoji, this allows the deep learning model to learn shared patterns across different characters' emotional expressions and speaking styles, even when characters lack examples of certain emotions in their training data.^[83]
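
The idea of a speaker embedding can be shown with a toy example. The Python sketch below is a generic illustration of conditioning on a per-speaker vector and makes no claim about 15.ai's architecture, which has not been published. The names are hypothetical, and the vectors are randomly initialised here, whereas in a real model they would be learned jointly with the rest of the network.

```python
import random


def make_embedding_table(speakers, dim, seed=0):
    """Build a lookup table with one vector of length `dim` per speaker.

    Random values stand in for parameters that training would learn.
    """
    rng = random.Random(seed)
    return {name: [rng.gauss(0.0, 1.0) for _ in range(dim)] for name in speakers}


def condition_on_speaker(encoded_text, speaker, table):
    """Append the speaker's vector to every frame of the encoded text.

    `encoded_text` is a list of frames, each a list of floats. The same
    shared network can then process any speaker, with the appended vector
    telling it whose voice to produce.
    """
    vector = table[speaker]
    return [frame + vector for frame in encoded_text]


if __name__ == "__main__":
    table = make_embedding_table(["GLaDOS", "SpongeBob SquarePants"], dim=4)
    frames = [[0.1, 0.2], [0.3, 0.4]]  # stand-in for text encoder output
    conditioned = condition_on_speaker(frames, "GLaDOS", table)
    print(len(conditioned), len(conditioned[0]))
```

Because every speaker shares the same network weights and differs only in the appended vector, patterns learned from one character's data are available when synthesizing another, which is the sharing effect the paragraph above attributes to the multi-speaker model.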

Reception

Critical reception

Critics described 15.ai as easy to use and generally able to convincingly replicate character voices, with occasional mixed results.^[84] Natalie Clayton of PC Gamer wrote that SpongeBob SquarePants' voice was replicated well, but described challenges in mimicking the Narrator from The Stanley Parable: "the algorithm simply can't capture Kevan Brighting's whimsically droll intonation."^[85] Zack Zwiezen of Kotaku reported that "[his] girlfriend was convinced it was a new voice line from GLaDOS' voice actor".^[86] Taiwanese newspaper United Daily News also highlighted 15.ai's ability to recreate GLaDOS's mechanical voice, alongside its diverse range of character voice options.^[87] Yahoo! News Taiwan reported that "GLaDOS in Portal can pronounce lines nearly perfectly", but also criticized that "there are still many imperfections, such as word limit and tone control, which are still a little weird in some words."^[88] Chris Button of Byteside called the ability to clone a voice with only 15 seconds of data "freaky," but also described the tech behind it as "impressive."^[89] Robin Lamorlette of Clubic described the technology as "devilishly fun" and wrote that Twitter and YouTube were filled with creative content from users experimenting with the tool.^[90] The platform's voice generation capabilities were regularly featured on Equestria Daily with documented updates, fan creations, and additions of new character voices. In a post introducing new character additions to 15.ai, Equestria Daily's founder Shaun Scotellaro wrote that "some of [the voices] aren't great due to the lack of samples to draw from, but many are really impressive still anyway."^[91] Chinese My Little Pony fan site EquestriaCN also documented 15.ai's development and its updates, though they criticized some of its bugs and long queue wait times.^[92]

Peter Paltridge of Anime Superhero News opined that "voice synthesis has evolved to the point where the more expensive efforts are nearly indistinguishable from actual human speech," but also stated that "In some ways, SAM is still more advanced than this. It was possible to affect SAM's inflections by using special characters, as well as change his pitch at will. With 15.ai, you're at the mercy of whatever random inflections you get."^[93] Conversely, Lauren Morton of Rock, Paper, Shotgun praised the depth of pronunciation control—"if you're willing to get into the nitty gritty of it".^[94] Similarly, Eugenio Moto of Qore.com wrote that "the most experienced of users can change parameters like the stress or the tone."^[95] Takayuki Furushima of Den Fami Nico Gamer highlighted the "smooth pronunciations", and Yuki Kurosawa of AUTOMATON wrote that its "rich emotional expression" was a major feature; both Japanese authors mentioned the lack of Japanese-language support.^[96] Renan do Prado of Arkade and José Villalobos of LaPS4 remarked that while users could create amusing results in Portuguese and Spanish respectively, the generation performed best in English.^[97] Chinese gaming news website GamerSky called the app "interesting", but also criticized the word count limit of the text and the occasional lack of intonations.^[98] Machine learning professor Yongqiang Li wrote that 15.ai "perfectly preserves the rhythm and characteristics of the speaker," and remarked that the application was still free despite having 5,000 people generating voices concurrently at the time of writing.^[99] Marco Cocomello of GLITCHED remarked that despite the 200-character limitation, the results "blew [him] away" when testing the app with GLaDOS's voice.^[100] Spanish author Álvaro Ibáñez wrote in Microsiervos that he found the rhythm of the AI-generated voices interesting and that 15.ai was able to adapt its delivery based on the text's meaning.^[101]

Technical publications provided more in-depth analysis of 15.ai's capabilities and limitations compared to other text-to-speech technologies of the time. Google DeepMind senior research scientist Alex Irpan wrote that when 15.ai launched in 2020, it was "arguably the highest quality voice generation model in the world" and superior to models developed by Google AI.^[102] Rionaldi Chandraseta of Towards Data Science wrote that voice models trained on larger datasets created more convincing output with better phrasing and natural pauses, particularly for extended text.^[77] Bai Feng of XinZhiYuan on QQ News highlighted the technical achievement of 15.ai's high-quality output despite using minimal training data and wrote that it was of significantly higher quality than typical deep learning text-to-speech implementations. Feng also acknowledged that while some pronunciation errors occurred due to the limited training data, it was understandable given that contemporary deep learning models typically required 40 or more hours of audio.^[103] Similarly, Parth Mahendra of AI Daily wrote that while the system "does a good job at accurately replicating most basic words," it struggled with more complex terms, noting that characters would "absolutely butcher the pronunciation" of certain words.^[52] Ji Yunyo of NetEase News called the technology behind 15.ai "remarkably efficient" but also criticized its emotional limitations, writing that the emotional expression was relatively "neutral" and that "extreme" emotions couldn't be properly synthesized, making it less suitable for not safe for work applications.^[104] Ji also wrote that while many deepfake videos required creators to extract and edit material from hours of original content for very short results, 15.ai could achieve similar or better effects with only a few dozen minutes of training data per character.^[105]

Reactions from voice actors of featured characters

Ellen McLain (voice of GLaDOS in Portal) and John Patrick Lowrie (voice of the Sniper in Team Fortress 2) were interviewed on The VŌC Podcast in 2021 about their perspectives on 15.ai and AI voice synthesis technology.

Some voice actors whose characters appeared on 15.ai have publicly shared their thoughts about the platform. In an April 2021 interview, John Patrick Lowrie—who voices the Sniper in Team Fortress 2—said that he had discovered 15.ai when a prospective intern showed him a skit she had created using AI voices of the Team Fortress 2 characters.^{[video 2]}^: 0:51:50 Lowrie commented:

"The technology still has a long way to go before you really believe that these are just human beings, but I was impressed by how much [15.ai] could do. [...] You certainly don't get the delivery that you get from an actual person who's analyzed the scene, [...] but I do think that as a fan source—for people wanting to put together mods and stuff like that—that it could be fun for fans to use the voices of characters they like."^{[video 2]}^: 0:53:12

He drew an analogy to synthesized music, adding:

"If you want the sound of a choir, and you want the sound of an orchestra, and you have the money, you hire a choir and an orchestra. And if you don't have the money, you have something that sounds pretty nice; but it's not the same as a choir and an orchestra."^{[video 2]}^: 1:01:10

In a 2021 live broadcast on his Twitch channel, Nathan Vetterlein—the voice actor of the Scout from Team Fortress 2—listened to an AI recreation of his character's voice and commented: "It's interesting; it's all right. There's some stuff in there".^{[video 3]}

Ethical concerns

Other voice actors had mixed reactions to 15.ai's capabilities. While some industry professionals acknowledged the technical innovation, others raised concerns about the technology's implications for their profession.^[106] When voice actor Troy Baker announced his partnership with Voiceverse NFT, which had misappropriated 15.ai's technology, critics raised concerns about automated voice acting's potential reduction of employment opportunities for voice actors, risk of voice impersonation, and potential misuse in explicit content.^[107] Ruby Innes of Kotaku Australia wrote that "this practice could potentially put voice actors out of work considering you could just use their AI voice rather than getting them to voice act for a project and paying them."^[12] In her coverage of the Voiceverse controversy, Edie WK of Checkpoint Gaming raised the concern that "this kind of technology has the potential to push voice actors out of work if it becomes easier and cheaper to use AI voices instead of working with the actor directly."^[29]

While 15.ai limited its scope to fictional characters and did not reproduce voices of real people or celebrities,^[49] computer scientist Andrew Ng commented that similar technology could be used to do so, including for nefarious purposes.^[48] In his 2020 assessment of 15.ai, Ng outlined potential "enormously productive" applications of voice cloning, such as revolutionizing the use of virtual actors in Hollywood, enabling voice actors to participate in more cartoon and audiobook productions, and allowing content creators to use synthetic celebrity voices to narrate their scripts.^[48] However, he also cautioned that synthesizing a human's voice without consent raises ethical concerns and potential legal issues, and further warned that it could be maliciously exploited to impersonate private individuals.^[48]

Legacy

A January 2021 CNN broadcast showing a viral video that used 15.ai to replace Donald Trump's Home Alone 2 cameo with the Heavy Weapons Guy from Team Fortress 2

15.ai was an early pioneer of audio deepfakes, and its popularity led to the emergence of AI speech synthesis-based memes during the initial stages of the AI boom in 2020. 15.ai is credited as the first platform to popularize AI voice cloning in Internet memes and content creation,^[a] particularly through its ability to generate convincing character voices in real-time without requiring extensive technical expertise.^[108] The platform's impact was especially large in fan communities, such as the My Little Pony: Friendship Is Magic, Portal, Team Fortress 2, and SpongeBob SquarePants fandoms, where it enabled the creation of viral content that garnered millions of views on social media.^[109] Team Fortress 2 content creators also used the platform to produce both short-form memes and complex narrative animations using Source Filmmaker. Fan creations included skits and fan animations,^[110] crossover content,^[111] recreations of viral videos,^[112] adaptations of fan fiction,^[45] music videos, and musical compositions.^[45] Some fan creations gained mainstream attention: a viral video that replaced Donald Trump's cameo in Home Alone 2: Lost in New York with the Heavy Weapons Guy's AI-generated voice was featured on a daytime CNN segment in January 2021.^[113] Some users integrated 15.ai with voice command software to create personal assistants.^[114]

The Tax Breaks is a 17-minute fan-made episode of Friendship Is Magic produced using character voices from 15.ai.^[45]

Its influence since its launch has been publicly recognized, with commercial alternatives like ElevenLabs^[d] and Speechify emerging to fill the void after its initial shutdown.^[116] Contemporary generative voice AI companies have acknowledged 15.ai's pioneering role.^[102] Y Combinator startup PlayHT called the debut of 15.ai "a breakthrough in the field of text-to-speech (TTS) and speech synthesis".^[117] Cliff Weitzman, the founder and CEO of Speechify, credited 15.ai for "making AI voice cloning popular for content creation by being the first [...] to feature popular existing characters from fandoms".^[118] Mati Staniszewski, co-founder and CEO of ElevenLabs, wrote that 15.ai was transformative in the field of AI text-to-speech.^[119]

15.ai established technical precedents that influenced subsequent developments in AI voice synthesis. Its integration of DeepMoji for emotional analysis demonstrated the viability of incorporating sentiment-aware speech generation,^[120] while its support for ARPABET phonetic transcriptions set a standard for precise pronunciation control in public-facing voice synthesis tools.^[43] The platform's multi-speaker model, which enabled simultaneous training of diverse character voices, allowed the system to recognize emotional patterns across different voices even when certain emotions were absent from individual character training sets.^[83] 15.ai also contributed to the reduction of training data requirements for speech synthesis. Contemporary models like Tacotron 2 required tens of hours of audio to produce acceptable results and failed to generate intelligible speech with less than 24 minutes of training data.^[40] In contrast, 15.ai demonstrated the ability to generate speech with substantially less training data; the name "15.ai" refers to the creator's statement that a voice can be cloned with just 15 seconds of data.^[43] The 15-second benchmark became a reference point for subsequent voice synthesis systems; the original statement that only 15 seconds of data is required to clone a human's voice was corroborated by OpenAI in 2024.^[121]

Explanatory footnotes

^[a] Attributed to multiple references: Rock Paper Shotgun,^[1] Clubic,^[2] GLITCHED,^[3] United Daily News,^[4] Analytics India Magazine,^[5] Inverse,^[6] Speechify,^[7] The Guardian,^[8] Independent,^[9] and Alex Irpan.^[10]
^[b] Attributed to multiple references: AI Incident Database,^[23] AI, Algorithmic, and Automation Incidents and Controversies,^[24] Gamereactor,^[16] The Journal,^[25] Eurogamer,^[13] and GameGuru.^[34]
^[c] Attributed to multiple references: The Mary Sue,^[11] Kotaku Australia,^[12] Eurogamer,^[13] NME,^[14] Muropaketti,^[15] Gamereactor,^[16] Web3 Is Going Just Great,^[17] StopGame,^[18] iXBT Games,^[19] DTF,^[20] Sport.es,^[21] FZ,^[22] AI Incident Database,^[23] AI, Algorithmic, and Automation Incidents and Controversies,^[24] The Journal,^[25] LevelUp,^[26] Stevivor,^[27] PlayStation Universe,^[28] Checkpoint Gaming,^[29] Tech Times,^[30] Mobidictum,^[31] OtakuPT,^[32] Gamebrott,^[33] GameGuru,^[34] Shazoo,^[35] Geek Culture,^[36] and Lehrman v. LOVO.^[37]
^[d] ElevenLabs uses "11.ai" as a legal byname for its web domain.^[115]

References

Notes

^ Morton 2021: "Machine learning is absolutely fascinating and yet I mostly just enjoy when people use impressive tech to create weird skits and memes. That's exactly what everyone appears to be doing with an extremely impressive machine-learning tool that lets you type in text for various characters to say out loud. [...] It's made possible by the text to speech algorithm 15.ai that studies clips of characters and uses deep-learning to make those characters say whatever the heck you want."
^ Lamorlette 2021: "Vous avez toujours rêvé de faire dire n'importe quoi à vos personnages de jeux vidéo préférés (ou détestés) ? Le site 15.ai l'a fait !" (transl. "Have you always dreamed of making your favorite (or hated) video game characters say anything? 15.ai has finally made it happen!")
^ Cocomello 2021: "However, back then if you wanted to create your own dialogue, it required layers of sound enhancements and tweaks. Thankfully, the world has evolved and now thanks to the 15.ai app, we can make [...] popular characters say whatever we want"
^ MrSun 2021: "大家是否都曾經想像過，假如能讓自己喜歡的遊戲或是動畫角色說出自己想聽的話，不論是名字、惡搞或是經典名言，都是不少人的夢想吧。不過來到 2021 年，現在這種夢想不再是想想而已，因為有一個網站通過 AI 生成的技術" (transl. "Have you ever imagined what it would be like if your favorite game or anime characters could say exactly what you want to hear? Whether it's names, parodies, or classic quotes, this is a dream for many. However, as we enter 2021, this dream is no longer just a fantasy, because there is a website that uses AI-generated technology,")
^ Anirudh VK 2023: "While AI voice memes have been around in some form since '15.ai' launched in 2020, [...]"
^ Wright 2023: "AI voice tools used to create "audio deepfakes" have existed for years in one form or another, with 15.ai being a notable example."
^ Weitzman 2023: "It gained popularity because it was the first AI voice platform that featured an assortment of fictional characters from a variety of media sources"
^ Temitope 2024: "During this period, 15.ai earned credit for single-handedly popularizing AI voice cloning—often described as 'audio deepfakes'—in memes, viral content, and fan-driven media."
^ Abisola 2025: "Many credit 15.ai as the first mainstream text-to-speech platform that truly made 'audio deepfakes' go viral,"
^ Irpan 2025: "multiple startups credit 15.ai for creating the market they now compete in."
^ Lawrence 2022.
^ ^a ^b ^c Innes 2022.
^ ^a ^b Phillips 2022.
^ ^a ^b Williams 2022.
^ Muropaketti 2022.
^ ^a ^b Groth-Anderson 2022.
^ ^a ^b White 2022.
^ Skorich 2022.
^ Piletsky 2022.
^ Granger 2022.
^ Baylos 2022.
^ Myrén 2022.
^ ^a ^b ^c Sonali 2022.
^ ^a ^b ^c AIAAIC 2022.
^ ^a ^b ^c Parker 2022.
^ Rosas 2022.
^ Wright 2022.
^ Carcasole 2022.
^ ^a ^b W-K 2022.
^ Henry 2022.
^ Aktaş 2022.
^ Archer 2022.
^ Ifram 2022.
^ ^a ^b Kuchkanov 2022.
^ Коэн 2022.
^ Toh 2022.
^ ^a ^b Paul Lehrman and Linnea Sage, et al. v. LOVO, Inc., No. 1:24-cv-03770, 38 (S.D.N.Y. 2024) ("Separately, VoiceVerse has already been found to have stolen technology from another company. See Ule Lopez, WCCF Tech, "Voiceverse NFT Service Reportedly Uses Stolen Technology from 15ai," (Jan. 16, 2022), https://wccftech.com/voiceverse-nft-service-usesstolen-technology-from-15ai/."), archived from the original on 2025-10-04.
^ Hacker News 2022.
^ Barakat, Turk & Demiroglu 2024.
^ ^a ^b Google 2018.
^ Kong, Kim & Bae 2020.
^ Kim et al. 2020.
^ ^a ^b ^c ^d ^e ^f ^g ^h ^i ^j Temitope 2024.
^ "Examples". May 15, 2025. Retrieved May 16, 2025.
^ ^a ^b ^c ^d ^e ^f ^g Abisola 2025.
^ Chandraseta 2021; Li 2021; Temitope 2024.
^ Phillips 2022; Temitope 2024; Abisola 2025.
^ ^a ^b ^c ^d Ng 2020.
^ ^a ^b ^c ^d ^e Kurosawa 2021.
^ "About". 15.ai (Official website). March 2, 2020. Archived from the original on March 3, 2020. Retrieved December 23, 2024.
^ Kurosawa 2021; Temitope 2024; Abisola 2025.
^ ^a ^b Mahendra 2020.
^ Ruppert 2021; Abisola 2025.
^ Temitope 2024; Abisola 2025.
^ Lopez 2022; Innes 2022; Piletsky 2022.
^ Phillips 2022; Innes 2022; Toh 2022; W-K 2022; Muropaketti 2022; Ifram 2022; Williams 2022; AIAAIC 2022.
^ Innes 2022; Коэн 2022; Myrén 2022; AIAAIC 2022.
^ Phillips 2022; Myrén 2022; AIAAIC 2022.
^ Wright 2022; Groth-Anderson 2022; Myrén 2022; Archer 2022; Williams 2022.
^ Innes 2022; Enriquez 2022.
^ Коэн 2022; Baylos 2022.
^ Khibchenko 2022.
^ Lawrence 2022; Innes 2022; White 2022; W-K 2022.
^ Wright 2022; White 2022; Carcasole 2022; Enriquez 2022.
^ Parker 2022; Granger 2022.
^ Staniszewski 2024; Temitope 2024.
^ Temitope 2024; Irpan 2025.
^ "FAQ". 15.dev. May 18, 2025. Archived from the original on October 1, 2025. Retrieved May 18, 2025.
^ Scotellaro 2025.
^ Williams 2022; Wright 2022; Innes 2022.
^ Ibáñez 2022.
^ Zwiezen 2021; Clayton 2021; Morton 2021; Ruppert 2021; Villalobos 2021; Furushima 2021; Kurosawa 2021; Cocomello 2021; Lamorlette 2021; Phillips 2022.
^ Morton 2021; 遊戲 2021.
^ 遊戲 2021.
^ Ibáñez 2022; Abisola 2025.
^ Feng 2020.
^ ^a ^b Chandraseta 2021.
^ Chandraseta 2021; Lamorlette 2021.
^ Chandraseta 2021; Temitope 2024.
^ Chandraseta 2021: "By adding a '|' after the original sentence and providing an extra sentence, we could control what emotion the original sentence will be spoken with. In other words, 'text_1|text_2' will produce a voice line of text_1 with the emotion of text_2."
^ Chandraseta 2021: "because it could force the bot into generating previously unknown data, such as saying 'Today is a great day' with a sad or angry emotion"
^ Cocomello 2021; Ruppert 2021.
^ ^a ^b Kurosawa 2021; Temitope 2024.
^ Clayton 2021; Ruppert 2021; Villalobos 2021; Cocomello 2021.
^ Clayton 2021.
^ Zwiezen 2021.
^ 遊戲 2021: "目前「15.ai」的網頁上，提供了不少的音源，[...]除了《傳送門》之外，15.ai 網站目前也支援了許多來自遊戲、電影或動畫中的人物語音，" (transl. "Currently, the "15.ai" website provides a lot of audio sources. [...] In addition to "Portal", the 15.ai website currently also supports voices for many characters from games, movies or animations.")
^ MrSun 2021: "的 GLaDOS 也能完美的唸出任何台詞。當然網站也補充目前還有很多不完美的地方，像是字數限制、語氣控制在某些話上還是略有怪異，但只要肯花時間，也能像是其他網友一樣，通過剪輯來完成有趣的創作，" (transl. "Even GLaDOS in "Portal" can perfectly recite any lines. Of course, the website also added that there are still many imperfections, such as word limit and tone control, which are still a bit weird in some words, but as long as you are willing to spend time, you can also complete interesting creations through editing like other netizens.")
^ Button 2021.
^ Lamorlette 2021: "On peut donc retrouver sur ces réseaux de nombreux exemples de ce que peut donner le mélange entre un esprit créatif et une technologie aussi efficace que diablement amusante." (transl. "These social networks are therefore full of examples of what can be achieved by combining a creative mind with technology that is as effective as it is devilishly fun.")
^ Scotellaro 2020.
^ EquestriaCN 2021.
^ Paltridge 2021.
^ Morton 2021.
^ Moto 2021: "Incluso, los más clavados pueden cambiar algunos parámetros como la intencionalidad o el tono." (transl. "Actually, the most experienced of users can change some parameters like the stress or tone.")
^
- Furushima 2021: 日本語入力には対応していないが、ローマ字入力でもなんとなくそれっぽい発音になる。; 15.aiはテキスト読み上げサービスだが、特筆すべきはそのなめらかな発音と、ゲームに登場するキャラクター音声を再現している点だ。 (transl. It does not support Japanese input, but even if you input using romaji, it will somehow give you a similar pronunciation.; 15.ai is a text-to-speech service, but what makes it particularly noteworthy is its smooth pronunciation and the fact that it reproduces the voices of characters that appear in games.)
- Kurosawa 2021: "もうひとつ15.aiの大きな特徴として挙げられるのが、豊かな感情表現だ" (transl. "Another major feature of 15.ai is its rich emotional expression.")
- Kurosawa 2021: "英語版ボイスのみなので注意" (transl. "Please note that this is an English voice only version.")
^
- do Prado 2021: "Obviamente o programa funciona no idioma inglês, mas dá pra gerar umas frases bem emboladas e engraças em português, estilo aqueles memes usando vozes em outros idiomas falando em português." (transl. "Obviously, the program works in English, but you can generate some really confusing and funny sentences in Portuguese, like those memes using voices in other languages speaking Portuguese.")
- Villalobos 2021: "En este sentido, en las últimas horas se ha hecho popular un sitio web que emula la voz de GlaDOS para que diga todas las palabras que quieras, siempre y cuando estén en inglés, aunque puedes escribir algo en español e intentará pronunciarlo, pero no lo hará correctamente." (transl. "In this sense, in recent hours a website has become popular that emulates the voice of GlaDOS so that it says all the words you want, as long as they are in English, although you can write something in Spanish and it will try to pronounce it, but it will not do it correctly.")
^
- GamerSky 2021: "虽然AI的声音缺少了些抑扬顿挫，不过效果也还算有趣。" (transl. "Although the AI's voice lacks some intonation, the effect is still interesting.")
- GamerSky 2021: "目前15.ai提供的角色选项较少，由于文本的字数限制，生成的语音也相对较短" (transl. "Currently, 15.ai provides relatively few character options, and due to the word limit of the text, the generated voice is relatively short.")
^
- Li 2021: "完美保留了发音人的韵律和特色，" (transl. "perfectly preserves the rhythm and characteristics of the speaker,")
- Li 2021: "该网站的访问量为在线任务差不多5000以上，而且目前完全免费，" (transl. "The number of requests to the website is more than 5,000 tasks, and it is currently completely free.")
^ Cocomello 2021.
^ Ibáñez 2022: "Personalmente encontré interesantes las pausas y el ritmo y que ciertamente se nota que según el contenido del texto se «interpreta» el resultado según lo que se intenta transmitir." (transl. "Personally, I found the pauses and rhythm interesting, and it is certainly noticeable that, depending on the content of the text, the result is 'interpreted' according to what it is trying to convey.")
^ ^a ^b Irpan 2025.
^
- Feng 2020: "该工具生成的音频文件的采样率为 44100 Hz，而大多数基于深度学习的文本转语音实现，所使用的采样率为16,000 Hz。所以用它产生的音频，声谱会更详细（更高质量的音频），同时缺陷也更明显。" (transl. "The audio files generated by this tool have a sampling rate of 44100 Hz, while most deep learning-based text-to-speech implementations use a sampling rate of 16,000 Hz. Therefore, the audio generated by it will have a more detailed sound spectrum (higher quality audio), but the defects will be more obvious.")
- Feng 2020: "当然在这么小的语料上训练的模型也是有缺陷的，有些单词可能发音不准确，其实这也很好理解，即使是人，在遇到生词的时候也不一定能准确发音，而传统的深度模型通常有 40 个小时或者更多的语料，所以错误率会低一些。" (transl. "Of course, the model trained on such a small corpus is also flawed, and some words may not be pronounced correctly. In fact, this is easy to understand. Even humans may not be able to pronounce new words accurately when they encounter them. Traditional deep models usually have 40 hours or more of corpus, so the error rate will be lower.")
^ Ji 2021: "但是由于情绪表现只能联系上下文进行自动识别，导致这些语音在情感表达上比较"中庸"，一些"极端"的情绪无法通过语音合成正常表达，[...]距离其被正式用于某些NSFW的同人作品，还有很长的路要走。" (transl. "the emotional expression can only be automatically recognized in the context, which makes these voices relatively "neutral" in emotional expression. Some "extreme" emotions cannot be expressed normally through voice synthesis. [...] it still has a long way to go before it can be officially used in some NSFW fan works.")
^ Ji 2021: "网友在油管上看到的许多"深度伪造"视频，都依赖视频创作者从原本数小时的数据资料里进行提取编辑，最终才能制作非常简短的内容，并且呈现效果还很一般。而15.ai的开发者表示，自己的这项技术可以轻松实现那些视频效果（事实上15.ai的许多角色进行深度学习的数据时长只有几十分钟）。" (transl. "Many of the "deep fake" videos that netizens see on YouTube rely on video creators to extract and edit hours of data to produce very short content, and the presentation effect is still very average. The developers of 15.ai said that their technology can easily achieve those video effects (in fact, the data for deep learning of many characters of 15.ai is only tens of minutes long).")
^ Parker 2022; Temitope 2024.
^ Innes 2022; W-K 2022; Ng 2020; Parker 2022; Skorich 2022; Piletsky 2022; Enriquez 2022.
^ Ruppert 2021; Morton 2021.
^ 遊戲 2021; Kurosawa 2021; Morton 2021; Temitope 2024.
^ Zwiezen 2021; Ruppert 2021; Kurosawa 2021; Abisola 2025.
^ Ruppert 2021.
^ Zwiezen 2021; Morton 2021.
^ Clayton 2021; CNN 2021.
^ Furushima 2021.
^ ElevenLabs 2024b.
^ Staniszewski 2024; Play.ht 2024; Weitzman 2023.
^ Play.ht 2024.
^ Weitzman 2023.
^ Staniszewski 2024.
^ Osman 2022.
^ OpenAI 2024; Temitope 2024.

Tweets

^ @fifteenai (December 7, 2024). "The past and future of 15.ai" (Tweet). Archived from the original on December 8, 2024. Retrieved December 19, 2024 – via Twitter.
^ @fifteenai (May 18, 2025). "We are so back. https://15.dev Only MLP characters for now. More characters, features, and improvements will be added soon. Check Twitter and/or the Discord server (linked on the website) for updates! (Expect possible downtime as I calibrate server capacity and GPU allocations depending on how busy the website gets.)" (Tweet). Archived from the original on October 4, 2025. Retrieved May 18, 2025 – via Twitter.

Videos

^ Yea, Yong (January 14, 2022). "Troy Baker Faces Mass Backlash For Supporting Shady AI Voice NFTs With Company That Has Stolen Work". YouTube. Event occurs at 15:54–16:13. Archived from the original on December 20, 2024. Retrieved March 23, 2025. This isn't just one of those things [Voiceverse] can go 'Whoopsies!' on. [They] plagiarized somebody else's work and used that as a means to falsely market the quality of [their] own products, by using somebody else's higher quality voice AI to promote [Voiceverse] for [their] own benefit.
^ ^a ^b ^c The VŌC Podcast // John Patrick Lowrie & Ellen McLain Interview (The voices of GLaDOS and Sniper) (Podcast). The VŌC Podcast. April 11, 2021. Event occurs at 0:51:50–1:01:25. Archived from the original on August 20, 2025. Retrieved January 15, 2025.
^ Vetterlein, Nathan (January 10, 2021). "Nate listens to his AI self". Twitch. Retrieved January 21, 2025.

Works cited

Abisola, Shojobi (January 3, 2025). "The MIT Project That Paved Way For Modern Voice AI". Independent. Archived from the original on February 27, 2025. Retrieved February 27, 2025.
Aktaş, Utku (January 19, 2022). "Troy Baker-backed NFT firm admitted using voice lines from another service without permission". Mobidictum. Archived from the original on June 14, 2024. Retrieved March 1, 2025.
"Voiceverse NFT caught plagiarising voice lines from AI service 15.ai". AI, Algorithmic, and Automation Incidents and Controversies. January 2022. Archived from the original on October 4, 2025. Retrieved October 3, 2025.
Archer, Helder (January 24, 2022). "Grupo NFT do ator de voz de The Last of Us apanhado a roubar vozes de outro serviço" [The Last of Us Voice Actor NFT Group Caught Stealing Voices from Another Service]. OtakuPT (in Portuguese). Archived from the original on January 24, 2022. Retrieved September 13, 2025.
Barakat, Huda; Turk, Oytun; Demiroglu, Cenk (2024). "Deep learning-based expressive speech synthesis: a systematic review of approaches, challenges, and resources". EURASIP Journal on Audio, Speech, and Music Processing. 2024 (11) 11. doi:10.1186/s13636-024-00329-7. S2CID 267652761.
Baylos, Ramón (January 17, 2022). "La compañía de NFTs que se alió con el actor de voz de Joel de The Last of Us la ha liado bastante parda" [NFT Company Partnered with Voice Actor of Joel From The Last of Us Has Really Messed Up]. Sport.es (in Spanish). Archived from the original on January 23, 2025. Retrieved March 25, 2025.
Button, Chris (January 19, 2021). "Make GLaDOS, SpongeBob and other friends say what you want with this AI text-to-speech tool". Byteside. Archived from the original on June 25, 2024. Retrieved December 18, 2024.
Cabibi-Wilkin, Lily (January 26, 2022). "NFTs Are Bad. So Why Do People Keep Making Them?" (PDF). The Herald. Jonesboro, Arkansas: Arkansas State University. p. 2A. Archived (PDF) from the original on July 6, 2024. Retrieved March 4, 2025.
Carcasole, David (January 17, 2022). "Troy Baker's NFT Partner Company Caught Claiming Voice Lines From Another Service As Their Own". PlayStation Universe. Archived from the original on October 6, 2022. Retrieved February 28, 2025.
Chandraseta, Rionaldi (January 21, 2021). "Generate Your Favourite Characters' Voice Lines using Machine Learning". Towards Data Science. Archived from the original on January 21, 2021. Retrieved December 18, 2024.
Clayton, Natalie (January 19, 2021). "Make the cast of TF2 recite old memes with this AI text-to-speech tool". PC Gamer. Archived from the original on January 19, 2021. Retrieved December 18, 2024.
Cocomello, Marco (January 20, 2021). "Make Portal's GLaDOS and Other Characters Say Whatever You Want with This New App". GLITCHED. Archived from the original on March 12, 2025. Retrieved March 10, 2025.
"CNN Newsroom". CNN. January 15, 2021. Archived from the original on August 21, 2025.
do Prado, Renan (January 19, 2021). "Faça GLaDOS, Bob Esponja e outros personagens falarem textos escritos por você!" [Make GLaDOS, SpongeBob and Other Characters Speak Texts Written by You!]. Arkade (in Brazilian Portuguese). Archived from the original on August 19, 2022. Retrieved December 22, 2024.
"Can I publish the content I generate on the platform?". ElevenLabs (Official website). 2024b. Archived from the original on December 23, 2024. Retrieved December 23, 2024.
Enriquez, XC (January 16, 2022). "Voice Actor for Joel Receives Backlash after NFT Tweet". ClutchPoints. Archived from the original on March 23, 2025. Retrieved March 23, 2025.
"15.ai已经重新上线，版本更新至v23" [15.ai Back Online, Updated to Version 23]. EquestriaCN (in Chinese). October 1, 2021. Archived from the original on May 19, 2024. Retrieved December 22, 2024.
Feng, Bai (March 15, 2020). "模型参数过亿跑不动？看MIT小哥，少量数据完成高质量文本转语音！" [Struggling with Models with Over 100 Million Parameters? See How an MIT Student Achieved High‑Quality Text‑to‑Speech with Minimal Data!]. QQ News (in Chinese). XinZhiYuan. Archived from the original on February 27, 2025. Retrieved February 22, 2025.
Furushima, Takayuki (January 18, 2021). "『Portal』のGLaDOSや『UNDERTALE』のサンズがテキストを読み上げてくれる。文章に込められた感情まで再現することを目指すサービス「15.ai」が話題に" [Portal's GLaDOS and Undertale's Sans Will Read Text for You — AI Service 15.ai Aims to Reproduce Even Emotions in Text, Becomes a Hot Topic]. Den Fami Nico Gamer (in Japanese). Archived from the original on January 18, 2021. Retrieved December 18, 2024.
"这个网站可用AI生成语音让ACG角色"说"出你输入的文本" [This Website Can Use AI to Generate Voice, Making ACG Characters "Say" the Text You Input]. GamerSky (in Chinese). January 18, 2021. Archived from the original on December 11, 2024. Retrieved December 18, 2024.
"Audio samples from "Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis"". August 30, 2018. Archived from the original on November 11, 2020. Retrieved June 5, 2022.
Granger (January 31, 2022). "Трой Бейкер отказался от партнёрства с NFT-платформой Voiceverse и извинился за резкое высказывание в поддержку токенов" [Troy Baker Ends Partnership with NFT Platform Voiceverse and Apologises for Pointed Comments in Support of the NFTs]. DTF (in Russian). Archived from the original on March 23, 2025. Retrieved September 23, 2025.
Groth-Anderson, Magnus (January 19, 2022). "Troy Baker-støttet NFT-virksomhed indrømmer at have stjålet indhold" [Troy Baker‑Backed NFT Company Admits to Stealing Content]. Gamereactor (in Danish). Archived from the original on March 1, 2025. Retrieved March 1, 2025.
"15.ai". Hacker News. June 12, 2022. Archived from the original on April 9, 2023. Retrieved December 29, 2024.
Henry, Joseph (January 18, 2022). "Troy Baker's Partner NFT Company Voiceverse Reportedly Steals Voice Lines From 15.ai". Tech Times. Archived from the original on January 18, 2022. Retrieved October 1, 2025.
Ibáñez, Álvaro (June 15, 2022). "Un algoritmo que convierte texto a voz "con emoción y sentimiento" e imita a personajes y voces conocidas" [Algorithm Converts Text to Speech 'with Emotion and Feeling' and Imitates Well‑Known Characters and Voices]. Microsiervos (in Spanish). Archived from the original on December 12, 2024. Retrieved March 23, 2025.
Ifram, Lauda (January 18, 2022). "Proyek NFT Troy Baker Ketahuan Mencuri Aset Suara AI Tanpa Seizin Pemiliknya" [Troy Baker's NFT Project Caught Stealing AI Voice Assets Without Their Owners' Permission]. Gamebrott (in Indonesian). Archived from the original on May 26, 2022. Retrieved September 12, 2025.
Innes, Ruby (January 18, 2022). "Voiceverse Is The Latest NFT Company Caught Using Someone Else's Content". Kotaku Australia. Archived from the original on July 26, 2024. Retrieved February 28, 2025.
Irpan, Alex (July 21, 2025). "Brony Musicians Seize The Means of Production: My Eyewitness Account to BABSCon 2025". Alex Irpan. Archived from the original on July 30, 2025. Retrieved October 1, 2025.
Ji, Yunyo (January 19, 2021). "这个国外的语音合成网站，可以让玩家操控二次元角色说话" [Overseas Voice Synthesis Site Lets Gamers Make ACG Characters Talk]. 163.com (in Chinese). NetEase News. Archived from the original on February 27, 2025. Retrieved February 26, 2025.
Khibchenko, Pavel (February 15, 2022). "NFT в геймдеве: проблемы регулирования, гнев игроков и поспешные решения разработчиков" [NFTs in Game Development: Regulatory Problems, Player Anger, and Developers' Hasty Decisions]. Skillbox (in Russian). Archived from the original on June 5, 2023. Retrieved February 28, 2025.
Kim, Jaehyeon; Kim, Sungwon; Kong, Jungil; Yoon, Sungroh (2020). "Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search". In Larochelle, Hugo; Ranzato, Marc'Aurelio; Hadsell, Raia; Balcan, Maria-Florina; Lin, Hsuan-Tien (eds.). Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6–12, 2020, virtual. arXiv:2005.11129. S2CID 218862956.
Kong, Jungil; Kim, Jaehyeon; Bae, Jaekyoung (2020). "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis". In Larochelle, Hugo; Ranzato, Marc'Aurelio; Hadsell, Raia; Balcan, Maria-Florina; Lin, Hsuan-Tien (eds.). Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6–12, 2020, virtual. arXiv:2010.05646. S2CID 222291664.
Kuchkanov, Phil (January 15, 2022). ""NFT-штука Троя Бейкера ворует чужие работы". В сети уничтожают известного актера озвучки" ['Troy Baker's NFT Thing Steals Other People's Work.' The Internet Is Destroying the Famous Voice Actor]. GameGuru.ru. Archived from the original on January 15, 2022. Retrieved March 23, 2025.
Kurosawa, Yuki (January 19, 2021). "ゲームキャラ音声読み上げソフト「15.ai」公開中。『Undertale』や『Portal』のキャラに好きなセリフを言ってもらえる" [Game Character Text-to-Speech Tool 15.ai Now Available. Get Characters from Undertale and Portal to Say Your Desired Lines]. AUTOMATON (in Japanese). Archived from the original on January 19, 2021. Retrieved December 18, 2024.
Lamorlette, Robin (January 25, 2021). "Insolite : un site permet de faire dire ce que vous souhaitez à GlaDOS (et à d'autres personnages de jeux vidéo)" [Unusual: A Site Lets You Make GLaDOS (and Other Video Game Characters) Say Whatever You Want]. Clubic (in French). Archived from the original on January 19, 2025. Retrieved March 23, 2025.
Lawrence, Briana (January 19, 2022). "Shonen Jump Scare Leads to Company Reassuring Fans That They Aren't Getting Into NFTs". The Mary Sue. Archived from the original on January 13, 2025. Retrieved December 23, 2024.
Li, Yongqiang (January 22, 2021). "语音开源项目优选：免费配音网站15.ai" [Top Open‑Source Voice Project: Free Text‑to‑Speech Site 15.ai]. Zhihu (in Chinese). Archived from the original on December 19, 2024. Retrieved December 18, 2024.
Lopez, Ule (January 16, 2022). "Voiceverse NFT Service Reportedly Uses Stolen Technology from 15ai [UPDATE]". Wccftech. Archived from the original on January 16, 2022. Retrieved June 7, 2022.
Mahendra, Parth (May 11, 2020). "Spongebob Can Now Narrate Your Writing". AI Daily. Archived from the original on July 1, 2021. Retrieved March 4, 2025.
Morton, Lauren (January 18, 2021). "Put words in game characters' mouths with this fascinating text to speech tool". Rock, Paper, Shotgun. Archived from the original on January 18, 2021. Retrieved December 18, 2024.
Moto, Eugenio (January 20, 2021). "15.ai, el sitio que te permite usar voces de personajes populares para que digan lo que quieras" [15.ai, the Site That Lets You Use Popular Characters' Voices to Say Whatever You Want]. Qore (in Spanish). Archived from the original on December 28, 2024. Retrieved December 21, 2024.
MrSun (January 19, 2021). "讓你喜愛的ACG角色說出任何話！ AI生成技術幫助你實現夢想" [Make Your Favorite ACG Characters Say Anything! AI Generation Technology Helps You Realize Your Dreams]. Yahoo! News Taiwan (in Chinese). Archived from the original on December 28, 2024. Retrieved December 22, 2024.
"Troy Bakerin tukema NFT-yhtiö kärähti – Kaupitteli ääninäyttelyä luvatta" [Troy Baker‑Backed NFT Company Exposed – Sold Voice Acting Without Permission]. Muropaketti (in Finnish). January 17, 2022. Archived from the original on May 25, 2022. Retrieved March 1, 2025.
Myrén, Jonny (January 18, 2022). "NFT-företaget som Troy Baker marknadsför tog ljudklipp från annan tjänst" [NFT Company Promoted by Troy Baker Took Audio Clips from Another Service]. FZ (in Swedish). Archived from the original on January 1, 2024. Retrieved March 24, 2025.
Ng, Andrew (April 1, 2020). "Voice Cloning for the Masses". DeepLearning.AI. Archived from the original on December 28, 2024. Retrieved December 22, 2024.
"Navigating the Challenges and Opportunities of Synthetic Voices". OpenAI. March 9, 2024. Archived from the original on November 25, 2024. Retrieved December 18, 2024.
Osman, Mohamed (2022). "Emo-TTS: Parallel Transformer-based Text-to-Speech Model with Emotional Awareness". IEEE. pp. 169–174. doi:10.1109/ICCI54321.2022.9756092. ISBN 978-1-6654-9973-6. S2CID 248256259.
Paltridge, Peter (January 18, 2021). "This Website Will Say Whatever You Type In Spongebob's Voice". Anime Superhero News. Archived from the original on October 17, 2021. Retrieved December 22, 2024.
Parker, Jordan (February 5, 2022). "Like them or not, NFTs are here to stay". The Journal. Webster University. Archived from the original on November 8, 2024. Retrieved March 23, 2025.
Phillips, Tom (January 17, 2022). "Troy Baker-backed NFT firm admits using voice lines taken from another service without permission". Eurogamer. Archived from the original on January 17, 2022. Retrieved December 31, 2024.
Piletsky, Boris (January 15, 2022). "Создателей NFT-голосов, которых поддержал Трой Бейкер, уличили в краже голосов в тот же день" [Creators of Voice NFTs Backed by Troy Baker Caught Stealing Voices The Very Same Day]. iXBT Games. Archived from the original on September 24, 2023. Retrieved March 23, 2025.
"Everything You Need to Know About 15.ai: The AI Voice Generator". Play.ht. September 12, 2024. Archived from the original on December 25, 2024. Retrieved December 18, 2024.
Rosas, Víctor (January 17, 2022). "¡La decepción, hermano! Proyecto NFT apoyado por Troy Baker usó tecnología ajena" [Bro, What a Disappointment! Troy Baker-Backed NFT Project Used Third-Party Technology]. LevelUp.com (in Spanish). Yahoo! Finance. Archived from the original on February 3, 2022. Retrieved September 12, 2025.
Ruppert, Liana (January 18, 2021). "Make Portal's GLaDOS And Other Beloved Characters Say The Weirdest Things with This App". Game Informer. Archived from the original on January 18, 2021. Retrieved December 18, 2024.
Scotellaro, Shaun (October 5, 2020). "15.ai Adds Tons of New Pony Voices". Equestria Daily. Archived from the original on December 26, 2024. Retrieved December 21, 2024.
Scotellaro, Shaun (May 19, 2025). "15.ai Returns with Pony Voice Creation as the Focus". Equestria Daily. Archived from the original on August 22, 2025. Retrieved May 19, 2025.
Skorich, Lina (January 15, 2022). "Трою Бейкеру пришлось извиняться за решение сотрудничать с NFT-компанией" [Troy Baker Forced to Apologize for Decision to Partner with NFT Company]. StopGame. Archived from the original on October 2, 2022. Retrieved March 23, 2025.
Sonali, Pednekar (January 14, 2022). Lam, Khoa (ed.). "Incident 277: Voices Created Using Publicly Available App Stolen and Resold as NFT without Attribution". AI Incident Database. Archived from the original on January 13, 2025. Retrieved October 4, 2025.
Staniszewski, Mati (2024). "15.AI: Everything You Need to Know & Best Alternatives". ElevenLabs (Official website). Archived from the original on December 25, 2024. Retrieved December 18, 2024.
Temitope, Yusuf (December 10, 2024). "15.ai Creator reveals journey from MIT Project to internet phenomenon". The Guardian. Archived from the original on December 28, 2024. Retrieved December 25, 2024.
Toh, Brandon (January 18, 2022). "Troy Baker's NFT Partner Company Voiceverse Caught Using Voice Lines From Another Service Without Permission". Geek Culture. Archived from the original on November 30, 2022. Retrieved February 28, 2025.
遊戲, 遊戲角落 (January 20, 2021). "這個AI語音可以模仿《傳送門》GLaDOS講出任何對白！連《Undertale》都可以學" [This AI Voice Can Imitate Portal's GLaDOS Saying Any Dialog! It Can Even Learn Undertale]. United Daily News (in Chinese (Taiwan)). Archived from the original on December 19, 2024. Retrieved December 18, 2024.
Villalobos, José (January 18, 2021). "Descubre 15.AI, un sitio web en el que podrás hacer que GlaDOS diga lo que quieras" [Discover 15.AI, a Website Where You Can Make GlaDOS Say What You Want]. LaPS4 (in Spanish). Archived from the original on January 18, 2021. Retrieved January 18, 2021.
Anirudh VK (March 18, 2023). "Deepfakes Are Elevating Meme Culture, But At What Cost?". Analytics India Magazine. Archived from the original on December 26, 2024. Retrieved December 18, 2024.
W-K, Edie (January 15, 2022). "Troy Baker angers the internet with NFT partnership". Checkpoint Gaming. Archived from the original on December 12, 2024. Retrieved February 28, 2025.
Weitzman, Cliff (November 19, 2023). "15.ai: All about 15.ai and the best alternative". Speechify. Archived from the original on December 25, 2024. Retrieved December 31, 2024.
White, Molly (January 14, 2022). "Voice actor Troy Baker announces his involvement in "voice NFT" project Voiceverse with an antagonistic tweet, shortly before it's revealed that the project stole work". Web3 Is Going Just Great. Archived from the original on July 24, 2024. Retrieved February 28, 2025.
Williams, Demi (January 18, 2022). "Voiceverse NFT admits to taking voice lines from non-commercial service". NME. Archived from the original on January 18, 2022. Retrieved December 18, 2024.
Wright, Steve (January 17, 2022). "Troy Baker-backed NFT company admits to using content without permission". Stevivor. Archived from the original on January 17, 2022. Retrieved December 18, 2024.
Wright, Steven (March 21, 2023). "Why Biden, Trump, and Obama Arguing Over Video Games Is YouTube's New Obsession". Inverse. Archived from the original on December 20, 2024. Retrieved December 18, 2024.
Zwiezen, Zack (January 18, 2021). "Website Lets You Make GLaDOS Say Whatever You Want". Kotaku. Archived from the original on January 17, 2021. Retrieved December 18, 2024.
Коэн (January 15, 2022). "Создателей голосовых NFT, поддерживаемых Троем Бейкером, обвинили в воровстве голоса" [Creators of Voice NFTs Backed by Troy Baker Accused of Voice Theft]. Shazoo (in Russian). Archived from the original on January 23, 2022. Retrieved February 28, 2025.

External links

[memes-11] Attributed to multiple references: Rock Paper Shotgun,^[1] Clubic,^[2] GLITCHED,^[3] United Daily News,^[4] Analytics India Magazine,^[5] Inverse,^[6] Speechify,^[7] The Guardian,^[8] Independent,^[9] and Alex Irpan.^[10]

[sold-12] Attributed to multiple references: AI Incident Database,^[23] AI, Algorithmic, and Automation Incidents and Controversies,^[24] Gamereactor,^[16] The Journal,^[25] Eurogamer,^[13] and GameGuru.^[34]

[stolen-40] Attributed to multiple references: The Mary Sue,^[11] Kotaku Australia,^[12] Eurogamer,^[13] NME,^[14] Muropaketti,^[15] Gamereactor,^[16] Web3 Is Going Just Great,^[17] StopGame,^[18] iXBT Games,^[19] DTF,^[20] Sport.es,^[21] FZ,^[22] AI Incident Database,^[23] AI, Algorithmic, and Automation Incidents and Controversies,^[24] The Journal,^[25] LevelUp,^[26] Stevivor,^[27] PlayStation Universe,^[28] Checkpoint Gaming,^[29] Tech Times,^[30] Mobidictum,^[31] OtakuPT,^[32] Gamebrott,^[33] GameGuru,^[34] Shazoo,^[35] Geek Culture,^[36] and Lehrman v. LOVO.^[37]

[124] which uses "11.ai" as a legal byname for its web domain^[115]

[1] Morton 2021: "Machine learning is absolutely fascinating and yet I mostly just enjoy when people use impressive tech to create weird skits and memes. That's exactly what everyone appears to be doing with an extremely impressive machine-learning tool that lets you type in text for various characters to say out loud. [...] It's made possible by the text to speech algorithm 15.ai that studies clips of characters and uses deep-learning to make those characters say whatever the heck you want."

[2] Lamorlette 2021: "Vous avez toujours rêvé de faire dire n'importe quoi à vos personnages de jeux vidéo préférés (ou détestés) ? Le site 15.ai l'a fait !" (transl. "Have you always dreamed of making your favorite (or hated) video game characters say anything? 15.ai has finally made it happen!")

[3] Cocomello 2021: "However, back then if you wanted to create your own dialogue, it required layers of sound enhancements and tweaks. Thankfully, the world has evolved and now thanks to the 15.ai app, we can make [...] popular characters say whatever we want"

[4] MrSun 2021: "大家是否都曾經想像過，假如能讓自己喜歡的遊戲或是動畫角色說出自己想聽的話，不論是名字、惡搞或是經典名言，都是不少人的夢想吧。不過來到 2021 年，現在這種夢想不再是想想而已，因為有一個網站通過 AI 生成的技術" (transl. "Have you ever imagined what it would be like if your favorite game or anime characters could say exactly what you want to hear? Whether it's names, parodies, or classic quotes, this is a dream for many. However, as we enter 2021, this dream is no longer just a fantasy, because there is a website that uses AI-generated technology,")

[5] Anirudh VK 2023: "While AI voice memes have been around in some form since '15.ai' launched in 2020, [...]"

[6] Wright 2023: "AI voice tools used to create "audio deepfakes" have existed for years in one form or another, with 15.ai being a notable example."

[7] Weitzman 2023: "It gained popularity because it was the first AI voice platform that featured an assortment of fictional characters from a variety of media sources"

[8] Temitope 2024: "During this period, 15.ai earned credit for single-handedly popularizing AI voice cloning—often described as 'audio deepfakes'—in memes, viral content, and fan-driven media."

[9] Abisola 2025: "Many credit 15.ai as the first mainstream text-to-speech platform that truly made 'audio deepfakes' go viral,"

[10] Irpan 2025: "multiple startups credit 15.ai for creating the market they now compete in."

[FOOTNOTELawrence2022-13] Lawrence 2022.

[FOOTNOTEInnes2022-14] Innes 2022.

[FOOTNOTEPhillips2022-15] Phillips 2022.

[FOOTNOTEWilliams2022-16] Williams 2022.

[FOOTNOTEMuropaketti2022-17] Muropaketti 2022.

[FOOTNOTEGroth-Anderson2022-18] Groth-Anderson 2022.

[FOOTNOTEWhite2022-19] White 2022.

[FOOTNOTESkorich2022-20] Skorich 2022.

[FOOTNOTEPiletsky2022-21] Piletsky 2022.

[FOOTNOTEGranger2022-22] Granger 2022.

[FOOTNOTEBaylos2022-23] Baylos 2022.

[FOOTNOTEMyrén2022-24] Myrén 2022.

[FOOTNOTESonali2022-25] Sonali 2022.

[FOOTNOTEAIAAIC2022-26] AIAAIC 2022.

[FOOTNOTEParker2022-27] Parker 2022.

[FOOTNOTERosas2022-28] Rosas 2022.

[FOOTNOTEWright2022-29] Wright 2022.

[FOOTNOTECarcasole2022-30] Carcasole 2022.

[FOOTNOTEW-K2022-31] W-K 2022.

[FOOTNOTEHenry2022-32] Henry 2022.

[FOOTNOTEAktaş2022-33] Aktaş 2022.

[FOOTNOTEArcher2022-34] Archer 2022.

[FOOTNOTEIfram2022-35] Ifram 2022.

[FOOTNOTEKuchkanov2022-36] Kuchkanov 2022.

[FOOTNOTEКоэн2022-37] Коэн 2022.

[FOOTNOTEToh2022-38] Toh 2022.

[courtcase-39] Paul Lehrman and Linnea Sage, et al. v. LOVO, Inc., No. 1:24-cv-03770, 38 (S.D.N.Y. 2024) ("Separately, VoiceVerse has already been found to have stolen technology from another company. See Ule Lopez, WCCF Tech, "Voiceverse NFT Service Reportedly Uses Stolen Technology from 15ai," (Jan. 16, 2022), https://wccftech.com/voiceverse-nft-service-usesstolen-technology-from-15ai/."), archived from the original on 2025-10-04.

[hn-41] Hacker News 2022

[FOOTNOTEBarakatTurkDemiroglu2024-42] Barakat, Turk & Demiroglu 2024.

[Google-43] Google 2018

[FOOTNOTEKongKimBae2020-44] Kong, Kim & Bae 2020.

[FOOTNOTEKimKimKongYoon2020-45] Kim et al. 2020.

[FOOTNOTETemitope2024-46] Temitope 2024.

[examples-47] "Examples". May 15, 2025. Retrieved May 16, 2025.

[FOOTNOTEAbisola2025-48] Abisola 2025.

[FOOTNOTEChandraseta2021Li2021Temitope2024-49] Chandraseta 2021; Li 2021; Temitope 2024.

[FOOTNOTEPhillips2022Temitope2024Abisola2025-50] Phillips 2022; Temitope 2024; Abisola 2025.

[FOOTNOTENg2020-52] Ng 2020.

[FOOTNOTEKurosawa2021-53] Kurosawa 2021.

[54] "About". 15.ai (Official website). March 2, 2020. Archived from the original on March 3, 2020. Retrieved December 23, 2024.

[FOOTNOTEKurosawa2021Temitope2024Abisola2025-55] Kurosawa 2021; Temitope 2024; Abisola 2025.

[FOOTNOTEMahendra2020-56] Mahendra 2020.

[FOOTNOTERuppert2021Abisola2025-57] Ruppert 2021; Abisola 2025.

[FOOTNOTETemitope2024Abisola2025-58] Temitope 2024; Abisola 2025.

[FOOTNOTELopez2022Innes2022Piletsky2022-59] Lopez 2022; Innes 2022; Piletsky 2022.

[FOOTNOTEPhillips2022Innes2022Toh2022W-K2022Muropaketti2022Ifram2022Williams2022AIAAIC2022-60] Phillips 2022; Innes 2022; Toh 2022; W-K 2022; Muropaketti 2022; Ifram 2022; Williams 2022; AIAAIC 2022.

[FOOTNOTEInnes2022Коэн2022Myrén2022AIAAIC2022-61] Innes 2022; Коэн 2022; Myrén 2022; AIAAIC 2022.

[FOOTNOTEPhillips2022Myrén2022AIAAIC2022-62] Phillips 2022; Myrén 2022; AIAAIC 2022.

[FOOTNOTEWright2022Groth-Anderson2022Myrén2022Archer2022Williams2022-63] Wright 2022; Groth-Anderson 2022; Myrén 2022; Archer 2022; Williams 2022.

[FOOTNOTEInnes2022Enriquez2022-64] Innes 2022; Enriquez 2022.

[FOOTNOTEКоэн2022Baylos2022-65] Коэн 2022; Baylos 2022.

[FOOTNOTEKhibchenko2022-66] Khibchenko 2022.

[FOOTNOTELawrence2022Innes2022White2022W-K2022-68] Lawrence 2022; Innes 2022; White 2022; W-K 2022.

[FOOTNOTEWright2022White2022Carcasole2022Enriquez2022-69] Wright 2022; White 2022; Carcasole 2022; Enriquez 2022.

[FOOTNOTEParker2022Granger2022-70] Parker 2022; Granger 2022.

[FOOTNOTEStaniszewski2024Temitope2024-71] Staniszewski 2024; Temitope 2024.

[FOOTNOTETemitope2024Irpan2025-72] Temitope 2024; Irpan 2025.

[tweet-73] "FAQ". 15.dev. May 18, 2025. Archived from the original on October 1, 2025. Retrieved May 18, 2025.

[FOOTNOTEScotellaro2025-75] Scotellaro 2025.

[FOOTNOTEWilliams2022Wright2022Innes2022-76] Williams 2022; Wright 2022; Innes 2022.

[FOOTNOTEIbáñez2022-77] Ibáñez 2022.

[FOOTNOTEZwiezen2021Clayton2021Morton2021Ruppert2021Villalobos2021Furushima2021Kurosawa2021Cocomello2021Lamorlette2021Phillips2022-78] Zwiezen 2021; Clayton 2021; Morton 2021; Ruppert 2021; Villalobos 2021; Furushima 2021; Kurosawa 2021; Cocomello 2021; Lamorlette 2021; Phillips 2022.

[FOOTNOTEMorton2021遊戲2021-79] Morton 2021; 遊戲 2021.

[FOOTNOTE遊戲2021-80] 遊戲 2021.

[FOOTNOTEIbáñez2022Abisola2025-81] Ibáñez 2022; Abisola 2025.

[FOOTNOTEFeng2020-82] Feng 2020.

[FOOTNOTEChandraseta2021-83] Chandraseta 2021.

[FOOTNOTEChandraseta2021Lamorlette2021-84] Chandraseta 2021; Lamorlette 2021.

[FOOTNOTEChandraseta2021Temitope2024-85] Chandraseta 2021; Temitope 2024.

[86] Chandraseta 2021: "By adding a '|' after the original sentence and providing an extra sentence, we could control what emotion the original sentence will be spoken with. In other words, 'text_1|text_2' will produce a voice line of text_1 with the emotion of text_2."

[87] Chandraseta 2021: "because it could force the bot into generating previously unknown data, such as saying 'Today is a great day' with a sad or angry emotion"

[FOOTNOTECocomello2021Ruppert2021-88] Cocomello 2021; Ruppert 2021.

[FOOTNOTEKurosawa2021Temitope2024-89] Kurosawa 2021; Temitope 2024.

[FOOTNOTEClayton2021Ruppert2021Villalobos2021Cocomello2021-90] Clayton 2021; Ruppert 2021; Villalobos 2021; Cocomello 2021.

[FOOTNOTEClayton2021-91] Clayton 2021.

[FOOTNOTEZwiezen2021-92] Zwiezen 2021.

[93] 遊戲 2021: "目前「15.ai」的網頁上，提供了不少的音源，[...]除了《傳送門》之外，15.ai 網站目前也支援了許多來自遊戲、電影或動畫中的人物語音，" (transl. "Currently, the "15.ai" website provides a lot of audio sources. [...] In addition to "Portal", the 15.ai website currently also supports voices for many characters from games, movies or animations.")

[94] MrSun 2021: "的 GLaDOS 也能完美的唸出任何台詞。當然網站也補充目前還有很多不完美的地方，像是字數限制、語氣控制在某些話上還是略有怪異，但只要肯花時間，也能像是其他網友一樣，通過剪輯來完成有趣的創作，" (transl. "Even GLaDOS in "Portal" can perfectly recite any lines. Of course, the website also added that there are still many imperfections, such as word limit and tone control, which are still a bit weird in some words, but as long as you are willing to spend time, you can also complete interesting creations through editing like other netizens.")

[FOOTNOTEButton2021-95] Button 2021.

[96] Lamorlette 2021: "On peut donc retrouver sur ces réseaux de nombreux exemples de ce que peut donner le mélange entre un esprit créatif et une technologie aussi efficace que diablement amusante." (transl. "These social networks are therefore full of examples of what can be achieved by combining a creative mind with technology that is as effective as it is devilishly fun.")

[FOOTNOTEScotellaro2020-97] Scotellaro 2020.

[FOOTNOTEwww.equestriacn.com2021-98] www.equestriacn.com 2021.

[FOOTNOTEPaltridge2021-99] Paltridge 2021.

[FOOTNOTEMorton2021-100] Morton 2021.

[101] Moto 2021: "Incluso, los más clavados pueden cambiar algunos parámetros como la intencionalidad o el tono." (transl. "Even the most dedicated users can change some parameters, such as the intent or tone.")

[102] 
Furushima 2021: 日本語入力には対応していないが、ローマ字入力でもなんとなくそれっぽい発音になる。; 15.aiはテキスト読み上げサービスだが、特筆すべきはそのなめらかな発音と、ゲームに登場するキャラクター音声を再現している点だ。 (transl. It does not support Japanese input, but even if you input using romaji, it will somehow give you a similar pronunciation.; 15.ai is a text-to-speech service, but what makes it particularly noteworthy is its smooth pronunciation and the fact that it reproduces the voices of characters that appear in games.)

Kurosawa 2021: "もうひとつ15.aiの大きな特徴として挙げられるのが、豊かな感情表現だ" (transl. "Another major feature of 15.ai is its rich emotional expression.")

Kurosawa 2021: "英語版ボイスのみなので注意" (transl. "Please note that this is an English voice only version.")

[103] 
do Prado 2021: "Obviamente o programa funciona no idioma inglês, mas dá pra gerar umas frases bem emboladas e engraças em português, estilo aqueles memes usando vozes em outros idiomas falando em português." (transl. "Obviously, the program works in English, but you can generate some really confusing and funny sentences in Portuguese, like those memes using voices in other languages speaking Portuguese.")

Villalobos 2021: "En este sentido, en las últimas horas se ha hecho popular un sitio web que emula la voz de GlaDOS para que diga todas las palabras que quieras, siempre y cuando estén en inglés, aunque puedes escribir algo en español e intentará pronunciarlo, pero no lo hará correctamente." (transl. "In this sense, in recent hours a website has become popular that emulates the voice of GlaDOS so that it says all the words you want, as long as they are in English, although you can write something in Spanish and it will try to pronounce it, but it will not do it correctly.")

[104] 
GamerSky 2021: "虽然AI的声音缺少了些抑扬顿挫，不过效果也还算有趣。" (transl. "Although the AI's voice lacks some intonation, the effect is still interesting.")

GamerSky 2021: "目前15.ai提供的角色选项较少，由于文本的字数限制，生成的语音也相对较短" (transl. "Currently, 15.ai provides relatively few character options, and due to the word limit of the text, the generated voice is relatively short.")

[105] 
Li 2021: "完美保留了发音人的韵律和特色，" (transl. "perfectly preserves the rhythm and characteristics of the speaker,")

Li 2021: "该网站的访问量为在线任务差不多5000以上，而且目前完全免费，" (transl. "The number of requests to the website is more than 5,000 tasks, and it is still currently completely free.")

[FOOTNOTECocomello2021-106] Cocomello 2021.

[107] Ibáñez 2022: "Personalmente encontré interesantes las pausas y el ritmo y que ciertamente se nota que según el contenido del texto se «interpreta» el resultado según lo que se intenta transmitir." (transl. "Personally, I found the pauses and rhythm interesting, and it is certainly noticeable that, depending on the content of the text, the result is 'interpreted' according to what it is trying to convey.")

[FOOTNOTEIrpan2025-108] Irpan 2025.

[109] 
Feng 2020: "该工具生成的音频文件的采样率为 44100 Hz，而大多数基于深度学习的文本转语音实现，所使用的采样率为16,000 Hz。所以用它产生的音频，声谱会更详细（更高质量的音频），同时缺陷也更明显。" (transl. "The audio files generated by this tool have a sampling rate of 44100 Hz, while most deep learning-based text-to-speech implementations use a sampling rate of 16,000 Hz. Therefore, the audio generated by it will have a more detailed sound spectrum (higher quality audio), but the defects will be more obvious.")

Feng 2020: "当然在这么小的语料上训练的模型也是有缺陷的，有些单词可能发音不准确，其实这也很好理解，即使是人，在遇到生词的时候也不一定能准确发音，而传统的深度模型通常有 40 个小时或者更多的语料，所以错误率会低一些。" (transl. "Of course, the model trained on such a small corpus is also flawed, and some words may not be pronounced correctly. In fact, this is easy to understand. Even humans may not be able to pronounce new words accurately when they encounter them. Traditional deep models usually have 40 hours or more of corpus, so the error rate will be lower.")

[110] Ji 2021: "但是由于情绪表现只能联系上下文进行自动识别，导致这些语音在情感表达上比较"中庸"，一些"极端"的情绪无法通过语音合成正常表达，[...]距离其被正式用于某些NSFW的同人作品，还有很长的路要走。" (transl. "the emotional expression can only be automatically recognized in the context, which makes these voices relatively "neutral" in emotional expression. Some "extreme" emotions cannot be expressed normally through voice synthesis. [...] it still has a long way to go before it can be officially used in some NSFW fan works.")

[111] Ji 2021: "网友在油管上看到的许多"深度伪造"视频，都依赖视频创作者从原本数小时的数据资料里进行提取编辑，最终才能制作非常简短的内容，并且呈现效果还很一般。而15.ai的开发者表示，自己的这项技术可以轻松实现那些视频效果（事实上15.ai的许多角色进行深度学习的数据时长只有几十分钟）。" (transl. "Many of the "deep fake" videos that netizens see on YouTube rely on video creators to extract and edit hours of data to produce very short content, and the presentation effect is still very average. The developers of 15.ai said that their technology can easily achieve those video effects (in fact, the data for deep learning of many characters of 15.ai is only tens of minutes long).")

[FOOTNOTEParker2022Temitope2024-114] Parker 2022; Temitope 2024.

[FOOTNOTEInnes2022W-K2022Ng2020Parker2022Skorich2022Piletsky2022Enriquez2022-115] Innes 2022; W-K 2022; Ng 2020; Parker 2022; Skorich 2022; Piletsky 2022; Enriquez 2022.

[FOOTNOTERuppert2021Morton2021-116] Ruppert 2021; Morton 2021.

[FOOTNOTE遊戲2021Kurosawa2021Morton2021Temitope2024-117] 遊戲 2021; Kurosawa 2021; Morton 2021; Temitope 2024.

[FOOTNOTEZwiezen2021Ruppert2021Kurosawa2021Abisola2025-118] Zwiezen 2021; Ruppert 2021; Kurosawa 2021; Abisola 2025.

[FOOTNOTERuppert2021-119] Ruppert 2021.

[FOOTNOTEZwiezen2021Morton2021-120] Zwiezen 2021; Morton 2021.

[FOOTNOTEClayton2021CNN2021-121] Clayton 2021; CNN 2021.

[FOOTNOTEFurushima2021-122] Furushima 2021.

[FOOTNOTEElevenLabs2024b-123] ElevenLabs 2024b.

[FOOTNOTEStaniszewski2024Play.ht2024Weitzman2023-125] Staniszewski 2024; Play.ht 2024; Weitzman 2023.

[FOOTNOTEPlay.ht2024-126] Play.ht 2024.

[FOOTNOTEWeitzman2023-127] Weitzman 2023.

[FOOTNOTEStaniszewski2024-128] Staniszewski 2024.

[FOOTNOTEOsman2022-129] Osman 2022.

[FOOTNOTEOpenAI2024Temitope2024-130] OpenAI 2024; Temitope 2024.

[51] @fifteenai (December 7, 2024). "The past and future of 15.ai" (Tweet). Archived from the original on December 8, 2024. Retrieved December 19, 2024 – via Twitter.

[faq-74] @fifteenai (May 18, 2025). "We are so back. https://15.dev Only MLP characters for now. More characters, features, and improvements will be added soon. Check Twitter and/or the Discord server (linked on the website) for updates! (Expect possible downtime as I calibrate server capacity and GPU allocations depending on how busy the website gets.)" (Tweet). Archived from the original on October 4, 2025. Retrieved May 18, 2025 – via Twitter.

[67] Yea, Yong (January 14, 2022). "Troy Baker Faces Mass Backlash For Supporting Shady AI Voice NFTs With Company That Has Stolen Work". YouTube. Event occurs at 15:54–16:13. Archived from the original on December 20, 2024. Retrieved March 23, 2025. This isn't just one of those things [Voiceverse] can go 'Whoopsies!' on. [They] plagiarized somebody else's work and used that as a means to falsely market the quality of [their] own products, by using somebody else's higher quality voice AI to promote [Voiceverse] for [their] own benefit.

[voc-112] The VŌC Podcast // John Patrick Lowrie & Ellen McLain Interview (The voices of GLaDOS and Sniper) (Podcast). The VŌC Podcast. April 11, 2021. Event occurs at 0:51:50–1:01:25. Archived from the original on August 20, 2025. Retrieved January 15, 2025.

[113] Vetterlein, Nathan (January 10, 2021). "Nate listens to his AI self". Twitch. Retrieved January 21, 2025.

[a]