archived 31 Mar 2026 00:38:39 UTCarchive.today webpage capture | Saved from | ||
| All snapshots | from host cloud.google.com | ||
| WebpageScreenshot | |||
| Streaming audio synthesis | Power your AI agents with ultra-low-latency speech for seamless, real-time conversations with streaming audio synthesis. |
| Long audio synthesis | Asynchronously synthesize up to 1 million bytes of input with long audio synthesis. |
| Voice and language selection | Choose from an extensive selection of 380+ voices across 75+ languages and variants, with more to come soon. |
| Text and SSML support | Customize your speech with SSML tags that allow you to add pauses, numbers, date and time formatting, and other pronunciation instructions. |
| Pitch tuning | Personalize the pitch of your selected voice, up to 20 semitones more or less than the default. |
| Speaking rate tuning | Adjust your speaking rate to be 4x faster or slower than the normal rate. |
| Volume gain control | Increase the volume of the output by up to 16 db or decrease the volume up to -96 db. |
| Integrated REST and gRPC APIs | Easily integrate with any application or device that can send a REST or gRPC request including phones, PCs, tablets, and IoT devices (for example cars, TVs, speakers). |
| Audio format flexibility | Convert text to MP3, Linear16, OGG Opus, and a number of other audio formats. |
| Audio profiles | Optimize for the type of speaker from which your speech is intended to play, such as headphones or phone lines. |