Google Text To Speech AI

Convert text into natural-sounding speech using an API powered by the best of Google’s AI technologies.

Country of Origin

  • United States

Platforms

  • Online

Features

  1.  Extensible by Plugins/Extensions
  2.  Text to Speech
  3.  No Coding Required
  4.  Cloud Sync
  5.  Real time collaboration
  6.  AI-Powered

Google Text To Speech AI information

  • Developed by

    Google
  • Licensing

    Proprietary and Commercial product.
  • Pricing

    Listed as a one-time purchase of $0 and/or a subscription of $0 per month.
  • Alternatives

    37 alternatives listed
  • Supported Languages

    • English
    • Turkish
    • German
    • Spanish
    • Italian
    • Russian
    • Ukrainian
    • French
    • Dutch
    • Arabic
    • Chinese
    • Japanese
    • Korean

Google Text To Speech AI was added to AlternativeTo by canermeow on Mar 8, 2025 and this page was last updated Mar 8, 2025.

What is Google Text To Speech AI?

  • Improve customer interactions with intelligent, lifelike responses.
  • Engage users with a voice user interface in your devices and applications.
  • Personalize your communication based on each user's preferred voice and language.

Features

Custom Voice: Train a custom speech synthesis model on your own audio recordings to create a unique, more natural-sounding voice for your organization. You can define and choose the voice profile that suits your organization and adapt quickly as voice needs change, without recording new phrases.

Long audio synthesis: Asynchronously synthesize up to 1 million bytes of input with Long Audio Synthesis.
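
Because the limit is stated in bytes rather than characters, non-ASCII text uses up the budget faster than its length suggests. The sketch below checks input size before submission. It assumes the limit is measured on the UTF-8 encoding, which this listing does not confirm; the function name is hypothetical.

```python
# Limit quoted in the feature description above (1 million bytes).
MAX_INPUT_BYTES = 1_000_000


def fits_long_audio_limit(text: str) -> bool:
    """Return True if the text fits the quoted byte limit.

    Assumption: the size is measured on the UTF-8 encoding of the text.
    """
    return len(text.encode("utf-8")) <= MAX_INPUT_BYTES
```

Under that assumption, a string of 500,001 accented characters such as "é" would exceed the limit, since each one occupies two bytes in UTF-8.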

Voice and language selection: Choose from an extensive selection of 220+ voices across 40+ languages and variants, with more to come soon.

WaveNet voices: Take advantage of 90+ WaveNet voices, built on DeepMind’s groundbreaking research, to generate speech that significantly closes the gap with human performance.

Text and SSML support: Customize your speech with SSML tags that allow you to add pauses, numbers, date and time formatting, and other pronunciation instructions.
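
As an illustration of the SSML tags mentioned above, here is a minimal sketch that builds a document with a pause and a date. The `<speak>`, `<break>`, and `<say-as>` elements come from the SSML standard; the helper name and the chosen attribute values are assumptions for this example, not taken from the listing.

```python
from xml.sax.saxutils import escape


def build_ssml(intro: str, date_text: str, pause_ms: int = 500) -> str:
    """Build an SSML string: intro text, a pause, then a date read as a date.

    User text is XML-escaped so characters like '&' do not break the markup.
    """
    return (
        "<speak>"
        f"{escape(intro)}"
        f'<break time="{pause_ms}ms"/>'
        f'<say-as interpret-as="date" format="mdy">{escape(date_text)}</say-as>'
        "</speak>"
    )


ssml = build_ssml("Your order ships on", "03/08/2025")
```

The resulting string would be sent as SSML input instead of plain text.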

Pitch tuning: Adjust the pitch of your selected voice by up to 20 semitones above or below the default.

Speaking rate tuning: Adjust the speaking rate to be up to 4x faster or 4x slower than the normal rate.

Volume gain control: Increase the output volume by up to 16 dB or decrease it by up to 96 dB.
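
The three tuning ranges above can be enforced client-side before a request is made. This sketch clamps each value to the quoted limits. The key names `pitch`, `speakingRate`, and `volumeGainDb` follow the REST API's audio configuration as best understood here and should be checked against the official reference; the helper names are hypothetical.

```python
def clamp(value: float, low: float, high: float) -> float:
    """Restrict value to the closed interval [low, high]."""
    return max(low, min(high, value))


def make_audio_config(pitch=0.0, speaking_rate=1.0, volume_gain_db=0.0) -> dict:
    """Build tuning settings, clamped to the ranges quoted in the listing."""
    return {
        "pitch": clamp(pitch, -20.0, 20.0),                 # semitones
        "speakingRate": clamp(speaking_rate, 0.25, 4.0),    # 1.0 is normal speed
        "volumeGainDb": clamp(volume_gain_db, -96.0, 16.0), # decibels
    }
```

Clamping silently corrects out-of-range values; raising an error instead may be preferable when callers should be told about bad input.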

Integrated REST and gRPC APIs: Easily integrate with any application or device that can send a REST or gRPC request, including phones, PCs, tablets, and IoT devices (for example cars, TVs, speakers).
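
A REST call needs only an HTTP client. The following sketch prepares (but does not send) a synthesis request using Python's standard library. The endpoint URL and JSON field names reflect the public v1 REST API as best understood here and are not stated in this listing; the access token is a placeholder you would obtain through your own Google Cloud credentials.

```python
import json
import urllib.request

# Assumed v1 REST endpoint; verify against the official API reference.
ENDPOINT = "https://texttospeech.googleapis.com/v1/text:synthesize"


def build_synthesize_request(text, access_token, language_code="en-US"):
    """Prepare a POST request asking for MP3 audio of the given text."""
    body = {
        "input": {"text": text},
        "voice": {"languageCode": language_code},
        "audioConfig": {"audioEncoding": "MP3"},
    }
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {access_token}",
            "Content-Type": "application/json; charset=utf-8",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would perform the call; that step is left out so the sketch runs without credentials or network access.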

Audio format flexibility: Convert text to MP3, Linear16, OGG Opus, and a number of other audio formats.
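
The requested format determines how the returned audio should be saved. This sketch assumes the REST response carries base64-encoded audio in an `audioContent` field and that Linear16 output includes a WAV header; both points are assumptions to verify against the official reference. The extension mapping and function name are illustrative only.

```python
import base64

# Illustrative mapping from encoding name to a file extension.
EXTENSIONS = {"MP3": ".mp3", "LINEAR16": ".wav", "OGG_OPUS": ".ogg"}


def decode_audio(response: dict, encoding: str):
    """Return (raw audio bytes, suggested file name) for a response dict.

    Assumption: the audio arrives base64-encoded under 'audioContent'.
    """
    audio = base64.b64decode(response["audioContent"])
    return audio, "speech" + EXTENSIONS[encoding]
```

The bytes can then be written to the suggested file name in binary mode.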

Audio profiles: Optimize the output for the type of speaker on which the speech will play, such as headphones or phone lines.
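
As a sketch of how a profile might be selected, the helper below adds a profile identifier to an audio configuration. The `effectsProfileId` field and the two identifiers shown are recalled from the public API documentation rather than from this listing, so treat them as assumptions; the friendly target names are hypothetical.

```python
# Assumed profile identifiers for the two playback targets named above.
PROFILE_IDS = {
    "headphones": "headphone-class-device",
    "phone_line": "telephony-class-application",
}


def with_audio_profile(audio_config: dict, target: str) -> dict:
    """Return a copy of audio_config with the profile for target applied."""
    updated = dict(audio_config)  # leave the caller's dict untouched
    updated["effectsProfileId"] = [PROFILE_IDS[target]]
    return updated


config = with_audio_profile({"audioEncoding": "MP3"}, "headphones")
```

The field takes a list, so more than one profile could in principle be supplied.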