
Amazon Polly vs Google Cloud Text-to-Speech comparison

 

Comparison Buyer's Guide

Executive Summary
Updated on Nov 2, 2025

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Amazon Polly
Ranking in Text-To-Speech Services: 1st
Average Rating: 7.4
Reviews Sentiment: 7.6
Number of Reviews: 5
Ranking in other categories: None

Google Cloud Text-to-Speech
Ranking in Text-To-Speech Services: 2nd
Average Rating: 8.4
Reviews Sentiment: 5.2
Number of Reviews: 3
Ranking in other categories: None
 

Mindshare comparison

As of January 2026, Amazon Polly holds a 21.7% mindshare in the Text-To-Speech Services category, down from 33.1% the previous year. Google Cloud Text-to-Speech holds 21.2%, down from 29.9% the previous year. Mindshare is calculated from PeerSpot user engagement data.
Text-To-Speech Services Market Share Distribution

Product                        Market Share (%)
Amazon Polly                   21.7%
Google Cloud Text-to-Speech    21.2%
Other                          57.1%
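
The year-over-year mindshare figures above can be compared in two ways: as a change in percentage points, or as a relative change against the previous year's value. The short sketch below computes both from the four numbers quoted in the text. The function and variable names are illustrative only and are not part of any PeerSpot tooling.

```python
# Mindshare figures quoted above: (previous year %, January 2026 %).
MINDSHARE = {
    "Amazon Polly": (33.1, 21.7),
    "Google Cloud Text-to-Speech": (29.9, 21.2),
}


def point_change(previous: float, current: float) -> float:
    """Change in percentage points, rounded to one decimal place."""
    return round(current - previous, 1)


def relative_change(previous: float, current: float) -> float:
    """Change relative to the previous value, as a percentage."""
    return round((current - previous) / previous * 100, 1)


if __name__ == "__main__":
    for product, (previous, current) in MINDSHARE.items():
        print(
            f"{product}: {point_change(previous, current)} points, "
            f"{relative_change(previous, current)}% relative"
        )
```

Both products lost share over the period; the point change shows the absolute drop, while the relative change shows how large that drop is compared with where each product started.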
 

Featured Reviews

AG
Senior Software Developer at a tech vendor with 10,001+ employees
Text has been converted to speech across multiple languages with customizable voice settings
The most beneficial aspect of Amazon Polly is its ability to convert text to speech in multiple languages. It allows us to change the voice configurations for both male and female voices, and enables adjustments in pronunciation and delays. These features help us effectively target our users. Additionally, the integration capabilities with AWS services like Lambda aid us in storing Polly voice messages in DynamoDB and S3. It also offers configurations in multiple languages, enhancing our service reach.
reviewer2252211 - PeerSpot reviewer
Principal Architect & NLP Python Developer at a computer software company with 1-10 employees
Support issues overshadow solid features in daily operations
The support is inadequate. We are dealing with them on our development talk today. There's a lot of finger-pointing going on in terms of whose problem it is. Moving our stuff up to the Google Cloud and getting it to work just as well as it does on people's development machines is problematic. Their support for that, even though we paid for it, isn't really very helpful. That's prevalent in the computer business. You need to have your own experts, otherwise you're really in trouble. The product is an eight out of 10. The support is at best a five. We have to write certain features ourselves because their offerings aren't very powerful. When I don't have a problem, it works pretty well, better than anybody else. But when I do have a problem, I'm severely impacted. It takes a lot of time and money to go back and fix it. What has gotten better with Google Cloud Text-to-Speech is their stuff sounds so natural, it really brings a smile to my face. I wish their support would be better.

Quotes from Members

 

Pros

"The most beneficial aspect of Amazon Polly is its ability to convert text to speech in multiple languages."
"We can use the SSML tags in Amazon Polly to modify text-to-speech by controlling speech patterns and behaviour."
"The sound generated by Amazon Polly is very natural, and I appreciate the options to select different voices, including an expensive or cheaper one, and the Structured Speech Markup Language (SSML) feature allows me to specify if I want a warmer or higher tune, which has helped make the meditations sound very natural."
"Amazon Polly is useful because it's helpful to hear the words on top of it when I can't take in information in a general way. Sometimes, it's very taxing if I'm trying to read cases. They have the neural voices, and they're so realistic. You don't even know that a person is not reading to you, making things much better. I know that they do have the ability to provide you with your own lexicon that's personal to you. I like that you can adjust the pitch and the speed of the voice because some people talk way too fast. Or if you're reading, I read slowly, so that's always helpful. One of the functions that I find helpful is that when reading material on the web, it's like it has its own browser. You go to the URL, and you don't have to read the whole thing, and you can stick the cursor on the place where you want it to start. Then if you want it to skip over something, you put it somewhere else, and that's ideal for reading case law because you skip around a lot. You don't really read it from start to finish. It helps if someone's going to read all those citations because they definitely want to be able to skip that."
"Amazon Polly offers significant features like the ability to select different voice categories and language options, such as Spanish, Portuguese, German, and French, which is particularly useful for maintaining worldwide contact centers and enhances customer experience by allowing us to give voice responses instead of text-based responses."
"What has gotten better with Google Cloud Text-to-Speech is their stuff sounds so natural, it really brings a smile to my face."
"It's not complex to set up."
"Precision is the most valuable feature of Google Cloud Text-to-Speech because the text is perfectly voiced."
 
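
Several of the excerpts above mention using SSML tags to control pauses and speaking speed. As a rough illustration of what that looks like, the sketch below builds a small SSML document with the standard `<break>` and `<prosody>` elements. The helper name `build_ssml` and its parameters are hypothetical, and the sample sentences are made up for the example; it does not call any cloud service.

```python
from xml.sax.saxutils import escape


def build_ssml(sentences, pause_ms=500, rate="slow"):
    """Wrap sentences in SSML with a slower rate and a pause between them."""
    # <break> inserts a pause; <prosody rate="..."> changes the speaking speed.
    pause = f'<break time="{int(pause_ms)}ms"/>'
    # Escape characters such as & and < so the result stays valid XML.
    body = pause.join(escape(sentence) for sentence in sentences)
    return f'<speak><prosody rate="{rate}">{body}</prosody></speak>'


if __name__ == "__main__":
    print(build_ssml(["Breathe in.", "Breathe out & relax."]))
```

Assuming an AWS setup with boto3, a string like this could be passed to the Polly client's `synthesize_speech` call with `TextType="ssml"`. Which tags and attribute values are accepted depends on the service and voice type, so the result should be checked against the provider's SSML documentation.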

Cons

"When you put more tags inside Amazon Polly to define break time and instruct the speech to be conversational, sometimes it gives you an error."
"The price could be better. I wish it weren't so expensive to do because it's really cool. I would love to see them have lexicon packages of them like, this is for lawyers, this is for accountants, and it's going to have a lot of things in it. I also think they could do a better job at showing use cases other than telemarketing or contact center stuff like bots that are very commercial. I know that's where the money is, but it's such a huge hole that's missing for people with disabilities that are even worse than mine. Some people cannot see or hear at all, but they're not just cognitively impaired."
"Amazon Polly's standard text-to-speech feature could be enhanced to deliver more natural and expressive human-like speech."
"We had some problems with Dialogflow."
"Google Cloud Text-to-Speech is 100 out of 100 when it works, and when it doesn't work, which is fairly often, it gets a zero."
"Google Cloud Text-to-Speech has just one female voice and one male voice in Brazil, while it has a lot of voices in other countries."
 

Pricing and Cost Advice

"The price could be better. Neural voices are so realistic, and I want to say that they have it so that you can try to tell where the voice is coming from or something like that. But if I have more than one, it's so expensive to have to listen to a bunch of cases on my phone and have the neural voice read to me. It really wouldn't be worth it. It'd be paying probably more than what I make in the case. Right now, I'm on the free tier, and I think the number of minutes that you get is reasonable as long as you're not doing this all the time and you're using it judiciously. I have some credits that I think I can use, but I don't know how fast they'll go through."
"The solution has a pay-as-you-go pricing model, where you must pay according to your usage."
"I rate Google Cloud Text-to-Speech three out of ten for pricing."
881,082 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews

Amazon Polly
Educational Organization: 8%
Comms Service Provider: 8%
Computer Software Company: 7%
Financial Services Firm: 7%

Google Cloud Text-to-Speech
Financial Services Firm: 12%
Educational Organization: 10%
Computer Software Company: 9%
Comms Service Provider: 8%
 

Company Size

By reviewers (Large Enterprise, Midsize Enterprise, Small Business): no data available for either product.
 

Questions from the Community

What is your experience regarding pricing and costs for Amazon Polly?
Amazon Polly uses a pay-as-you-go pricing model. The standard voice type costs around $4 per one million characters, while the neural voice type costs approximately $10. It is free for the first tw...
What needs improvement with Amazon Polly?
Amazon Polly's standard text-to-speech feature could be enhanced to deliver more natural and expressive human-like speech. New speaking styles, emotions, more languages, and advanced features could...
What is your primary use case for Amazon Polly?
We are using Amazon Polly to convert text into speech. It is being utilized to provide speech and voice messages to disabled users and also to deliver these speec...
What is your experience regarding pricing and costs for Google Cloud Text-to-Speech?
Our experience is we didn't have any other choice. We can't really say that it's well-priced or badly priced. We just didn't have another choice as far as we were concerned.
What needs improvement with Google Cloud Text-to-Speech?
The support is inadequate. We are dealing with them on our development talk today. There's a lot of finger-pointing going on in terms of whose problem it is. Moving our stuff up to the Google Cloud...
What is your primary use case for Google Cloud Text-to-Speech?
We use Speech-to-Text and Text-to-Speech to be able to talk to our users. We have an AI meaning engine that back-ends that. Once we get the speech, we can tell what it means. That's our use case. W...
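
The Amazon Polly pricing answer above quotes approximate pay-as-you-go rates per one million characters. A minimal sketch of how those figures turn into a rough cost estimate follows. The rates are the reviewer's approximations, not official AWS pricing, and the free tier is ignored because the answer is truncated before its details. The names `REVIEWER_RATES_USD_PER_MILLION` and `estimate_cost` are illustrative.

```python
# Approximate rates quoted by the reviewer, in USD per one million characters.
# These are the reviewer's figures, not official AWS pricing.
REVIEWER_RATES_USD_PER_MILLION = {"standard": 4.0, "neural": 10.0}


def estimate_cost(characters: int, voice_type: str = "standard") -> float:
    """Return a rough pay-as-you-go cost in USD, ignoring any free tier."""
    if characters < 0:
        raise ValueError("characters must be non-negative")
    rate = REVIEWER_RATES_USD_PER_MILLION[voice_type]
    return characters / 1_000_000 * rate


if __name__ == "__main__":
    # Example workload of 2.5 million characters (an arbitrary sample size).
    for voice in ("standard", "neural"):
        print(voice, round(estimate_cost(2_500_000, voice), 2))
```

For real budgeting, the rates in the dictionary should be replaced with the provider's current published prices.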
 

Overview

 

Sample Customers

Amazon Polly: GoAnimate, Duolingo, Bandwidth
Google Cloud Text-to-Speech: Home Depot, PayPal, Target, HSBC, McKesson
Find out what your peers are saying about Amazon Polly vs. Google Cloud Text-to-Speech and other solutions. Updated: December 2025.