What is our primary use case?
We use Speech-to-Text and Text-to-Speech to be able to talk to our users. We have an AI meaning engine that back-ends that. Once we get the speech, we can tell what it means. That's our use case.
When we tested Speech-to-Text a few years ago, it was better than the equivalent IBM products because it was more accurate. But it's not by any means 100% accurate, and we have to correct for those errors in our AI software.
It's using neural networks and that stochastic processing is 70-75% accurate. It gets it wrong too often, and we really don't like that. Since I personally work with this, I don't like that a lot. But they seem to be the best game in town right now.
There's nobody else out there doing STT and TTS that is as good as them. There are several competitors trying, including Nuance and IBM, but their solutions are not very good, at least not as good as Google Cloud Text-to-Speech.
What is most valuable?
Google Cloud Text-to-Speech is 100 out of 100 when it works, and when it doesn't work, which is fairly often, it gets a zero. It doesn't fail gracefully, it doesn't fail in an expected way, it just fails. It just stops working and then we get errors, and then my development people have to try and figure out what those are and do something about it.
If you want to talk to your computer, then in our view, using Google Cloud Text-to-Speech is the way to do it. But it's not without problems. The difference between how it works in a Windows environment on a development machine and how it works up in the cloud are unsettling and costly because we're spending expensive engineering time trying to figure it out.
What needs improvement?
The support is inadequate. We are dealing with them on our development talk today. There's a lot of finger-pointing going on in terms of whose problem it is. Moving our stuff up to the Google Cloud and getting it to work just as well as it does on people's development machines is problematic. Their support for that, even though we paid for it, isn't really very helpful. That's prevalent in the computer business. You need to have your own experts, otherwise you're really in trouble.
The product is an eight out of 10. The support is at best a five. We have to write certain features ourselves because their offerings aren't very powerful.
When I don't have a problem, it works pretty well, better than anybody else. But when I do have a problem, I'm severely impacted. It takes a lot of time and money to go back and fix it.
What has gotten better with Google Cloud Text-to-Speech is their stuff sounds so natural, it really brings a smile to my face. I wish their support would be better.
For how long have I used the solution?
I have been using this solution my whole life.
What was my experience with deployment of the solution?
Moving our stuff up to the Google Cloud and getting it to work just as well as it does on people's development machines is problematic. The support for that, even though we paid for it, isn't really very helpful.
On development machines and laptops, it works great. It works out of the box. That's terrific and one of the reasons we went with them. Up in the cloud though, we encounter error after error.
What do I think about the stability of the solution?
The product is the best of breed as far as we're concerned right now. When I don't have a problem, it works pretty well, better than anybody else. But when I do have a problem, I'm severely impacted. It takes a lot of time and money to go back and fix it.
What do I think about the scalability of the solution?
Moving our stuff up to the Google Cloud and getting it to work just as well as it does on people's development machines is problematic. The support for that, even though we paid for it, isn't really very helpful. Up in the cloud, we encounter error after error.
How are customer service and support?
We just got error messages and we couldn't get any useful responses from customer support. Their support is inadequate. The support is bad.
How would you rate customer service and support?
Which solution did I use previously and why did I switch?
We evaluated IBM and Nuance. I personally use AWS, but we don't use their cloud. We use the Google Cloud. We're not buying it from AWS, we're buying it directly from Google.
How was the initial setup?
The accuracy is the most important factor. It doesn't matter what you do with the utterance if you don't recognize it. These other products didn't recognize it as well as Google Cloud Text-to-Speech does.
What about the implementation team?
We haven't done any of that yet. We're following the research at MIT of Rosalind Picard about those kinds of issues. We plan to do something with it, we just haven't quite got there yet.
What was our ROI?
Our expectation was we didn't have any other choice. We can't really say that it's well-priced or badly priced. We just didn't have another choice as far as we were concerned. The product is the best of breed as far as we're concerned right now.
What's my experience with pricing, setup cost, and licensing?
Our experience is we didn't have any other choice. We can't really say that it's well-priced or badly priced. We just didn't have another choice as far as we were concerned.
Which other solutions did I evaluate?
IBM and Nuance were the products we evaluated.
What other advice do I have?
I'm not familiar with Speaker Diarization. Regarding the custom voice creation feature, we just use their voices, which are fine for our concerns.
We haven't used it with Google's ML. However it's coming out of the box to us, we've got enough problems understanding the meaning of the utterance. We don't want to spend money on that if Google can do it.
Part of our metrics involves call abandonment and some internal metrics we've developed about understanding what to do in a conversation with an utterance and how that plays with the users. Currently, it's disappointingly bad with complex conversations. Simple queries are easy, but real human conversations need a lot of work with AI support.
Their pricing is competitive, but what matters most is that it works. The other competitors don't work well enough for us to consider them. That's just the cost of doing business.
Overall rating: 8/10
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Google