Try our new research platform with insights from 80,000+ expert users
Umar Ijaz - PeerSpot reviewer
Back End Developer at AskHumans
Real User
Top 5Leaderboard
Jun 28, 2024
Handles large data, good documentation is available and powerful model
Pros and Cons
  • "Deepgram is able to handle large volumes of audio data without compromising accuracy."
  • "The area of live transcription could be improved. Sometimes, Deepgram's WebSocket is disposed due to redundancy."

What is our primary use case?

I use Deepgram for audio transcriptions and speech recognition. I am working on a feedback survey app where users provide verbal feedback that Deepgram transcribes into text. 

We receive the results and implement features like punctuation and Smart Format.

How has it helped my organization?

Deepgram has significantly improved our transcription process in terms of speed and accuracy. It has allowed us to efficiently convert verbal feedback into text, enabling quicker analysis and implementation of new features. 

Integrating Deepgram has streamlined our workflow, enhancing productivity and delivering more accurate transcription results.

What is most valuable?

We previously used IBM Watson, which was slow and had limitations in accurately transcribing words. After evaluating OpenAI's Whisper model, we discovered Deepgram, which incorporates Whisper and adds the powerful Nova model.

Deepgram's latency is impressively low, around 0.5 to 1 second, making it a superior choice.

What needs improvement?

Live transcription could be improved. Sometimes, Deepgram's WebSocket is disposed of due to redundancy issues. Enhanced stability in live transcription would be beneficial.

Buyer's Guide
Deepgram
March 2026
Learn what your peers think about Deepgram. Get advice and tips from experienced pros sharing their opinions. Updated: March 2026.
884,873 professionals have used our research since 2012.

For how long have I used the solution?

I have been using Deepgram for one and a half years.

What do I think about the stability of the solution?

Initially, we encountered some stability issues, but Deepgram has since improved its architecture. With the addition of hooks for status updates, the accuracy has improved to approximately 90 to 95%, which is better than other models we've tested.

What do I think about the scalability of the solution?

It's scalable. Our platform handles 50 to 60 users simultaneously without compromising accuracy. For instance, a 20-minute audio file was transcribed within a second, demonstrating its ability to handle large volumes of audio data effectively.

How are customer service and support?

My experience with customer service and support has been positive. They are responsive and helpful, and they provide timely resolutions to any issues.

Which solution did I use previously and why did I switch?

We previously used IBM Watson, but it didn't deliver appropriate results. We searched for alternatives and found OpenAI's Whisper model, which was initially slow. After thorough analysis, we discovered Deepgram. It proved to be superior, leading to our decision to migrate. We used a detailed spreadsheet to compare various models before making the switch.

How was the initial setup?

Thanks to clear documentation, the initial setup was very easy. If you have prerequisite knowledge of the programming language you're using, it’s straightforward to follow the documentation and implement it into your system. When I started, I closely followed the documentation, which made the process very manageable.

Deployment model: We last deployed it on the Google Cloud Platform (GCP).

What about the implementation team?

The implementation was done in-house.

What was our ROI?

Our ROI has increased due to enhanced transcription accuracy and speed, leading to more efficient workflows and better user satisfaction.

What's my experience with pricing, setup cost, and licensing?

The pricing is moderate. While live transcription may incur some charges when the connection is open, they become minimal over time. So, it's a balanced option—neither cheap nor overly expensive.

Which other solutions did I evaluate?

Yes, besides IBM Watson, we evaluated OpenAI's Whisper model.

What other advice do I have?

Deepgram is highly recommended. Users don’t need to do anything special before using it, as the documentation is comprehensive. I am a Node.js developer and have used Deepgram packages for Node.js. Understanding your programming language is key, whether it's Node.js, Python, or others.

AI Features:

I have integrated various AI models into our application. Deepgram's sentiment analysis feature allows us to create graphs and analyses to determine if words are positive, negative, or neutral. This helps us summarize feedback and derive actionable insights.

My ratings:

I would rate it an eight out of ten. The live transcription feature needs improvement as the WebSocket sometimes gives errors or breaks down during live streams.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Google
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Full Stack Developer at Global IT App Info Solution
Real User
Top 10Leaderboard
Jun 6, 2024
Used to transcribe videos, but does not properly identify the number of speakers
Pros and Cons
  • "The speed of the solution for transcribing videos is good."
  • "The solution does not properly identify the number of speakers."

What is our primary use case?

I run Deepgram on my local system to transcribe videos.

What is most valuable?

The speed of the solution for transcribing videos is good.

What needs improvement?

I need to transcribe my videos to text chat, but there are some issues when I run Deepgram. The solution does not properly identify the number of speakers. For example, Deepgram only identifies two speakers out of three or four speakers in some videos.

The solution also makes some spelling and English grammar mistakes. Deepgram does not properly identify some specific words in a sentence.

For how long have I used the solution?

I have been using Deepgram for one to two months.

What do I think about the stability of the solution?

We haven't faced any breakdowns or bugs with the solution.

What do I think about the scalability of the solution?

My team consists of two members who use Deepgram.

How are customer service and support?

The solution’s technical support is average. I talked to the technical support team regarding an issue where the solution couldn't identify the exact number of speakers. The support team asked me to use certain parameters, but the results were inaccurate. I used all the parameters suggested by the support team, but the speakers were still not identified clearly. However, other services could properly identify the speakers of the videos.

What's my experience with pricing, setup cost, and licensing?

The solution’s pricing is cheap.

What other advice do I have?

I chose to use Deepgram after researching it on Google and finding some good feedback that the solution had good APIs. It's easy for a new user to learn to use the solution.

I would not recommend Deepgram to other users because it does not properly identify video communication. If you compare it with the other APIs, you can easily find that they do not properly identify some words, exclamatory signs, full stops, etc. These are small mistakes the tool is not properly identifying. Also, the solution does not properly identify the speakers. Users should check other APIs before choosing Deepgram.

Overall, I rate the solution a six out of ten.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
Deepgram
March 2026
Learn what your peers think about Deepgram. Get advice and tips from experienced pros sharing their opinions. Updated: March 2026.
884,873 professionals have used our research since 2012.
Arslan Rasheed - PeerSpot reviewer
Full Stack Developer at Pluginfy Technologies
Real User
Top 5Leaderboard
Jun 30, 2024
Used for TTS (Text-to-Speech) and STT (Speech-to-Text) purposes
Pros and Cons
  • "The solution's Speech-to-Text conversion feature is really awesome."
  • "Deepgram is currently restricted to only the English variants, but it should include other languages, such as German or French."

What is our primary use case?

We use the solution for TTS (Text-to-Speech) and STT (Speech-to-Text) purposes.

What is most valuable?

The solution's Speech-to-Text conversion feature is really awesome.

What needs improvement?

Deepgram is currently restricted to only the English variants, but it should include other languages, such as German or French.

For how long have I used the solution?

I have been using Deepgram for five to six months.

What do I think about the stability of the solution?

Deepgram is a stable solution.

What do I think about the scalability of the solution?

The Deepgram cloud can handle large volumes of audio data. Around three to four people use the solution in our organization.

How was the initial setup?

The solution’s initial setup is easy.

What's my experience with pricing, setup cost, and licensing?

Deepgram is a cheap solution. We can create an account for $200, which we can initially use for the Deepgram services.

What other advice do I have?

I have used Deepgram with Twilio for the calling system. I would recommend Deepgram to users who want to use it for speech-to-text purposes.

Overall, I rate the solution an eight out of ten.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
Download our free Deepgram Report and get advice and tips from experienced pros sharing their opinions.
Updated: March 2026
Buyer's Guide
Download our free Deepgram Report and get advice and tips from experienced pros sharing their opinions.