Cloudera Data Science Workbench vs Dremio comparison

 

Comparison Buyer's Guide

Executive Summary
Updated on Nov 9, 2025

Categories and Ranking

Cloudera Data Science Workb...
Ranking in Data Science Platforms: 22nd
Average Rating: 7.0
Reviews Sentiment: 6.9
Number of Reviews: 2
Ranking in other categories: None

Dremio
Ranking in Data Science Platforms: 11th
Average Rating: 8.4
Reviews Sentiment: 6.6
Number of Reviews: 11
Ranking in other categories: Cloud Data Warehouse (5th)
 

Mindshare comparison

As of January 2026, in the Data Science Platforms category, Cloudera Data Science Workbench holds a 1.6% mindshare, up from 1.4% a year earlier. Dremio holds a 2.3% mindshare, down from 4.2% a year earlier. Mindshare is calculated from PeerSpot user engagement data.
Data Science Platforms Market Share Distribution
Product                            Market Share (%)
Dremio                             2.3%
Cloudera Data Science Workbench    1.6%
Other                              96.1%
 

Featured Reviews

Ismail Peer - PeerSpot reviewer
Program Management Lead Advisor at Unionbank Philippines
Useful for data science modeling but improvement is needed in MLOps and pricing
If you don't configure CDSW well, it might not be useful for you. Deploying the tool can vary in complexity, but most of the time, it's relatively simple and straightforward. Triggering a job from data to production is easy, as the platform automates the deployment process. However, ensuring optimal resource allocation is essential for smooth operations.
Corrr Moray - PeerSpot reviewer
SR BI developer at BRQ Digital Solutions
Has simplified complex data integration workflows and supported consistent reporting across multiple sources
We also have a close relationship with the team that maintains Dremio for the database, handling tasks like version upgrades, and they know about specific problems we had in the past, such as a memory leak. We had a memory leak on some versions, which sometimes stopped the service. Since we run Dremio installed on a server rather than as a SaaS solution, we often need to stop and restart the service to clear the cache. I also see that new versions of Dremio often have not fixed old bugs, and in some new versions, problems that were previously fixed come back, so I think the upgrade process could use improvement. I remember using some features in the past, like pivot tables, which proved to be really difficult, though I know other vendors share this fault. Pivoting, transposing, and unpivoting are often not so good. CTEs also often prove to be not so good, so I think these two main items could be improved significantly if they standardize them.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The Cloudera Data Science Workbench is customizable and easy to use."
"I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don't interfere with each other. The deployment of machine learning is fast and easy to manage. Its API calls are also fast."
"Overall, you can rate it as eight out of ten."
"Dremio gives you the ability to create services which do not require additional resources and serialization."
"Dremio allows querying the files I have on my block storage or object storage."
"Dremio has positively impacted my organization by helping us create a single source of truth, a singular data warehouse where we can have access to all of the data sets."
"The first feature that stands out for me in Dremio is the federated type of query, which allows the possibility to use multiple endpoints without worrying about writing custom SQL that runs only for SQL Server or for Postgres and Redshift."
"We primarily use Dremio to create a data framework and a data queue."
"The most valuable feature of Dremio is it can sit on top of any other data storage, such as Amazon S3, Azure Data Factory, SGFS, or Hive. The memory competition is good. If you are running any kind of materialized view, you'd be running in memory."
"Dremio enables you to manage changes more effectively than any other data warehouse platform. There are two things that come into play. One is data lineage. If you are looking at data in Dremio, you may want to know the source and what happened to it along the way or how it may have been transformed in the data pipeline to get to the point where you're consuming it."
 

Cons

"Running this solution requires a minimum of 12GB to 16GB of RAM."
"The tool's MLOps is not good. Its pricing also needs to improve."
"We had a memory leak on some versions, which sometimes stopped the service."
"Dremio takes a long time to execute large queries, correlated queries, or nested queries. Additionally, the solution could improve if we could read data from streaming pipelines or if it allowed us to create the ETL pipeline directly on top of it, similar to Snowflake."
"I cannot use the recursive common table expression (CTE) in Dremio because the support page says it's currently unsupported."
"Dremio doesn't support the Delta connector. Dremio writes the IT support for Delta, but the support isn't great. There is definitely room for improvement."
"They need to have multiple connectors. Starburst is rich in connectors; however, they are lacking Salesforce connectivity as of today."
"We've faced a challenge with integrating Dremio and Databricks, specifically regarding authentication. It is not shaking hands very easily."
"It shows errors sometimes."
"They have an automated tool for building SQL queries, so you don't need to know SQL. That interface works, but it could be more efficient in terms of the SQL generated from those things. It's going through some growing pains. There is so much value in tools like these for people with no SQL experience. Over time, Dremio will make these capabilities more accessible to users who aren't database people."
 

Pricing and Cost Advice

"The product is expensive."
"Right now the cluster costs approximately $200,000 per month and is based on the volume of data we have."
"Dremio is less costly compared to Snowflake or any other tool."
881,082 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 36%
Manufacturing Company: 9%
Healthcare Company: 7%
Computer Software Company: 5%

Financial Services Firm: 28%
Computer Software Company: 9%
Manufacturing Company: 6%
Healthcare Company: 5%
 

Company Size

By reviewers
Large Enterprise, Midsize Enterprise, Small Business: No data available

By reviewers
Company Size          Count
Small Business        1
Midsize Enterprise    5
Large Enterprise      5
 

Questions from the Community

What do you like most about Cloudera Data Science Workbench?
I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don't interfere with each other. The deployment of machine learning is fast and easy...
What needs improvement with Cloudera Data Science Workbench?
The tool's MLOps is not good. Its pricing also needs to improve.
What is your primary use case for Cloudera Data Science Workbench?
We have different use cases. Our banking use case uses machine learning to identify customer life events and recommend the best-suited card products. These machine-learning models are deployed in o...
What do you like most about Dremio?
Dremio allows querying the files I have on my block storage or object storage.
What is your experience regarding pricing and costs for Dremio?
I don't have information about pricing, setup cost, and licensing for Dremio, so I am not entitled to discuss it.
What needs improvement with Dremio?
I wouldn't say there is anything Dremio can be improved on. If I could change something, I would say many developers and programmers, when they are starting to work in this specific field or area, ...
 

Also Known As

CDSW
Dremio AWS - BYOL
 

Sample Customers

IQVIA, Rush University Medical Center, Western Union
UBS, TransUnion, Quantium, Daimler, OVH
Find out what your peers are saying about Cloudera Data Science Workbench vs. Dremio and other solutions. Updated: December 2025.