Try our new research platform with insights from 80,000+ expert users

Cloudera Distribution for Hadoop vs ScyllaDB comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Jan 7, 2025

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Cloudera Distribution for H...
Ranking in NoSQL Databases
9th
Average Rating
8.0
Reviews Sentiment
6.3
Number of Reviews
51
Ranking in other categories
Hadoop (2nd)
ScyllaDB
Ranking in NoSQL Databases
3rd
Average Rating
7.8
Reviews Sentiment
7.0
Number of Reviews
12
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of January 2026, in the NoSQL Databases category, the mindshare of Cloudera Distribution for Hadoop is 3.3%, up from 2.1% compared to the previous year. The mindshare of ScyllaDB is 8.3%, down from 10.8% compared to the previous year. It is calculated based on PeerSpot user engagement data.
NoSQL Databases Market Share Distribution
ProductMarket Share (%)
ScyllaDB8.3%
Cloudera Distribution for Hadoop3.3%
Other88.4%
NoSQL Databases
 

Featured Reviews

Rok Dolinsek - PeerSpot reviewer
Manager, Bussines Development & Co Owner at Troia d.o.o.
Enables on-premise implementation with powerful data processing capabilities
This is the only solution that is possible to install on-premise. Cloudera provides a hybrid solution that combines compute on cloud or on-premises. It includes all machine learning algorithms in the Spark machine learning library. All functionalities needed for a big data platform and ETL are on the platform, eliminating the need for other tools. It is scalable, ready for vertical scaling, and very powerful, offering numerous functionalities and configurations for generative AI.
Manikandan Gunasekaran - PeerSpot reviewer
Director of Engineering at Ola
Reliable data management with great reliability and performance
From a sales pitch standpoint, it needs to deliver on promises of better ROI and compaction. Additionally, ticketing and support systems could be improved due to the time it takes to get answers. There's also an issue with compatibility when attempting to switch back from the enterprise to the community version.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"We had a data warehouse before all the data. We can process a lot more data structures."
"The product provides better data processing features than other tools."
"The data science aspect of the solution is valuable."
"Provides a viable open-source solution for enterprise implementations and reliable, intelligent data analysis."
"The features I find most valuable is that the solution is that it is easy to install and to work with. It starts with the installation and from there on the management is very simple and centralized."
"The product as a whole is good."
"The tool's most interesting features are the distributed file system and unstructured data processing capability. Because we have a lot of unstructured data, like XML and social media logs, these features make it more valuable than the usual data warehousing solutions."
"Cloudera provides a hybrid solution that combines compute on cloud or on-premises."
"The product's most valuable features are efficiency and reliability."
"The best features of ScyllaDB are how it synchronizes data and its failover system. There's a unique formula to decide the number of nodes you need and the minimum required, which I find helpful. It also offers encryption and supports APIs, making it great for distributed systems and scaling databases across different regions. While it's easy to use, having prior experience helps configure it properly. There are many configurations; if you don't understand them, you might mess up the design. So, understanding your system's needs, like whether it requires more read or write operations, is crucial for setting up the correct configuration."
"ScyllaDB is fast and reliable. It has good performance."
"The database is easy to use, fast, and accessible for applications because the API is straightforward."
"It is lightweight, and it requires less infrastructure."
"ScyllaDB allows fine-tuning of the table structure. Speed is probably the most critical factor because we perform a lot of heavy data ingestion. One of its core features is its ability to handle high volumes and maintain speed when accessing data. Additionally, high availability and partitioning are built-in features of ScyllaDB."
"The performance and scalability are good, and we hardly see any major issues with ScyllaDB."
"The performance aspects of Scylla are good, as always... A good point about Scylla is that it can be used extensively."
 

Cons

"There are multiple bugs when we update."
"Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment."
"The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. The Cloudera Machine Learning aspect could be tuned and enhanced to enable us to host some predictive analytics machine learning and AI use cases."
"The Cloudera training has deteriorated significantly."
"The governance aspect of the solution should be improved."
"It would be useful if Cloudera had more tools like SQL Engines that offer the traditional relational database. We have to do a lot of work preparing the data outside Cloudera before getting it into the platform."
"The user infrastructure and user interface needs to be improved, as well as the performance. The GUI needs to be better."
"Currently, we are using many other tools such as Spark and Blade Job to improve the performance."
"ScyllaDB needs to improve its handling of transactions."
"Some of the regular commands in NoSQL do not work."
"The documentation of Scylla is an area with shortcomings and needs to be improved."
"It seems we have better options available. So probably don't go for ScyllaDB. The reason is, first, it's very high. It's not as straightforward as, like, Postgres or ClickHouse to set up. It requires a complex setup."
"The documentation is not well established for new developers."
"From a sales pitch standpoint, it needs to deliver on promises of better ROI and compaction."
"The product needs to add more features and improve the response time of the support team."
"Data export, along with how we can purchase the data periodically, needs to be improved so that the storage is within control. Then, we could optimize it even better."
 

Pricing and Cost Advice

"The tool is not expensive."
"I believe we pay for a three-year license."
"The price is very high. The solution is expensive."
"It is an expensive product."
"Cloudera requires a license to use."
"When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
"The price could be better for the product."
"The solution is fairly expensive."
"I believe that there is a yearly licensing cost and that it's expensive."
"The paid version of ScyllaDB is not that expensive. The main advantage of the paid version is direct support from the ScyllaDB team, which can resolve issues faster—typically within a day, compared to two to three days with the free version. The paid version also offers better guidance and support, while the free version has good documentation and is more high-level. I’d rate their support team nine out of ten because of the quick responses from their community."
"It's free."
"It is an expensive tool compared to its competitor."
"It's a bit expensive."
report
Use our free recommendation engine to learn which NoSQL Databases solutions are best for your needs.
881,082 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
23%
Computer Software Company
9%
Healthcare Company
7%
Comms Service Provider
6%
Financial Services Firm
11%
Comms Service Provider
10%
Computer Software Company
10%
University
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business16
Midsize Enterprise9
Large Enterprise31
By reviewers
Company SizeCount
Small Business3
Midsize Enterprise2
Large Enterprise8
 

Questions from the Community

What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, unlike competitors' cloud-only solutions.
What needs improvement with Cloudera Distribution for Hadoop?
If they could support modifying the data more easily than the current implementation, it would be beneficial.
What is your experience regarding pricing and costs for Scylla?
The enterprise version comes with a cost of about $300,000 per year, however, we did not experience the promised compaction benefits.
What needs improvement with Scylla?
From a sales pitch standpoint, it needs to deliver on promises of better ROI and compaction. Additionally, ticketing and support systems could be improved due to the time it takes to get answers. T...
What is your primary use case for Scylla?
We dump a lot of our data, such as every entry created with respect to when a user rides a scooter, every record gets updated to ScyllaDB. It is used as a single source of truth and it manages mass...
 

Overview

 

Sample Customers

37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
IBM, Investing.com, mParticle, Comcast, GE, Fanatics, Ola, CERN, adgear, Samsung
Find out what your peers are saying about Cloudera Distribution for Hadoop vs. ScyllaDB and other solutions. Updated: December 2025.
881,082 professionals have used our research since 2012.