Try our new research platform with insights from 80,000+ expert users

Cloudera Distribution for Hadoop vs SingleStore comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Cloudera Distribution for H...
Average Rating
8.0
Reviews Sentiment
6.3
Number of Reviews
51
Ranking in other categories
Hadoop (2nd), NoSQL Databases (10th)
SingleStore
Average Rating
8.8
Reviews Sentiment
7.4
Number of Reviews
6
Ranking in other categories
Database as a Service (DBaaS) (17th), Vector Databases (17th)
 

Mindshare comparison

While both are Databases solutions, they serve different purposes. Cloudera Distribution for Hadoop is designed for Hadoop and holds a mindshare of 14.0%, down 27.4% compared to last year.
SingleStore, on the other hand, focuses on Database as a Service (DBaaS), holds 2.9% mindshare, up 1.6% since last year.
Hadoop Market Share Distribution
ProductMarket Share (%)
Cloudera Distribution for Hadoop14.0%
HPE Data Fabric14.3%
Apache Spark13.4%
Other58.3%
Hadoop
Database as a Service (DBaaS) Market Share Distribution
ProductMarket Share (%)
SingleStore2.9%
Amazon RDS13.5%
MongoDB Atlas12.3%
Other71.3%
Database as a Service (DBaaS)
 

Featured Reviews

Rok Dolinsek - PeerSpot reviewer
Manager, Bussines Development & Co Owner at Troia d.o.o.
Enables on-premise implementation with powerful data processing capabilities
This is the only solution that is possible to install on-premise. Cloudera provides a hybrid solution that combines compute on cloud or on-premises. It includes all machine learning algorithms in the Spark machine learning library. All functionalities needed for a big data platform and ETL are on the platform, eliminating the need for other tools. It is scalable, ready for vertical scaling, and very powerful, offering numerous functionalities and configurations for generative AI.
VK
Solution Architect at Wipro Limited
An excellent choice for diverse data processing needs with exceptional in-memory capabilities, robust failover mechanisms, easy scalability and high performance
Scalability is its key strength. Adding servers for scalability is a straightforward process involving simply incorporating a few additional servers and recycling the cluster triggers automatic repartitioning and redistribution of data. For instance, if the initial database creation involved a hundred servers and later, four more servers are added, specific commands can be executed to increase the partitions to one hundred twenty. The data is then efficiently redistributed across the expanded partitions without the need for manual data movement, ensuring a seamless and efficient scalability process. In my current organization, approximately three projects involve the usage of SingleStore, with a team size ranging from ten to twenty individuals.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"With a cluster available, you can manage the security layer using the shared SDX - it provides flexibility."
"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve. I believe they are working on that."
"This is the only solution that is possible to install on-premise."
"The solution is stable."
"The most valuable feature is Impala, the querying engine, which is very fast."
"In terms of scalability, if you have enough hardware you can scale out. Scalability doesn't have any issues."
"The data science aspect of the solution is valuable."
"Cloudera Distribution for Hadoop provides numerous features and capabilities combined into one platform, offers power processing, supports different file systems and query engines, and provides parallel processing for handling many requests."
"The ability to store data in memory is a standout feature, enhanced by robust failover mechanisms."
"The most valuable feature is the ability to create pipelines, streamline and extract data from the pipelines."
"The paramount advantage is the exceptional speed."
"It's a distributed relational database, so it does not have a single server, it has multiple servers. Its architecture itself is fast because it has multiple nodes to distribute the workload and process large amounts of data."
"The product can automatically reinstall and reconfigure in case of a shutdown."
"MemSQL supports the MySQL protocol, and many functions are similar, so the learning curve is very short."
 

Cons

"Cloudera's support is extremely bad and cannot be relied on."
"There are better solutions out there that have more features than this one."
"The initial setup of Cloudera is difficult."
"There are multiple bugs when we update."
"Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment."
"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there is a lot of things that need to improve."
"Currently, we are using many other tools such as Spark and Blade Job to improve the performance."
"The price of this solution could be lowered."
"Poor key distribution can significantly impact performance, requiring a backward approach in design rather than adding tables incrementally."
"For new customers, it's very tough to start. Their documentation isn't organized, and there's no online training available. SingleStore is working on it, but that's a major drawback."
"Having the ability to migrate servers using a single command would be extremely beneficial."
"There should be more pipelines available because I think that if MemSQL can connect to other services, that would be great."
"It is not the optimal choice for direct data collection through queries, and it's more suited for aggregation tasks."
"We don't get good discounts in Pakistan."
 

Pricing and Cost Advice

"It is an expensive product."
"The tool is not expensive."
"I believe we pay for a three-year license."
"I haven't bought a license for this solution. I'm only using the Apache license version."
"I wouldn't recommend CDH to others because of its high cost."
"The product’s price depends from project to project."
"The solution is fairly expensive."
"The solution is expensive."
"I would advise users to try the free 128GB version."
"Using it for analytical purposes can be cost-effective in the long run, especially in terms of infrastructure."
"They have two main options: cloud installation and bare-metal installation, each with different pricing models."
"SingleStore is a bit expensive."
"The product's licensing is not expensive. It is comparable."
report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
881,733 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
21%
Marketing Services Firm
9%
Computer Software Company
8%
Comms Service Provider
6%
Financial Services Firm
30%
Computer Software Company
10%
Comms Service Provider
8%
Retailer
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business16
Midsize Enterprise9
Large Enterprise31
By reviewers
Company SizeCount
Small Business4
Large Enterprise3
 

Questions from the Community

What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, unlike competitors' cloud-only solutions.
What needs improvement with Cloudera Distribution for Hadoop?
If they could support modifying the data more easily than the current implementation, it would be beneficial.
Ask a question
Earn 20 points
 

Overview

 

Sample Customers

37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
400+ customers including: 6sense, Adobe, Akamai, Ant Money, Arcules, CARFAX, Cigna, Cisco, Comcast, DELL, DBS Bank, Dentsu, DirectlyApply, EY, Factors.AI, Fathom Analytics, FirstEnergy, GE, Goldman Sachs, Heap, Hulu, IMAX, impact.com, Kroger, LG, LiveRamp, Lumana, Nvidia, OpenDialog, Outreach, Palo Alto Networks, PicPay, RBC, Samsung, SegMetrics, Siemens, SiteImprove, SiriusXM, SK Telecom, SKAI, SONY, STC, SunRun, TATA, Thorn, ZoomInfo.
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: January 2026.
881,733 professionals have used our research since 2012.