
Cloudera Distribution for Hadoop vs IBM InfoSphere BigInsights [EOL] comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Cloudera Distribution for H...
Average Rating: 8.0
Reviews Sentiment: 6.3
Number of Reviews: 51
Ranking in other categories: Hadoop (2nd), NoSQL Databases (10th)

IBM InfoSphere BigInsights ...
Average Rating: 7.6
Number of Reviews: 7
Ranking in other categories: No ranking in other categories
 

Featured Reviews

Head of Advanced Analytics & Intelligence; AGM at Alinma Bank
Integration of multiple features supports data analytics and processing
Cloudera Distribution for Hadoop provides numerous features and capabilities combined into one platform. The solution offers powerful processing and supports different file systems and query engines. It provides parallel processing for handling many requests. The platform includes role-based access control, which secures the data itself and provides users with different roles and privileges.
it_user743022 - PeerSpot reviewer
BigData Consultant at a tech services company with 10,001+ employees
Served our customers better by giving real-time suggestions and proactive maintenance, however the UI was not interactive
* The UI was not interactive: Responses used to be very slow and hang up at times.
* The UI was not really helping to track the real-time jobs and its logs.
* You can bring in a better UI for job management and health checks.
* Developer API documentation needs improvement.

Quotes from Members


Pros

"Cloudera is a very manageable solution with good support."
"The most valuable feature is that I can use CDH for almost all use cases across all industries, including the financial sector, public sector, private retailers, and so on."
"The pricing is very competitive, it's not bad."
"It made Hadoop easy to use and made it easy to get started."
"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve. I believe they are working on that."
"The solution is stable."
"Cloudera, as a whole, is designed to provide organizations with solutions for big data."
"The solution's most valuable feature is the enterprise data platform."
"It integrates with JSqsh, enabling us to submit long-running exports from the shell."
"It gives us the option of extending our analytics system."
"This helped us to serve our customers better by giving real-time suggestions and proactive maintenance."
"Definitely a product worth evaluating, especially if you are an IBM shop, and if done on Bluemix, it gives a jump start on prototypes/POCs."
"Watson is the perfect engine for text analysis for us, but in 2014 it doesn’t support the Russian language."
"InfoSphere Streams was the one core product from the platform that we were using. We were building a real-time response system and we built it on InfoSphere Streams."
"This is a very helpful product, with continuous improvements by IBM and a great customer service which enables easy access to valuable information for both Hadoop developers and system administrators."
"The thing that I have found most valuable in this solution is the Big SQL implementation, which is fully ANSI SQL compliant."
 

Cons

"Cloudera 5 is currently very unstable. Between two Cloudera 5 clusters, we have an incident at least twice a week due to what are now outstanding bugs."
"The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. The Cloudera Machine Learning aspect could be tuned and enhanced to enable us to host some predictive analytics machine learning and AI use cases."
"We found some difficulties when importing Hive tables from another cluster."
"While the deployed product is generally functional, there are instances where it presents difficulties."
"There are multiple bugs when we update."
"I would like to see an improvement in how the solution helps me to handle the whole cluster."
"The licensing was by node."
"The performance of some analytics engines provided by Cloudera is not that good."
"The UI was not interactive: Responses used to be very slow and hang up at times."
"For our business customers, pricing is a very important motivation, so I would advise changing the licensing policy from “by volume in the cluster” to “number of machines in the cluster”."
"I encountered issues with having the appropriate documentation resources, as well as with getting the right stability when exploring virtualized environments based on VirtualBox and Hyper-V software."
"I have found a lot of issues in Fluid Query and BigInsights Applications to move data in the enterprise version."
"Initial setup is rather complex in comparison with Cloudera."
"I'd like to see faster execution time, especially for simple queries that don't touch on many rows and don't involve many operations (Joins, Unions, Groupbys)."
"Unfortunately the stability of the platform was an issue."
 

Pricing and Cost Advice

"I haven't bought a license for this solution. I'm only using the Apache license version."
"The tool is not expensive."
"Cloudera requires a license to use."
"The tool is expensive...For the SMB market or customers whose environments are not that complex and do not have multiple systems running, Cloudera might not be a good option."
"I wouldn't recommend CDH to others because of its high cost."
"I believe we pay for a three-year license."
"The price is very high. The solution is expensive."
"The price could be better for the product."
Information not available
900,644 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 23%
Construction Company: 10%
Marketing Services Firm: 8%
Manufacturing Company: 6%
No data available
 

Company Size

By reviewers
Company Size: Count
Small Business: 16
Midsize Enterprise: 9
Large Enterprise: 32

By reviewers
Company Size: Count
Small Business: 3
Large Enterprise: 4
 

Questions from the Community

What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, unlike competitors' cloud-only solutions.
What needs improvement with Cloudera Distribution for Hadoop?
If they could support modifying the data more easily than the current implementation, it would be beneficial.
What is your primary use case for Cloudera Distribution for Hadoop?
We use Cloudera Distribution for Hadoop for many use cases including analytics, storing huge data sets, and various data processing tasks.
 

Also Known As

No data available
InfoSphere BigInsights
 

Overview

 

Sample Customers

37signals, Adconion, adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
Coherent Path Inc., Optibus, Delhaize America, Diyotta Inc., Ernst & Young, Teikoku Databank Ltd., NCSU, Vestas
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: May 2026.