
Cloudera Distribution for Hadoop vs IBM InfoSphere BigInsights [EOL] comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Cloudera Distribution for H...
Average Rating: 8.0
Reviews Sentiment: 6.3
Number of Reviews: 51
Ranking in other categories: Hadoop (2nd), NoSQL Databases (10th)

IBM InfoSphere BigInsights ...
Average Rating: 7.6
Number of Reviews: 7
Ranking in other categories: None
 

Featured Reviews

Head of Advanced Analytics & Intelligence; AGM at Alinma Bank
Integration of multiple features supports data analytics and processing
Cloudera Distribution for Hadoop provides numerous features and capabilities combined into one platform. The solution offers powerful processing and supports different file systems and query engines. It provides parallel processing for handling many requests. The platform includes role-based access control, which secures the data itself and gives users different roles and privileges.
it_user743022 - PeerSpot reviewer
BigData Consultant at a tech services company with 10,001+ employees
Served our customers better by giving real-time suggestions and proactive maintenance; however, the UI was not interactive
* The UI was not interactive: Responses used to be very slow and hang up at times.
* The UI was not really helping to track the real-time jobs and their logs.
* You can bring in a better UI for job management and health checks.
* Developer API documentation needs improvement.

Quotes from Members

 

Pros

"Cloudera is doing a great job in the field offering an enterprise ready data platform."
"Very solid. Excellent user experience. Good documentation."
"I am very comfortable with this product."
"We used it to build an enterprise data hub."
"The file system is a valuable feature."
"Cloudera Manager is the most valuable feature for its ease of use, features, ease of upgrade and install components."
"The most valuable feature is Impala, the querying engine, which is very fast."
"Cloudera really has no competition."
"It integrates with JSqsh, enabling us to submit long-running exports from the shell."
"It gives us the option of extending our analytics system."
"InfoSphere Streams was the one core product from the platform that we were using. We were building a real-time response system and we built it on InfoSphere Streams."
"This helped us to serve our customers better by giving real-time suggestions and proactive maintenance."
"Definitely a product worth evaluating, especially if you are an IBM shop, and if done on Bluemix, it gives a jump start on prototypes/POCs."
"Watson is the perfect engine for text analysis for us, but in 2014 it doesn’t support the Russian language."
"The thing that I have found most valuable in this solution is the Big SQL implementation, which is fully ANSI SQL compliant."
"This is a very helpful product, with continuous improvements by IBM and a great customer service which enables easy access to valuable information for both Hadoop developers and system administrators."
 

Cons

"Flume seems unstable and has to be restarted quite often."
"The one thing that we struggled with predominately was support. Because it was relatively new, support was always a big issue and I think it's still a bit of an ongoing concern with the team currently managing it."
"The user infrastructure and user interface need to be improved, as well as the performance. The GUI needs to be better."
"Cloudera 5 is currently very unstable. Between two Cloudera 5 clusters, we have an incident at least twice a week due to what are now outstanding bugs."
"On the same grounds, I didn't see many training materials from Cloudera."
"The performance of some analytics engines provided by Cloudera is not that good."
"The Cloudera training is terrible."
"Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment."
"I encountered issues with having the appropriate documentation resources, as well as getting the right stability when exploring virtualized environments based on VirtualBox and Hyper-V software."
"I'd like to see faster execution time, especially for simple queries that don't touch on many rows and don't involve many operations (Joins, Unions, Groupbys)."
"The UI was not interactive: Responses used to be very slow and hang up at times."
"For our business customers, pricing is a very important motivation, so I would advise changing the licensing policy from “by volume in the cluster” to “number of machines in the cluster”."
"I have found a lot of issues in Fluid Query and BigInsights Applications to move data in the enterprise version."
"Unfortunately the stability of the platform was an issue."
"Initial setup is rather complex in comparison with Cloudera."
 

Pricing and Cost Advice

"The product’s price varies from project to project."
"I wouldn't recommend CDH to others because of its high cost."
"Cloudera Distribution for Hadoop is expensive, with support costs involved."
"When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
"The pricing must be improved."
"It is an expensive product."
"The solution is fairly expensive."
"I believe we pay for a three-year license."
Information not available
900,644 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 23%
Construction Company: 10%
Marketing Services Firm: 8%
Manufacturing Company: 6%
No data available
 

Company Size

By reviewers
Company Size          Count
Small Business        16
Midsize Enterprise    9
Large Enterprise      32

By reviewers
Company Size          Count
Small Business        3
Large Enterprise      4
 

Questions from the Community

What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, unlike competitors' cloud-only solutions.
What needs improvement with Cloudera Distribution for Hadoop?
If they could support modifying the data more easily than the current implementation, it would be beneficial.
What is your primary use case for Cloudera Distribution for Hadoop?
We use Cloudera Distribution for Hadoop for many use cases including analytics, storing huge data sets, and various data processing tasks.
 

Also Known As

No data available
InfoSphere BigInsights
 

Overview

 

Sample Customers

37signals, Adconion, adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
Coherent Path Inc., Optibus, Delhaize America, Diyotta Inc., Ernst & Young, Teikoku Databank Ltd., NCSU, Vestas
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: May 2026.