No more typing reviews! Try our Samantha, our new voice AI agent.

Cloudera Distribution for Hadoop vs IBM InfoSphere BigInsights [EOL] comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Cloudera Distribution for H...
Average Rating
8.0
Reviews Sentiment
6.3
Number of Reviews
51
Ranking in other categories
Hadoop (2nd), NoSQL Databases (10th)
IBM InfoSphere BigInsights ...
Average Rating
7.6
Number of Reviews
7
Ranking in other categories
No ranking in other categories
 

Featured Reviews

SA
Head of Advaced Analytics & Intelligence; AGM at Alinma Bank
Integration of multiple features supports data analytics and processing
Cloudera Distribution for Hadoop provides numerous features and capabilities combined into one platform.The solution offers power processing and supports different file systems and query engines. It provides parallel processing for handling many requests. The platform includes role-based access control in Cloudera Distribution for Hadoop. It secures the data itself and provides users with different roles and privileges.
it_user743022 - PeerSpot reviewer
BigData Consultant at a tech services company with 10,001+ employees
Served our customers better by giving real-time suggestions and proactive maintenance, however the UI was not interactive
* The UI was not interactive: Responses used to be very slow and hang up at times. * The UI was not really helping to track the real-time jobs and its logs. * You can bring in a better UI for job management and health checks. * Developer API documentation needs improvement.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Cloudera is a great product and, overall, there are many features."
"After Hadoop implementation they are getting confidence that now analysis is more appropriate and fast."
"With a cluster available, you can manage the security layer using the shared SDX - it provides flexibility."
"Improved Business Intelligence reporting from daily to every two hours satisfying the business stakeholders who would favour transactional systems to draft reports because it had the latest data."
"For enterprise organizations that can bear the cost, it's a good solution."
"The most valuable feature is Impala, the querying engine, which is very fast."
"I like the combination of all the tools that allow me to provide solutions and enable me to solve the use cases I'm working on."
"We were able to utilize data which was untapped previously."
"Definitely a product worth evaluating, esp if you are an IBM shop and if done on Bluemix, it gives a jump start on protoypes/POCs."
"The thing that I have found most valuable in this solution is the BIQSQL implementation which is fully SQL ANSI compliant."
"This helped us to serve our customers better by giving real-time suggestions and proactive maintenance."
"It gives us the option of extending our analytics system."
"It integrates with JSqsh, enabling us to submit long-running exports from the shell."
"This is a very helpful product, with continuous improvements by IBM and a great customer service which enables easy access to valuable information for both Hadoop developers and system administrators."
"Watson is the perfect engine for text analysis for us, but in 2014 it doesn’t support the Russian language."
"InfoSphere Streams was the one core product from the platform in which we were using. We were building a real-time response system and we built it on InfoSphere Streams."
 

Cons

"On same ground I didn't see much training materials from Cloudera."
"There are better solutions out there that have more features than this one."
"The only thing that needs improvement is the cost, it's a very expensive solution and one of the main reasons companies are not attracted to the product."
"Flume seems unstable and has to be restarted quite often."
"Spark with R integration is missing. Also, it is lacking Spark SQL support."
"Currently, we are using many other tools such as Spark and Blade Job to improve the performance."
"They should focus on upgrading their technical capabilities in the market."
"Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment."
"I'd like to see faster execution time, especially for simple queries that don't touch on many rows and don't involve many operations (Joins, Unions, Groupbys)."
"For our business customer pricing is very important motivation, so I can advise change licensing policy from “by volume in the cluster” to “number of machines in the cluster”."
"The UI was not interactive: Responses used to be very slow and hang up at times."
"Initial setup is rather complex in comparison with Cloudera."
"I encountered issues with having the appropriate documentation resources, as well as getting the right stability when explored virtualized environments based on Virtualbox and HyperV software."
"Unfortunately the stability of the platform was an issue."
"I have found a lot of issues in Fluid Query and BigInsights Applications to move data in the enterprise version."
 

Pricing and Cost Advice

"Cloudera Distribution for Hadoop is expensive, with support costs involved."
"It is an expensive product."
"When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
"The price could be better for the product."
"I wouldn't recommend CDH to others because of its high cost."
"The solution is expensive."
"The solution is fairly expensive."
"Cloudera requires a license to use."
Information not available
report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
900,644 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
23%
Construction Company
10%
Marketing Services Firm
8%
Manufacturing Company
6%
No data available
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business16
Midsize Enterprise9
Large Enterprise32
By reviewers
Company SizeCount
Small Business3
Large Enterprise4
 

Questions from the Community

What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, unlike competitors' cloud-only solutions.
What needs improvement with Cloudera Distribution for Hadoop?
If they could support modifying the data more easily than the current implementation, it would be beneficial.
What is your primary use case for Cloudera Distribution for Hadoop?
We use Cloudera Distribution for Hadoop for many use cases including analytics, storing huge data sets, and various data processing tasks.
Ask a question
Earn 20 points
 

Also Known As

No data available
InfoSphere BigInsights
 

Overview

 

Sample Customers

37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
Coherent Path Inc., Optibus, Delhaize America, Diyotta Inc., Ernst & Young, Teikoku Databank Ltd., NCSU, Vestas
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: May 2026.
900,644 professionals have used our research since 2012.