- Cloudera Manager for administering the Hadoop cluster
- Cloudera specific solutions like Impala
- Extensive documentation
- Good user community
Vice President - Big Data and Delivery at a computer software company with 51-200 employees
Cloudera Manager is a good tool to administer. Sometimes it gets confusing to follow a single path for installation.
What is most valuable?
How has it helped my organization?
Implementing a Hadoop cluster has become relatively straight-forward using CDH. Administering it is also less complex. As a result, efforts spent in these areas are less than anticipated.
What needs improvement?
- Some of the UI features seem confusing e.g. charts on the CM Services page
- Sometimes it gets confusing to follow a single path for installation due to multiple recommended approaches e.g. parcels vs packages
For how long have I used the solution?
We have been using it for the last two years.
Buyer's Guide
Cloudera Distribution for Hadoop
May 2025

Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: May 2025.
851,823 professionals have used our research since 2012.
What was my experience with deployment of the solution?
Following a single path for installation becomes confusing due to multiple recommended approaches e.g. parcels vs packages.
What do I think about the stability of the solution?
Flume seems unstable and has to be restarted quite often.
What do I think about the scalability of the solution?
None as such
How are customer service and support?
We are mostly using Cloudera Express so we did not use their technical support. However, the Cloudera community is an active place and Cloudera representatives participate actively in understanding and resolving issues.
Which solution did I use previously and why did I switch?
Cloudera is a prominent player in the Hadoop space and we did not have a need to adopt a different solution. However, we are also looking to work on Hadoop and MapR
How was the initial setup?
Following a single path for installation was initially confusing due to multiple recommended approaches e.g. parcels vs. packages. However, after a while, we managed to master it. However, knoweldge of Cloudera Manager and Hadoop architecture is a must.
What about the implementation team?
We have our own team of consultants who are proficient in implementing it. The high level steps about the implementation remain the same; however, it is the environment specific issues which are challenging.
What was our ROI?
We haven't really measured ROI.
What's my experience with pricing, setup cost, and licensing?
Licensing price on per node basis for Cloudera seems to be pretty steep (based on the inputs we have received from Cloudera).
What other advice do I have?
It is user friendly and installation is pretty straightforward. Cloudera Manager is a good tool to administer it. However, configuration for specific requirements is sometimes pretty complex.
You should have a team which is knowledgeable in Hadoop. Do keep in mind that the product is still maturing so there are good chances that you will come across unexpected issues now and then.
Disclosure: My company has a business relationship with this vendor other than being a customer: We're Cloudera partners and regularly install CDH
Architect at a marketing services firm with 501-1,000 employees
Cloudera Manager Hadoop Cluster Installation Evaluation
I decided to give Cloudera's Manager software a try, and was pleasantly surprised at how simple it becomes to deploy a substantial Hadoop cluster.
I began by creating an automated kickstart installer for RHEL 6.2 (booting off a custom isolinux image created for this purpose), with all of the required packages, so that from server power on to creating a 20+ node cluster takes less than 15 minutes. The limitation for the number of concurrent node installs is based on network and disk i/o bottlenecks on the deployment server. If you wanted to PXE boot the cluster in a production environment, you would want a bank of servers behind a load balancer, optimally.
Once the Manager is installed on the master node, you simply log into the administration webpage, and from there, add all of the hosts to deploy the cluster on. One nice discovery was that it takes advantage of regular expressions for host names or IP addresses, so you can literally create a cluster containing hundreds of nodes with a trivial amount of effort.
Once the software is deployed, you can select the roles for each of the servers. It's an incredibly painless deployment. That being said, it is not without its flaws.
One of the primary flaws is that all of the configuration and log files are in non-standard locations, and are split in non-standard ways. It's obvious from the way that the files are arranged that it simplifies programmatic deployment. It also makes it a bit harder for a human who is used to standard Hadoop deployments to figure out where everything is located.
And finally, I discovered a bug with one of the packaged software products, Oozie. One of the resource files, oozie-bundle-0.1.xsd contains an invalid regular expression on line 22. I haven't tracked down the behavior, but for some reason JDK 1.6.30 will parse that invalid regex, but JDK 1.7U2 will exit with errors. Naturally, I was running JDK 1.7U2, so it took me a little extra time to debug the problem.
Overall, I quite liked Cloudera's Manager. It's certainly one of the better cluster deployment products I've seen.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Buyer's Guide
Cloudera Distribution for Hadoop
May 2025

Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: May 2025.
851,823 professionals have used our research since 2012.
Consultant at a tech consulting company with 51-200 employees
The Cloudera Hadoop manager eased the work of orchestrating scripts.
Valuable Features
Very solid. Excellent user experience. good documentation. The Cloudera Manager is definitely a deal breaker. Packaging for Ubuntu is great for all the components.
Improvements to My Organization
Before the introduction of Cloudera Manager (that actually works), all the orchestration was done with scripts and Chef, and inexperienced team members had difficulties to participate in maintenance. The Cloudera Hadoop manager eased the work.
Room for Improvement
More customization, better documentation for the API (basically it's the same for all Cloudera Hadoop components).
Use of Solution
I've used it for two years.
Deployment Issues
No issues encountered.
Stability Issues
No issues encountered.
Scalability Issues
No issues encountered.
Customer Service and Technical Support
Didn't use dedicated service or support. The documentation is a bit of a mess, but it is decent and sufficient.
Initial Setup
Straightforward. The CDH VirtualBox with preconfigured environment helps for demonstration purposes
Implementation Team
We did it in-house.
Other Solutions Considered
We also looked at Hortonworks, but chose Cloudera because of my familiarity with it.
Other Advice
Do a comparisomn with Hortonworks as it's always good to compare to another major vendor
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Lead Bigdata Developer at a tech services company with 10,001+ employees
We used it to build an enterprise data hub, but Apache Kudu needs improvement.
Valuable Features:
The most valuable feature for me are--
- Sentry - provides granular-level security
- Impala - open-source, MPP database
Improvements to My Organization:
We used it to build an enterprise data hub.
Room for Improvement:
Apache Kudu needs improvement. It's a real-time updatable database.
Implementation Team:
We used a vendor team to implement the solution.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Chief Executive Officer at a financial services firm with 51-200 employees
Overall operational, stable but price could be better
Pros and Cons
- "The product as a whole is good."
- "There are better solutions out there that have more features than this one."
What is our primary use case?
We use the solution for the data warehousing.
What is most valuable?
The product as a whole is good.
What needs improvement?
There are better solutions out there that have more features than this one.
For how long have I used the solution?
I have just started using the solution.
What do I think about the stability of the solution?
I do not know of any issues with the stability of the solution.
What about the implementation team?
I have an internal team that does maintenance for the solution.
What's my experience with pricing, setup cost, and licensing?
The price could be better for the product.
Which deployment model are you using for this solution?
On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
R&D Solutions Architect at a tech vendor with 10,001+ employees
It has good ease of use in terms of integration within the Hadoop ecosystem related products.
Valuable Features
Enterprise resource management, ease of use in terms of integration within the Hadoop ecosystem related products, and security.
Room for Improvement
Mainly they have to continuously evolve following the technology trends and replace or adapt part of their solutions accordingly.
Use of Solution
We've used it since October 2012.
Deployment Issues
No issues encountered.
Stability Issues
No issues encountered.
Scalability Issues
No issues encountered.
Customer Service and Technical Support
Pretty responsive and reactive compared to their competitors in the field.
Initial Setup
It was extremely easy, and allowed less experienced personnel to get into the context pretty fast. Any difficulties/complexities faced were not related to the product itself rather than to the cluster infrastructure used.
Implementation Team
In our case it was an in-house team including data scientists and data engineers (management & QA as well). With the appropriate training and the support offered by the vendor, it is not that hard to implement a small to medium scale project solution. However, complexity and size varies significantly between projects; therefore, it really depends.
ROI
That is not easy to answer since Huawei has several divisions using the product in different ways. Again regarding pricing/licensing highly depends on the context and the aims of the given organization for instance the level of support they are going to need, the type of services they are going to provide, or even the business domain they are targeting.
Other Solutions Considered
There were two provider solutions that have been evaluated. However, the level of customer service and technical support from Cloudera was better than the first one, and the second solution licence pricing was higher compared to Cloudera’s pricing schema.
Other Advice
Cloudera is doing a great job in the field offering an enterprise ready data platform. Based on my experiences I would definitely recommend it.
Disclosure: My company has a business relationship with this vendor other than being a customer: We do have a partnership with Cloudera.
Data engineer at a tech services company with 11-50 employees
Supports a wide range of tools and has a good support community
Pros and Cons
- "We also really like the Cloudera community. You can have any question and will have your answer within a few hours."
- "Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment."
What is our primary use case?
Our primary use case for this solution is to host a big amount of data in our platform, processing, analysis and all of this stuff on the platform.
What is most valuable?
Cloudera is always developing new tools and supports a wide range of tools. We also really like the Cloudera community. You can have any question and will have your answer within a few hours. Cloudera is better than other competitors because they acquired Hortonworks.
What needs improvement?
We're processing a huge amount of data on our system. Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment. Cloudera is trying to adopt new technologies.
I think the idea of open source tools now is dominating. So Cloudera has to decide how to deal with open-source tools. I subscribe to Cloudera to get an enterprise version but I have found that I can get some of its features from other vendors that would be at a lower cost than Cloudera. They should lower the price.
For how long have I used the solution?
We have been using Cloudera for a year.
What do I think about the stability of the solution?
It's stable. I have no issue regarding the stability.
What do I think about the scalability of the solution?
It's scalable. You can add more nodes and you can expand your cluster easily.
How are customer service and technical support?
After we open a ticket, the issue can be resolved very quickly, they have a management portal. I don't contact them directly, but I haven't heard anybody having any problems with it.
How was the initial setup?
The initial setup is complicated. We needed the vendor to install it themselves. The deployment took around three weeks. Three people were involved because they just follow up and supervise the deployment, but they're not deploying anything. The vendor does it.
What other advice do I have?
In terms of the advice, I would say to focus on what tools are available on the market. In terms of open-source, most companies are delivering open source technologies and providing support to these tools. Now I have the option to purchase a license for whatever platform for $1. I can deliver it with another small company at a lower cost. If I was the decision-maker, I'd invest in open-source tools. Cloudera and all of these companies are trying to adapt to these big data technologies and open source tools. Cloudera is trying to put it inside their platform so that we can have a compatible solution.
I would rate it an eight out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Lead Consultant - Product Development at FIS (http://www.fisglobal.com/)
We use this solution to use big data for our analyses
What is our primary use case?
Our core product is an insurance product and the actuarial module is quite complex. SMEs so far collect data from various sources into Excel sheets and through macros do the analytics which is a very crude form of doing the analysis. So we thought to use big data for such analysis.
How has it helped my organization?
That is still in PUC stage, as I have mentioned our analyst used to do the actuarial on a spreadsheet but after Hadoop implementation they are getting confidence that now analysis is more appropriate and fast. Now exploring cloud implementation as well.
What is most valuable?
Keeping multi copies of the file and tools of map reduce like PIG, HIVE due to their flexibility it is easy to develop the application with less or almost no knowledge of Java and Sql. And capability to handle huge data size.
What needs improvement?
As such in the product side, I don't have much to comment. But like other upcoming technologies like RPA, AI, GO etc they have ample training materials with variety of USE Cases, which users can understand and aligned with their current requirements. On same ground I didn't see much training materials from Cloudera.
For how long have I used the solution?
One to three years.
What do I think about the stability of the solution?
Seems quite stable, as such didn't face any issue.
What do I think about the scalability of the solution?
It is very stable, didn't face any performance issue.
Which solution did I use previously and why did I switch?
No when we were heard of Hadoop, we tried on that only. I mean tried to migrate from spreadsheets to Hadoop.
How was the initial setup?
Very straight forward. Typical Windows type installation...Next, next, next clicks.
What about the implementation team?
In-house.
What was our ROI?
Other department handles all these so I can't comment on that.
What's my experience with pricing, setup cost, and licensing?
Which other solutions did I evaluate?
Not really.
Disclosure: I am a real user, and this review is based on my own experience and opinions.

Buyer's Guide
Download our free Cloudera Distribution for Hadoop Report and get advice and tips from experienced pros
sharing their opinions.
Updated: May 2025
Popular Comparisons
Apache Spark
HPE Ezmeral Data Fabric
Apache HBase
Neo4j Graph Database
Oracle NoSQL
Buyer's Guide
Download our free Cloudera Distribution for Hadoop Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
Hi
Can I have Cloudera's Manager software for free to test and deploy it on a sandBox to work on a POC purposes.