We use it for data science activities.
Solution Architect at Teradata Corporation
We use it for data science activities. Security and workload management need improvement.
Pros and Cons
- "We use it for data science activities."
- "Security and workload management need improvement."
What is our primary use case?
How has it helped my organization?
Data is now available.
What is most valuable?
I have no preferences towards any feature.
What needs improvement?
- Security
- Performance
- Workload management
Buyer's Guide
Cloudera Data Platform
June 2026
Learn what your peers think about Cloudera Data Platform. Get advice and tips from experienced pros sharing their opinions. Updated: June 2026.
900,747 professionals have used our research since 2012.
For how long have I used the solution?
Less than one year.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Works at a comms service provider with 10,001+ employees
Enabled us to implement fraud detection and improve performance at a lower cost
Pros and Cons
- "Ranger for security; with Ranger we can manager user’s permissions/access controls very easily."
- "We have successfully ported a Microsoft SSIS product application into Hadoop, that saved millions of dollars for the company and, at the same time, they are getting better performance."
- "Hive performance. If Hive performance increased, Hadoop would replace (not everywhere) traditional databases (Oracle/Teradata, etc.), which would save a lot of money for the company."
What is most valuable?
A few of them, namely: Hive/Tez, HBase, Ranger, Yarn and Ambari. Ambari helps managing the platform, Hive is very easy to use. Ranger for security; with Ranger we can manager user’s permissions/access controls very easily.
How has it helped my organization?
We have successfully ported a Microsoft SSIS product application into Hadoop, that saved millions of dollars for the company and, at the same time, they are getting better performance. Also, we implemented fraud detection, as quickly as possible, for the online orders. (Fraudulent orders became a big headache for our company. The early detection of fraud is saving the company a lot of money).
What needs improvement?
Hive performance. If Hive performance increased, Hadoop would replace (not everywhere) traditional databases (Oracle/Teradata, etc.), which would save a lot of money for the company.
For how long have I used the solution?
I have been working on this HDP platform since Jan 2015.
What do I think about the stability of the solution?
No, our company is a satisfied customer.
What do I think about the scalability of the solution?
No, not at all.
What other advice do I have?
Product is good. Reason I gave a rating of eight is that their community is very large and relatively very quick in bug fixes.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Cloudera Data Platform
June 2026
Learn what your peers think about Cloudera Data Platform. Get advice and tips from experienced pros sharing their opinions. Updated: June 2026.
900,747 professionals have used our research since 2012.
BigData(QA & RnD) with 51-200 employees
The user-friendly feature of the Ambari Web UI is one of its best features. On the other hand, the Ambari upgrade is difficult.
Pros and Cons
- "Ambari Web UI: user-friendly."
- "It has helped our organisation cater to clients who are using Big Data for data storage and analysis combined with our security product."
- "Deleting any service requires a lot of clean up, unlike Cloudera."
What is most valuable?
- Ambari Web UI: user-friendly
- Views for Hive, Tez, Pig
- Spark and Ranger
How has it helped my organization?
It has helped our organisation cater to clients who are using Big Data for data storage and analysis combined with our security product.
What needs improvement?
Deleting any service requires a lot of clean up, unlike Cloudera.
For how long have I used the solution?
Five years.
What do I think about the stability of the solution?
Not until now.
What do I think about the scalability of the solution?
No.
How are customer service and technical support?
Very supportive, prompt responses.
Which solution did I use previously and why did I switch?
We didn't use a previous solution.
How was the initial setup?
The Ambari upgrade is not very user-friendly.
What's my experience with pricing, setup cost, and licensing?
Not applicable.
Which other solutions did I evaluate?
Cloudera and MapR.
What other advice do I have?
It's a great company with a great product employing dedicated people.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Big Data - Senior Solutions Architect at a tech vendor with 10,001+ employees
It is open and there is no lock-in.
Pros and Cons
- "Hortonworks is the best, comparing all three flavors."
- "Initially, we went with Cloudera due to it being a popular choice in the market, etc, then realized it was bad choice."
What is most valuable?
We evaluated Cloudera and Hortonworks. Based on our evaluation and actual experience in production of 60 nodes and development of 12 nodes, the most valuable features of Hortonworks are:
- 100% open
- No lock-in like Cloudera
- Fast and accurate support instantly
- Largest number of committers to Hadoop by any means
- Hive is better in performance and ease of use compared to Impala
How has it helped my organization?
It helps a lot in data in motion (ingestion and manage in real time). We are able to do 3rd-party data monetization of our data within a t+20 minute time frame to our end customers.
What needs improvement?
- Cost
- Reliability
- Speed
- Ease of use
For how long have I used the solution?
I have used it for three years.
What was my experience with deployment of the solution?
I initially encountered deployment issues, but they were very good in resolving them.
What do I think about the stability of the solution?
I have not encountered stability issues.
What do I think about the scalability of the solution?
I have not encountered any scalability issues at all. That's the key reason we picked HDP over Cloudera, as Cloudera have issues & don't support compression of Hive in ORC format. They push only their products (not good).
How are customer service and technical support?
Customer Service:
Customer service has been excellent from the day one until now... and our Admin is comfortable with the SLA and turnaround time.
Technical Support:Technical support is very good and proactive with SmartSense.
Which solution did I use previously and why did I switch?
We previously used a different solution. We switched from Cloudera. Initially, we went with Cloudera due to it being a popular choice in the market, etc, then realized it was bad choice. Before we scaled from 6 nodes to 12 nodes and before we went livein production, we scrapped it due to Impala's performance and lock-in.
How was the initial setup?
Using Ambari, it was easy to set up and we even tried the AWS for a test cluster.
What about the implementation team?
An in-house team implemented it: two admins, seven developers, one data scientist, one PM and 22 business users at the customer (end-user side).
What was our ROI?
ROI is 300%.
What's my experience with pricing, setup cost, and licensing?
Hortonworks is the best, comparing all three flavors. If all is well, we might use open source alone in the next three years; others you can't due to lock-in...
Which other solutions did I evaluate?
Before choosing this product, we also evaluate Cloudera.
What other advice do I have?
It is the best in terms of product vision and actual delivery.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Solution Architect at MIMOS Berhad
It gives us semantic analysis based on the feeds from social networking data, clickstream data, etc., but it needs to support disaster recovery features such as mirroring.
Pros and Cons
- "It's the one and only complete open source big data platform Ambari-managed admin configuration for HDFS, YARN, Hive, HBase, etc."
- "Customer Service: 3/10 Technical Support: 3/10"
What is most valuable?
- It's the one and only complete open source big data platform
- Ambari-managed admin configuration for HDFS, YARN, Hive, HBase, etc.
- Customized dashboards
- Web-based HDFS browser
- SQL editor for Hive
- Apache Phoenix - OLTP and operational analytics on Hadoop
- Apache Zeppelin - A web-based notebook that enables interactive data analytics
How has it helped my organization?
- Maintenance of our own data lake in the enterprise-level
- Storage and analysis of server logs
- Applying Operational Intelligence in the enterprise-level based on the analysis of various department units data
- Semantic analysis based on the feeds from social networking data, clickstream data, etc.
What needs improvement?
- Rolling upgrade
- Disaster recovery features such as mirroring should be supported
For how long have I used the solution?
We've used it for one year.
What was my experience with deployment of the solution?
No issues encountered.
What do I think about the stability of the solution?
No issues encountered.
What do I think about the scalability of the solution?
No issues encountered.
How are customer service and technical support?
Customer Service:
3/10
Technical Support:3/10
Which solution did I use previously and why did I switch?
No previous solution was in place.
How was the initial setup?
It's easy to setup.
What about the implementation team?
We did it in-house.
What's my experience with pricing, setup cost, and licensing?
Completely use the community edition along with other features that can be implemented on top.
Which other solutions did I evaluate?
No other solutions were looked at.
What other advice do I have?
Study, analyze, and compare with other big data platforms features according to your requirements before choosing the appropriate one.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
CTO at a tech services company
The setup of hadoop was easy thanks to Ambari, but installing the security components was complex.
Pros and Cons
- "Hadoop/Cloudera is still much cheaper than Oracle's RDBM system, if you want to handle a huge amount of data and make complex analytics."
- "The setup of Hadoop was easy thanks to Ambari, but installing the security components was complex."
What is most valuable?
It has a powerful, user-friendly interface called Ambari which allowed us to administrate our cluster easily.
How has it helped my organization?
It allows us to performa data lake implementation to handle/treat huge amounts of data, or what we call the "terrible bytes”.
What needs improvement?
Integrate a complete hive web client (Ambari views), like Hue Today, in the next release.
For how long have I used the solution?
I've used it for three years.
What do I think about the stability of the solution?
The HDP v2.3 is stable release. But we have encountered some issues linked to hbase (fixed by: hbase server and region server should not be installed on the same node).
How are customer service and technical support?
It has good documentation, but it's not fully complete for complex security needs (knox/ranger with cluster).
Which solution did I use previously and why did I switch?
We have installed our first cluster using native Apache repositories.
How was the initial setup?
The setup of Hadoop was easy thanks to Ambari, but installing the security components was complex.
What about the implementation team?
Our cluster has been implemented in-house. We have automated the entire installation of Hadoop, set-up and configuration included.
What's my experience with pricing, setup cost, and licensing?
Hadoop/Cloudera is still much cheaper than Oracle's RDBM system, if you want to handle a huge amount of data and make complex analytics.
It's 40,000€ for 10 Hadoop nodes vs 1.7 million Euros for an Oracle server with 40 cores.
What other advice do I have?
In short, I recommend this product simply because Hortonworks is the only distribution that runs on Linux and Windows Servers.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Consultant at a tech services company with 51-200 employees
It enables customers to perform sentimental analysis from social media data to engineering analytics. Name Node High Availability is still not stable.
Pros and Cons
- "Hortonworks is 100% Open Source."
- "Security- Although they support Knox and Ranger and Kerberos, they are still missing attribute-level encryption features."
Valuable Features:
Hortonworks is 100% Open Source. Hortonworks does a great job in managing all different components of Hadoop.
Improvements to My Organization:
We've done multiple implementations of it. It enables customers to perform sentimental analysis from social media data to engineering analytics.
Room for Improvement:
Security- Although they support Knox and Ranger and Kerberos, they are still missing attribute-level encryption features.
Name Node High Availability is still not stable (memory issues).
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Principal Consultant - Big Data with 501-1,000 employees
It is improving rapidly, but like other flavors of Hadoop there is room for improvement.
Pros and Cons
- "The Hadoop value proposition is in expanded functionality, linear scalability, and reduced software and infrastructure costs."
- "Hadoop, does not provide improved performance, compared to traditional RDBMS, unless processing batches in the TB-PB range, or if the Hadoop platform has significantly more resources available."
What is most valuable?
- Ambari
- Hive
- Sqoop
- Flume
- Spark
How has it helped my organization?
The Hadoop value proposition is in expanded functionality, linear scalability, and reduced software and infrastructure costs. Hadoop offers several generic frameworks for batch, real-time, and iterative processing, such as map-reduce, spark, and spark streaming. Additionally, these frameworks provide libraries for predictive analytics and machine learning. This type of expanded functionality is not easily achieved on any other single platform.
What needs improvement?
File system to provide indexed access to individual records with in-place update/delete. Also, Security integration through a common interface for authentication, authorization, disk encryption, network encryption, data access layer, data masking, etc.
Hadoop, does not provide improved performance, compared to traditional RDBMS, unless processing batches in the TB-PB range, or if the Hadoop platform has significantly more resources available.
For how long have I used the solution?
I have implemented various flavors of Hadoop over the past five years, including platform configuration and application development.
What was my experience with deployment of the solution?
Deployment is improving rapidly, but like other flavors of Hadoop there are always issues.
What do I think about the stability of the solution?
Stability is improving rapidly, but like other flavors of Hadoop there are always issues.
How are customer service and technical support?
5/10 - Responsive, but like all flavors of Hadoop, there are too many tickets to be reasonably triaged and supported.
Which solution did I use previously and why did I switch?
I have implemented various flavors of Hadoop such as Hortonworks and Cloudera over the past five years, including platform configuration and application development.
How was the initial setup?
Straightforward once you know what you’re doing.
What about the implementation team?
I work for a vendor team.
What was our ROI?
ROI is one of the main reasons organization pursue Hadoop. Cost per TB is a compelling factor.
Which other solutions did I evaluate?
Hadoop is complex. It takes a dedicated approach from individuals with a broad range of technology skills and commitment to overcome challenges that do not normally present themselves in well-established technologies.
What other advice do I have?
Hadoop is complex. It takes a dedicated approach from individuals with a broad range of technology skills and commitment to overcome challenges that do not normally present themselves in well-established technologies.
Disclosure: My company has a business relationship with this vendor other than being a customer. We're partners.
Lead IT Consultant at a tech services company with 5,001-10,000 employees
We've integrated our current distribution of it with Tableau, but we had issues upgrading to the newer versions, but these were resolved with their help.
Pros and Cons
- "Customer service is great."
What is most valuable?
The features I've found most valuable are--
- Ambari UI
- Hive
- Pig
- Hive
- Also integrated Tableau with this distribution
How has it helped my organization?
It's easy to deploy and we've used this distribution for some of our recommendation and trend analysis use cases.
For how long have I used the solution?
I've used it for almost one year.
What was my experience with deployment of the solution?
No issues encountered.
What do I think about the stability of the solution?
No issues encountered.
What do I think about the scalability of the solution?
We faced some issues while upgrading to newer versions with current distributions, but with their support we solved it.
How are customer service and technical support?
Customer Service:
Customer service is great.
Technical Support:Technical support is great.
Which solution did I use previously and why did I switch?
No, we did not use a previous solution.
How was the initial setup?
Initial setup was straightforward.
What about the implementation team?
We implemented it with our in-house team.
Disclosure: My company has a business relationship with this vendor other than being a customer. We're partners.
Associate Consultant at a tech vendor with 501-1,000 employees
The Ambari UI is valuable for cluster monitoring, but there are certain features that need tuning, such as the Hue UI.
Pros and Cons
- "From a product standpoint, their Ambari UI is incredibly valuable for cluster monitoring."
- "As this is open source, there are certain features that need tuning, such as the Hue UI."
What is most valuable?
From a product standpoint, their Ambari UI is incredibly valuable for cluster monitoring. It simplifies the deployment and maintenance of hosts, and we can provision, configure and test Hadoop services.
How has it helped my organization?
From an overall perspective, Hortonworks support is crucial to our operations.
What needs improvement?
As this is open source, there are certain features that need tuning, such as the Hue UI. More stability on this would be helpful.
For how long have I used the solution?
I've used it for one year.
What was my experience with deployment of the solution?
As this is all new technology, we face issues at every level. However, hardware support and documentation have been instrumental in helping us resolve the majority of those issues.
What do I think about the stability of the solution?
We've had some issues with stability, but hardware support and documentation have helped us resolve most of those.
What do I think about the scalability of the solution?
No issues with scalability.
How are customer service and technical support?
They have outstanding customer support. Their responses are prompt, and they resolve issues quickly.
Which solution did I use previously and why did I switch?
I have not used a solution of this nature before.
How was the initial setup?
The set up is straightforward enough, but at every level there are many parameters to be tuned. Ensuring all these parameters are set is the complex part, as poorly set parameters can cause unwanted issues.
What about the implementation team?
We have an in-house team to do implementations. I would advise that all implementations get seen through all the way to having users smoke test applications to ensure correct functionality.
What other advice do I have?
I would suggest that if you are implementing this at an enterprise level, the support is compulsory. Additionally having a high degree of patience is key, as this is open source and road bumps can be frequent when moving at a fast pace.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Download our free Cloudera Data Platform Report and get advice and tips from experienced pros
sharing their opinions.
Updated: June 2026
Product Categories
Data Management Platforms (DMP) Cloud Master Data Management (MDM) AI Data AnalysisPopular Comparisons
Informatica Intelligent Data Management Cloud (IDMC)
Databricks
Qlik Talend Cloud
Palantir Foundry
Cohesity Data Cloud
Reltio Cloud
TIBCO EBX
Cognite Data Fusion
Stibo STEP MDM
Boomi Data Hub
Amazon DataZone
Buyer's Guide
Download our free Cloudera Data Platform Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links













