Data Science and Data Engineering Leader | Senior Principal Data Scientist at a healthcare company with 10,001+ employees
A modeling and analysis data platform that is inexpensive and stable
Pros and Cons
- "The scalability is the key reason why we are on this platform."
- "Hortonworks should not be expensive at all to those looking into using it."
- "The version control of the software is also an issue."
What is our primary use case?
We use Hortonworks as a storage platform and then we create machine learning models and do the execution using Cloudera Data Science Workbench. (Cloudera and Hortonworks merged in January of 2019.)
What is most valuable?
The most valuable part of this product is what Cloudera Data Science Workbench can do as a whole for modeling and analysis.
What needs improvement?
We are happy with the platform but we are also looking at what else is out there. We are comparing what other teams are using in our company to the solution we have adopted.
The main difference, and what seems better with other tools, is the deployment. That part is not entirely clear to us. We can create models, but once we create the models we are not sure as to how to deploy them.
The version control of the software is also an issue.
Maybe these improvements are already included in the newest release but we are not aware of it because nobody on our team has had the opportunity to try it. I do not know how well this product is supported.
For how long have I used the solution?
We have been using Hortonworks Data Platform for almost a year.
Buyer's Guide
Cloudera Data Platform
January 2026
Learn what your peers think about Cloudera Data Platform. Get advice and tips from experienced pros sharing their opinions. Updated: January 2026.
881,082 professionals have used our research since 2012.
What do I think about the stability of the solution?
We do not maintain it in our department. We just use it. It has been mostly available other than when they had to upgrade. The IT department did run into some upgrade issues. But as users, it has been stable for us.
What do I think about the scalability of the solution?
We expect to have a lot more data. The scalability is the key reason why we are on this platform.
How was the initial setup?
My IT team is the user group when it comes to installation and setup. They are the ones who do the product installation and management.
What's my experience with pricing, setup cost, and licensing?
I think it is priced well and it is affordable. Hadoop, which we use with the solution, is open-source. That part is free. We pay only for whatever wrappers Cloudera provides on top of the open-source product, Hadoop. I do not know about the actual pricing in total. The whole point of Hadoop is that it is open-source and they have created their own cluster. Cloudera is just the vendor that they are using.
My guess is Hortonworks should not be expensive at all to those looking into using it.
What other advice do I have?
It is important to note that the IT team has to support the product. We are not the IT team so if we have to scale it, someone has to be able to do that administrative job of adding another server and managing the distribution of the data across all the servers. To work with this, you need to have that skill set within the IT department.
On a scale from one to ten (where one is the worst and ten is the best), I would rate Hortonworks Data Platform an eight or nine out of ten. It is meeting a need.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Manager at a tech services company with 201-500 employees
A seamless solution with a solid workflow
Pros and Cons
- "The data platform is pretty neat. The workflow is also really good."
- "It would also be nice if there were less coding involved."
What is our primary use case?
We use this solution for the hospitality industry.
How has it helped my organization?
We use it for end-to-end data processing and data manipulation.
What is most valuable?
The data platform is pretty neat. The workflow is also really good.
What needs improvement?
The NiFi platform could be enhanced. This refers to the data ingestion in a workflow.
It would also be nice if there were less coding involved.
For how long have I used the solution?
I have been using this solution for six years.
How are customer service and support?
The technical support is okay, but not excellent. They can take a while to respond.
What other advice do I have?
If you wish to use this solution, make sure you compare it with some other solutions first to make sure it's right for your needs.
Overall, on a scale from one to ten, I would give this solution a rating of nine.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Senior IT Officer- Head of Administration, System Administration Division for Unix and Linux Servers at a financial services firm with 10,001+ employees
A cost-effective alternative for managing our big data
Pros and Cons
- "Now, using this solution, it is much cheaper to have all of the data available for searching, not in real-time, but whenever there is a pending request."
- "I would like to see more support for containers such as Docker and OpenShift."
What is our primary use case?
We use this solution to look at and manage big data. It's mostly historical data that we offload from our data warehouse, as well as from other databases in other platforms.
We have two different installations. The first one is based on IBM POWER CPUs, and the other one is based on Intel CPUs. Our data center is on-premises. There is some thought of moving to a private cloud, or a private IBM cloud, but we have not proceeded with that yet.
How has it helped my organization?
This solution is a cheaper way for us to offload the otherwise expensive data. We can move data from outdated database versions, such as Oracle 10. It is now out of support, but still hosts some of our historical data. This solution has helped us move our data to the current version.
Previously, we had our data on more expensive platforms. Now, using this solution, it is much cheaper to have all of the data available for searching, not in real-time, but whenever there is a pending request.
What needs improvement?
We have had problems with the backup and with services that require a disaster site. We are still struggling with some of these issues.
We are having trouble with Active Directory and Hive integration.
I would like to see more support for containers such as Docker and OpenShift.
For how long have I used the solution?
About a year and a half.
What do I think about the stability of the solution?
We have had some issues with the code, but it's mostly from the developers. From our side, we don't see any issues with stability, although it may be that we have a lot of unused CPU capacity.
What do I think about the scalability of the solution?
We have not acquired any additional hardware since our initial purchase. However, we expect more use cases to be added, at which point we may have performance or scalability problems.
How was the initial setup?
The initial setup is not very difficult. The configuration is not easy, but somebody with some experience is able to set it up. We had users for which we had to set up quotas and queues. For us, the basic installation was completed within a matter of a week.
What about the implementation team?
We had IBM set up both of our installations.
What other advice do I have?
This is a good product, but we still have some issues with backup, and the performance monitors that we install on every system. There may be solutions, but we're struggling to integrate them.
This is a product that I recommend. It's a solution that comes at a lower price, and it works well if you don't have expectations that it will behave like a much more expensive system.
I would rate this solution an eight out of ten.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Senior HPC and BigData Architect at a comms service provider with 1-10 employees
Provides a complete solution and just one user interface that can manage all the packages
Pros and Cons
- "The Hortonworks solution is so stable. It is working as a production system, without any error, without any downtime. If I have downtime, it is mostly caused by the hardware of the computers."
- "I work a lot with banking, IT and communications customers. Hortonworks must improve or must upgrade their services for these sectors."
What is our primary use case?
Hortonworks actually provides a complete solution and just one user interface that can manage all the packages. It can monitor all the requirements, all the versions, and additionally all the queues and all the hardware-dependent services. What I want is a useful user interface, which is the reason why I currently prefer to use Hortonworks.
What is most valuable?
One of the most valuable features is that you can configure your data nodes in the big data. Whatever you want. Normally, for example, if you are testing websites and Hash clouds and other sites, in most of them you must manage more than three or four requirements. For example, you must install each feature, you must compile some additional things, and also you must manage more than three configuration files to enable all the nodes to work together.
In the Hortonworks solution, you just need the service, and you just want to install it once to get started on projects easily. You can just click run and it's already installed and you can create and communicate between your services.
What needs improvement?
I work a lot with banking, IT and communications customers. Hortonworks must improve or must upgrade their services for these sectors.
Each customer has different requirements. From the IT side, someone who has some experience of the cluster, computer clustering, computer networking, different fire defense, for example, it is so important that they have some additional graphics, some additional service reports added to the Hortonworks current user interface, which could provide easy images. Especially if they are using it without any experience first.
For how long have I used the solution?
I've been using the solution for three years.
What do I think about the stability of the solution?
The Hortonworks solution is very stable. It works as a production system, without any error, without any downtime. If I have downtime, it is mostly caused by the hardware of the computers.
What do I think about the scalability of the solution?
The solution is very scalable. In our procedure, there are two we are using in production without any downtime. The most important thing is the hardware from the computer cluster. I can work on more than 1,000 servers at the same time. On the communication solution, currently, 50 people use it at the same time.
How was the initial setup?
The initial setup is so easy. You can just watch a video. It's handy. If you have some knowledge of computer networking and computer clusters, it is so easy. Deployment time depends on the project and the project size. Sometimes it takes more than three hours to complete.
What's my experience with pricing, setup cost, and licensing?
The solution is comprehensible but it also depends on the customers and the customer's stability requirements. I know that Hortonworks is stable, but sometimes when you are talking with the customers, they wonder if Hortonworks is free, how can it be enterprise. But I explain that Hortonworks is open-source.
What other advice do I have?
The solution is an open-source project. If you don't want to use their professional support services, you don't pay anything for Hortonworks and its solutions. When you want to call there or use its user interface, it's paid. This is why I prefer Hortonworks solutions in my projects.
I would rate this solution 10 out of 10.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Solution Architect at a tech vendor with 10,001+ employees
We use it for data science activities. Security and workload management need improvement.
Pros and Cons
- "We use it for data science activities."
- "Security and workload management need improvement."
What is our primary use case?
We use it for data science activities.
How has it helped my organization?
Data is now available.
What is most valuable?
I have no preferences towards any feature.
What needs improvement?
- Security
- Performance
- Workload management
For how long have I used the solution?
Less than one year.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Works at a comms service provider with 10,001+ employees
Enabled us to implement fraud detection and improve performance at a lower cost
Pros and Cons
- "Ranger for security; with Ranger we can manage users' permissions/access controls very easily."
- "Hive performance. If Hive performance increased, Hadoop would replace (not everywhere) traditional databases."
What is most valuable?
A few of them, namely: Hive/Tez, HBase, Ranger, Yarn and Ambari. Ambari helps manage the platform, and Hive is very easy to use. Ranger for security; with Ranger we can manage users' permissions/access controls very easily.
How has it helped my organization?
We have successfully ported a Microsoft SSIS product application into Hadoop, which saved millions of dollars for the company while also delivering better performance. We also implemented fraud detection, as quickly as possible, for online orders. (Fraudulent orders became a big headache for our company. The early detection of fraud is saving the company a lot of money.)
What needs improvement?
Hive performance. If Hive performance increased, Hadoop would replace (not everywhere) traditional databases (Oracle/Teradata, etc.), which would save a lot of money for the company.
For how long have I used the solution?
I have been working on this HDP platform since Jan 2015.
What do I think about the stability of the solution?
We have not had stability issues. Our company is a satisfied customer.
What do I think about the scalability of the solution?
We have not had any scalability issues at all.
What other advice do I have?
The product is good. I gave it a rating of eight because its community is very large and relatively quick with bug fixes.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
BigData(QA & RnD) with 51-200 employees
The user-friendly feature of the Ambari Web UI is one of its best features. On the other hand, the Ambari upgrade is difficult.
Pros and Cons
- "Ambari Web UI: user-friendly."
- "Deleting any service requires a lot of clean up, unlike Cloudera."
What is most valuable?
- Ambari Web UI: user-friendly
- Views for Hive, Tez, Pig
- Spark and Ranger
How has it helped my organization?
It has helped our organisation cater to clients who are using Big Data for data storage and analysis combined with our security product.
What needs improvement?
Deleting any service requires a lot of clean up, unlike Cloudera.
For how long have I used the solution?
Five years.
What do I think about the stability of the solution?
We have not had any stability issues so far.
What do I think about the scalability of the solution?
We have not had any scalability issues.
How are customer service and technical support?
Very supportive, prompt responses.
Which solution did I use previously and why did I switch?
We didn't use a previous solution.
How was the initial setup?
The Ambari upgrade is not very user-friendly.
What's my experience with pricing, setup cost, and licensing?
Not applicable.
Which other solutions did I evaluate?
Cloudera and MapR.
What other advice do I have?
It's a great company with a great product employing dedicated people.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Big Data - Senior Solutions Architect at a tech vendor with 10,001+ employees
It is open and there is no lock-in.
What is most valuable?
We evaluated Cloudera and Hortonworks. Based on our evaluation and actual experience in production of 60 nodes and development of 12 nodes, the most valuable features of Hortonworks are:
- 100% open
- No lock-in like Cloudera
- Fast and accurate support instantly
- Largest number of committers to Hadoop by any means
- Hive is better in performance and ease of use compared to Impala
How has it helped my organization?
It helps a lot with data in motion (ingestion and management in real time). We are able to do 3rd-party data monetization of our data within a t+20 minute time frame to our end customers.
What needs improvement?
- Cost
- Reliability
- Speed
- Ease of use
For how long have I used the solution?
I have used it for three years.
What was my experience with deployment of the solution?
I initially encountered deployment issues, but they were very good in resolving them.
What do I think about the stability of the solution?
I have not encountered stability issues.
What do I think about the scalability of the solution?
I have not encountered any scalability issues at all. That's the key reason we picked HDP over Cloudera, as Cloudera has issues and doesn't support compression of Hive in ORC format. They push only their own products (not good).
How are customer service and technical support?
Customer Service:
Customer service has been excellent from the day one until now... and our Admin is comfortable with the SLA and turnaround time.
Technical Support:
Technical support is very good and proactive with SmartSense.
Which solution did I use previously and why did I switch?
We previously used a different solution. We switched from Cloudera. Initially, we went with Cloudera because it was a popular choice in the market, then realized it was a bad choice. Before we scaled from 6 nodes to 12 nodes and before we went live in production, we scrapped it due to Impala's performance and lock-in.
How was the initial setup?
Using Ambari, it was easy to set up and we even tried the AWS for a test cluster.
What about the implementation team?
An in-house team implemented it: two admins, seven developers, one data scientist, one PM and 22 business users at the customer (end-user side).
What was our ROI?
ROI is 300%.
What's my experience with pricing, setup cost, and licensing?
Hortonworks is the best, comparing all three flavors. If all is well, we might use open source alone in the next three years; others you can't due to lock-in...
Which other solutions did I evaluate?
Before choosing this product, we also evaluated Cloudera.
What other advice do I have?
It is the best in terms of product vision and actual delivery.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.