Try our new research platform with insights from 80,000+ expert users
Cloudera Distribution for Hadoop Logo

Cloudera Distribution for Hadoop pros and cons

Vendor: Cloudera
4.0 out of 5
Badge Leader

Pros & Cons summary

Buyer's Guide

Get pricing advice, tips, use cases and valuable features from real users of this product.
Get the report

Prominent pros & cons

PROS

Cloudera Distribution for Hadoop is highly scalable, supporting expansion with adequate hardware.
It provides a fast and efficient querying engine, Impala, for rapidly processing data.
The platform supports extensive data processing capabilities for unstructured data like XML and social media logs.
Cloudera Distribution for Hadoop offers strong security features and excellent customer service and support.
It integrates various features into one platform, allowing for powerful processing and flexibility in handling large data volumes both on-premise and in the cloud.

CONS

Cloudera Distribution for Hadoop struggles with support, being unreliable and deteriorating over time.
The infrastructure and performance need significant improvement, with better performance engines required.
Pricing is too high, limiting accessibility for some users.
Multiple bugs occur during updates, affecting stability and reliability.
The configuration and installation process is complex and challenging, especially for on-premise setups.
 

Cloudera Distribution for Hadoop Pros review quotes

LS
Nov 4, 2022
Very good end-to-end security features.
Thishen Govender - PeerSpot reviewer
Jul 14, 2019
Provides a viable open-source solution for enterprise implementations and reliable, intelligent data analysis.
Shahan Rehman - PeerSpot reviewer
Mar 21, 2024
The tool can be deployed using different container technologies, which makes it very scalable.
Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: August 2025.
865,295 professionals have used our research since 2012.
reviewer1850319 - PeerSpot reviewer
Apr 29, 2022
We're now able to store large volumes of data through Cloudera Distribution for Hadoop. We're able to push large volumes of data to the platform, and that used to be a challenge, especially when storing a terabyte of information. This is the area where Cloudera Distribution for Hadoop improved the organization.
Miodrag-Stanic - PeerSpot reviewer
Dec 19, 2023
We had a data warehouse before all the data. We can process a lot more data structures.
Mohamed-Saied - PeerSpot reviewer
Apr 10, 2024
The tool's most interesting features are the distributed file system and unstructured data processing capability. Because we have a lot of unstructured data, like XML and social media logs, these features make it more valuable than the usual data warehousing solutions.
Miodrag Milojevic - PeerSpot reviewer
Jul 21, 2023
The scalability of Cloudera Distribution for Hadoop is excellent.
it_user900987 - PeerSpot reviewer
Jul 16, 2019
In terms of scalability, if you have enough hardware you can scale out. Scalability doesn't have any issues.
it_user1296588 - PeerSpot reviewer
Mar 9, 2020
The most valuable feature is Impala, the querying engine, which is very fast.
MG
Mar 25, 2020
We also really like the Cloudera community. You can have any question and will have your answer within a few hours.
 

Cloudera Distribution for Hadoop Cons review quotes

LS
Nov 4, 2022
The Cloudera training has deteriorated significantly.
Thishen Govender - PeerSpot reviewer
Jul 14, 2019
The solution does not support multiple languages very well and this means users need to create work-arounds to implement some solutions.
Shahan Rehman - PeerSpot reviewer
Mar 21, 2024
The tool's ability to be deployed on a cloud model is an area of concern where improvements are required.
Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: August 2025.
865,295 professionals have used our research since 2012.
reviewer1850319 - PeerSpot reviewer
Apr 29, 2022
Cloudera Distribution for Hadoop has a limited feature list and a lot of costs involved.
Miodrag-Stanic - PeerSpot reviewer
Dec 19, 2023
The solution is not fit for on-premise distributions.
Mohamed-Saied - PeerSpot reviewer
Apr 10, 2024
The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. The Cloudera Machine Learning aspect could be tuned and enhanced to enable us to host some predictive analytics machine learning and AI use cases.
Miodrag Milojevic - PeerSpot reviewer
Jul 21, 2023
Cloudera Distribution for Hadoop is not always completely stable in some cases, which can be a concern for big data solutions.
it_user900987 - PeerSpot reviewer
Jul 16, 2019
The one thing that we struggled with predominately was support. Because it was relatively new, support was always a big issue and I think it's still a bit of an ongoing concern with the team currently managing it.
it_user1296588 - PeerSpot reviewer
Mar 9, 2020
There is a maximum of a one-gigabyte block size, which is an area of storage that can be improved upon.
MG
Mar 25, 2020
Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment.