

Cloudera Distribution for Hadoop and ScyllaDB are both in the enterprise data management sector. Cloudera is favored for its comprehensive administrative features, while ScyllaDB leads in speed and efficiency, particularly in high-volume data environments.
Features: Cloudera Distribution for Hadoop includes Cloudera Manager for cluster management, Impala for fast SQL queries, and Sentry for role-based authorization. ScyllaDB, written in C++ on the Seastar framework, delivers high performance with fine-grained resource management and fast response times even in distributed systems.
Room for Improvement: Cloudera has stability and processing-speed issues with HBase 1.0 and is difficult to configure and integrate with other technologies. Reviewers also want its licensing and pricing revised. ScyllaDB needs better documentation, better handling of heavy deletions, and improved data modeling. Users also ask for an easier setup process and better transaction management.
Ease of Deployment and Customer Service: Cloudera supports on-premises and cloud deployment, and reviewers report mixed experiences with its customer support. ScyllaDB focuses on hybrid and on-premises deployments and is praised for efficient technical support, although its documentation is not always clear.
Pricing and ROI: Cloudera is seen as expensive and best suited to large enterprises, and its long-term ROI is difficult to measure. ScyllaDB is also costly but valued for its speed and efficiency. Its enterprise version adds value through direct support, though pricing remains a concern.

| Product | Mindshare (%) |
|---|---|
| ScyllaDB | 6.6% |
| Cloudera Distribution for Hadoop | 4.9% |
| Other | 88.5% |

| Company Size | Count |
|---|---|
| Small Business | 16 |
| Midsize Enterprise | 9 |
| Large Enterprise | 31 |

| Company Size | Count |
|---|---|
| Small Business | 3 |
| Midsize Enterprise | 2 |
| Large Enterprise | 8 |
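
The two company-size tables above report raw reviewer counts, which are hard to compare because the totals differ. A minimal sketch, assuming only the counts shown, converts each table into percentage shares. The source does not say which product each table belongs to, so the names `first_table` and `second_table` are placeholders, and `size_shares` is a hypothetical helper written for this illustration.

```python
def size_shares(counts):
    """Return each company size's share of the table total, in percent."""
    total = sum(counts.values())
    return {size: round(100 * n / total, 1) for size, n in counts.items()}


# Counts copied from the two tables above; which product each table
# describes is not stated in the source, so the names are placeholders.
first_table = {"Small Business": 16, "Midsize Enterprise": 9, "Large Enterprise": 31}
second_table = {"Small Business": 3, "Midsize Enterprise": 2, "Large Enterprise": 8}

first_shares = size_shares(first_table)
second_shares = size_shares(second_table)

for label, shares in (("first table", first_shares), ("second table", second_shares)):
    for size, pct in shares.items():
        print(f"{label}: {size} = {pct}%")
```

In both tables, large enterprises account for more than half of the counted reviewers.
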

Cloudera Distribution for Hadoop provides a comprehensive platform for efficient data management and analytics, integrating advanced analytics tools with enterprise-grade security and hybrid cloud support.
Designed for handling vast datasets, Cloudera Distribution for Hadoop processes data through components such as Hive, Pig, and Spark. It manages both structured and unstructured data and scales to large workloads. While the latest version focuses on speed and integration, challenges remain with HBase stability and processing in Cloudera 5 clusters. Organizations use it for big data tasks such as data warehousing, log analytics, and real-time data processing.
In industries such as finance, retail, and healthcare, Cloudera Distribution for Hadoop is used to improve data-driven decision-making and operational efficiency. It processes large volumes of data for analytics, data warehousing, and infrastructure building. Companies use it to streamline machine learning and log analytics, and as a data lake for preprocessing substantial datasets.
ScyllaDB is known for its speed, efficiency, and reliability. It runs with minimal resources, supports large data volumes, and allows detailed fine-tuning. Built on the Seastar framework, it offers a performance edge over Cassandra and prioritizes ease of use with robust data synchronization.
ScyllaDB stands out for high-volume data handling, which makes it well suited to distributed systems. It integrates easily with APIs, supports user-defined types, and offers flexibility through compatibility with the Cassandra SDK and DynamoDB. High availability, effective partitioning, easy updates, and rapid query responses make it attractive to organizations working with real-time analytics and time-series data. However, users find setup more complex than with MongoDB or Postgres and want improvements in data export, deletion processes, and documentation. They also report challenges with transactions, support response, compatibility, and AWS integration; some of these require custom commands, which may limit portability.
Industries such as telecommunications, advertising, mobility services, and security use ScyllaDB to manage extensive datasets and secure real-time data access. Its scalability and efficiency make it a preferred choice for data transformation services that combine complex data flows with real-time analytics.