

Cloudera Data Science Workbench and Dremio compete in the data platform category. Cloudera has an advantage in support and pricing, while Dremio offers compelling features for real-time analytics.
Features: Cloudera Data Science Workbench is known for its collaboration tools, scalability, and security features, facilitating extensive data processing workflows. Dremio offers self-service analytics, data lake optimization, and intuitive data exploration capabilities, which enhance performance and enable real-time data insights.
Ease of Deployment and Customer Service: Cloudera provides flexible deployment options, including cloud and on-premises solutions, paired with reliable customer service for a smoother implementation. Dremio, with its cloud-native approach, simplifies integration and offers responsive support, making deployment straightforward.
Pricing and ROI: Cloudera Data Science Workbench has competitive initial costs and offers a favorable return on investment through cost-efficiency and productivity gains. Dremio, though potentially more expensive at first, delivers rapid ROI through enhanced analytics efficiency and performance-driven returns.
| Product | Market Share (%) |
|---|---|
| Dremio | 2.3% |
| Cloudera Data Science Workbench | 1.6% |
| Other | 96.1% |


| Company Size | Count |
|---|---|
| Small Business | 1 |
| Midsize Enterprise | 5 |
| Large Enterprise | 5 |
Cloudera Data Science Workbench (CDSW) makes secure, collaborative data science at scale a reality for the enterprise and accelerates the delivery of new data products. With CDSW, organizations can research and experiment faster, deploy models easily and with confidence, as well as rely on the wider Cloudera platform to reduce the risks and costs of data science projects. Access any data anywhere – from cloud object storage to data warehouses, CDSW provides connectivity not only to CDH but the systems your data science teams rely on for analysis.
Dremio offers a comprehensive platform for data warehousing and data engineering, integrating seamlessly with data storage systems like Amazon S3 and Azure. Its main features include scalability, query federation, and data reflection.
Dremio's core strength lies in its ability to function as a robust data lake query engine and data warehousing solution. It facilitates the creation of complex queries with ease, thanks to its support for Apache Airflow and query federation across endpoints. Despite challenges with Delta connector support, complex query execution, and expensive licensing, users find it valuable for managing ad-hoc queries and financial data analytics. The platform aids in SQL table management and BI traffic visualization while reducing storage costs and resolving storage conflicts typical in traditional data warehouses.
What are Dremio's most valuable features?Dremio is primarily implemented in industries requiring extensive data engineering and analytics, including finance and technology. Companies use it for constructing data frameworks, efficiently processing financial analytics, and visualizing BI traffic. It acts as a viable alternative to AWS Glue and Apache Hive, integrating seamlessly with multiple databases, including Oracle and MySQL, offering robust solutions for data-driven strategies. Despite some challenges, its ability to reduce data storage costs and manage complex queries makes it a favorable choice among enterprise users.
We monitor all Data Science Platforms reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.