Try our new research platform with insights from 80,000+ expert users

Apache Hadoop vs Dremio comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

ROI

Sentiment score
5.4
Apache Hadoop offers cost-effective storage and processing, benefiting some with analytics and optimizing data applications for resource savings.
Sentiment score
5.6
Dremio reduces manpower costs, enhances efficiency, and eliminates infrastructure concerns, improving operations by accessing multiple data sources.
Dremio surely saves time, reduces costs, and all those things because we don't have to worry so much about the infrastructure to make the different tools communicate.
SR BI developer at BRQ Digital Solutions
 

Customer Service

Sentiment score
6.1
Customer service for Apache Hadoop varies, with differing satisfaction levels and reliance on external resources and forums for support.
Sentiment score
5.2
Dremio's customer service is responsive and helpful, facing staffing challenges as demand grows, requiring more integrators for support.
It's not structured support, which is why we don't use purely open-source projects without additional structured support.
Financial Advisor at a financial services firm with 10,001+ employees
We have had to reach out for customer support many times, and they respond, so they are pretty supportive about some long-term issues.
SR BI developer at BRQ Digital Solutions
 

Scalability Issues

Sentiment score
7.4
Apache Hadoop is valued for its scalability, supporting large data and users effectively, especially in cloud environments.
Sentiment score
7.1
Dremio scales well, offering flexibility and built-in capabilities, though community users face scaling limits due to licensing.
It is a distributed file system and scales reasonably well as long as it is given sufficient resources.
Financial Advisor at a financial services firm with 10,001+ employees
Dremio's scalability can handle growing data and user demands easily.
SR BI developer at BRQ Digital Solutions
Internally, if it's on Docker or Kubernetes, scalability will be built into the system.
Senior Software Architect at USEReady
 

Stability Issues

Sentiment score
7.1
Apache Hadoop is stable and reliable in multi-node clusters, performing well with minimal instability during high-load operations.
Sentiment score
7.2
Dremio is generally stable, scoring high ratings with occasional performance issues, especially with large datasets, requiring maintenance restarts.
Continuous management in the way of upgrades and technical management is necessary to ensure that it remains effective.
Financial Advisor at a financial services firm with 10,001+ employees
I rate Dremio a nine in terms of stability.
SR BI developer at BRQ Digital Solutions
 

Room For Improvement

Apache Hadoop needs user-friendly enhancements, better integration, improved security, streamlined setup, and modernized features and support.
Dremio struggles with Delta connector support, performance issues, SQL limitations, high costs, and fewer connectors than competitors.
The problem with Apache Hadoop arose when the guys that originally set it up left the firm, and the group that later owned it didn't have enough technical resources to properly maintain it.
Financial Advisor at a financial services firm with 10,001+ employees
Starburst comes with around 50 connectors now.
Senior Software Architect at USEReady
It should be easier to get Arctic or an open-source version of Arctic onto the software version so that development teams can experiment with it.
Senior Consultant - Data Analytics at a comms service provider with 201-500 employees
I see that many times the new versions of Dremio have not fixed old bugs, and in some new versions, old problems that were previously fixed come back again, so I think the upgrade part could use improvement.
SR BI developer at BRQ Digital Solutions
 

Setup Cost

Enterprise Apache Hadoop pricing varies greatly, influenced by distribution choice, deployment type, and specific usage requirements.
Dremio's pricing, though costly for scaling, is seen as valuable compared to competitors, requiring careful evaluation based on needs.
 

Valuable Features

Apache Hadoop offers scalable, cost-effective data processing, supporting diverse environments with fault tolerance, integration, and analytics tools like Hive.
Dremio offers efficient data management and visualization with seamless integration, native SQL, and role-based access control features.
Hadoop is a distributed file system, and it scales reasonably well provided you give it sufficient resources.
Financial Advisor at a financial services firm with 10,001+ employees
I assess Apache Hadoop's fault tolerance during hardware failures positively since we have hardware failover, which works without problems.
Principle Network and Database Engr at Parsons Corporation
Having everything under one system and an easier-to-work-with interface, along with having API integrations, adds significant value to working with Dremio.
Senior Consultant - Data Analytics at a comms service provider with 201-500 employees
Dremio has positively impacted my organization as nowadays we are connected to multiple databases from multiple environments, multiple APIs, and applications, and Dremio organizes everything in an amazing way for me.
Data Analyst at a insurance company with 501-1,000 employees
You just get the source, connect the data, get visualization, get connected, and do whatever you want.
Senior Software Architect at USEReady
 

Categories and Ranking

Apache Hadoop
Average Rating
8.0
Reviews Sentiment
6.6
Number of Reviews
41
Ranking in other categories
Data Warehouse (7th)
Dremio
Average Rating
8.4
Reviews Sentiment
6.6
Number of Reviews
11
Ranking in other categories
Cloud Data Warehouse (5th), Data Science Platforms (11th)
 

Featured Reviews

NR
Financial Advisor at a financial services firm with 10,001+ employees
Reliable performance maintained but requires ongoing management and support
Hadoop was used for years, but there were problems since the people who originally set it up left the firm. The group that owned it later didn't have the technical resources to properly maintain it. Although there was nothing wrong with Hadoop itself, issues arose without proper management and upgrades.
Corrr Moray - PeerSpot reviewer
SR BI developer at BRQ Digital Solutions
Has simplified complex data integration workflows and supported consistent reporting across multiple sources
We also have a close relationship with the team that does the Dremio maintenance for the database, like upgrading the versions and they know about some specific problems we had in the past, such as a memory leak. We had a memory leak on some versions, which sometimes stopped the service. Since we are using Dremio installed like a server, not a SaaS solution, many times we need to stop and restart the service to clear all the cache and all that, and this is the thing I should add. I see that many times the new versions of Dremio have not fixed old bugs, and in some new versions, old problems that were previously fixed come back again, so I think the upgrade part could use improvement. I remember using some features in the past, like pivot tables, which proved to be really difficult, but I know this is a fault also for other vendors. Pivoting, transposing, and unpivoting are often not so good. CTEs also many times prove to be not so good, so I think these two main items could be improved significantly if they standardize them.
report
Use our free recommendation engine to learn which Cloud Data Warehouse solutions are best for your needs.
881,082 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
34%
Computer Software Company
7%
University
5%
Manufacturing Company
5%
Financial Services Firm
28%
Computer Software Company
9%
Manufacturing Company
6%
Healthcare Company
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business14
Midsize Enterprise8
Large Enterprise21
By reviewers
Company SizeCount
Small Business1
Midsize Enterprise5
Large Enterprise5
 

Questions from the Community

What do you like most about Apache Hadoop?
It's primarily open source. You can handle huge data volumes and create your own views, workflows, and tables. I can also use it for real-time data streaming.
What is your experience regarding pricing and costs for Apache Hadoop?
The product is open-source, but some associated licensing fees depend on the subscription level. While it might be free for students, organizations typically need to pay for their subscriptions. Th...
What needs improvement with Apache Hadoop?
The problem with Apache Hadoop arose when the guys that originally set it up left the firm, and the group that later owned it didn't have enough technical resources to properly maintain it. This wa...
What do you like most about Dremio?
Dremio allows querying the files I have on my block storage or object storage.
What is your experience regarding pricing and costs for Dremio?
I don't have information about pricing, setup cost, and licensing for Dremio, so I am not entitled to discuss it.
What needs improvement with Dremio?
I wouldn't say there is anything Dremio can be improved on. If I could change something, I would say many developers and programmers, when they are starting to work in this specific field or area, ...
 

Comparisons

 

Also Known As

No data available
Dremio AWS - BYOL
 

Overview

 

Sample Customers

Amazon, Adobe, eBay, Facebook, Google, Hulu, IBM, LinkedIn, Microsoft, Spotify, AOL, Twitter, University of Maryland, Yahoo!, Cornell University Web Lab
UBS, TransUnion, Quantium, Daimler, OVH
Find out what your peers are saying about Apache Hadoop vs. Dremio and other solutions. Updated: December 2025.
881,082 professionals have used our research since 2012.