Apache Hadoop Reviews and Pricing

Data Architect at a computer software company with 51-200 employees

Jan 6, 2024

Allows for customization and optimization of applications and performance using in-house resources but lacks community support

Pros and Cons

"It's open-source, so it's very cost-effective."

"The main thing is the lack of community support. If you want to implement a new API or create a new file system, you won't find easy support."

What is our primary use case?

We work on Apache Hadoop for various customers.

What is most valuable?

It's open-source, so it's very cost-effective. Apache Hadoop has its strengths. For example, in my previous organization, which was a small startup, we used it because it was cost-effective.

We only had to pay for the servers, and we could optimize applications and performance using our employees, which was especially cost-effective in India. So, human resources were the main investment, not software.

That was five years ago, though. In the last five years, I've mainly seen Redshift, Azure, and Oracle in the market.

What needs improvement?

The main thing is the lack of community support. If you want to implement a new API or create a new file system, you won't find easy support.

And then there's the server issue. You have to create and maintain servers on your own, which can be hectic. Sometimes, the configurations in the documentation don't work, and without a strong community to turn to, you can get stuck. That's where cloud services play a vital role.

In future releases, the community needs to be improved a lot. We need a better community, and the documentation should be more accurate for the setup process.

Sometimes, we face errors even when following the documentation for server setup and configuration. We need better support.

Even if we raise a ticket, it takes a long time to get addressed, and they don't offer online support. They ask for screenshots, which takes even more time. Instead of direct screensharing or hopping on a call. But it's free, so we can't complain too much.

For how long have I used the solution?

I've been working with Apache Hadoop for ten years. I started my career with Hadoop. I've worked with it at Infinia, Microsoft, and AWS, for a total of about eight years.

Buyer's Guide

Apache Hadoop

September 2025

Free Report: Apache Hadoop Reviews and More

Learn what your peers think about Apache Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: September 2025.

DOWNLOAD NOW

868,759 professionals have used our research since 2012.

What do I think about the stability of the solution?

I would rate the stability a seven out of ten. There is room for improvement in performance.

What do I think about the scalability of the solution?

It can be scalable in certain cases. Typically, for startups or product-based companies with limited budgets during product development, Apache Hadoop is often the only viable option. They cannot afford the costs of other cloud-based systems, so Apache Hadoop plays a main role in those scenarios.

Which solution did I use previously and why did I switch?

For some customers, we use Oracle Autonomous Database. Now, I cannot compare Apache Hadoop with Oracle Autonomous Data Warehouse when it comes to value for money. They're not directly comparable.

How was the initial setup?

The initial setup is a hectic task. Configuring servers and nodes takes a long time. That's one of the big advantages of an Autonomous Data Warehouse. You can start implementing within half the time.

With Apache Hadoop, you have to wait for the setup, architecture, and data evaluation. But with Autonomous, those things are automated. It scales as you use more data, so you can focus on the business rather than infrastructure.

What's my experience with pricing, setup cost, and licensing?

We just use the free version.

What other advice do I have?

We can't use Apache Hadoop for everything, like storage and data errors. But we can use some tools that are native to Hadoop, like Kafka.

For the current situation, I'd rate it a seven out of ten.

However, five years ago, I would have rated it a nine out of ten. Back then, I was working with it fully. But now we're used to working with cloud systems. Creating servers is more difficult nowadays.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.

Miodrag Milojevic

Senior Data Archirect at Yettel

Aug 9, 2023

Download

A file system for data collection that contains needed information and files

Pros and Cons

"It is a file system for data collection. There are nodes in this cluster that contain all the information, directories, and other files. The nodes are based on the MySQL database."

"The stability of the solution needs improvement."