Apache HBase Reviews

Name: Apache HBase
Brand: Apache
Rating: 3.6 (4 reviews)

Vendor: Apache

3.6 out of 5

4 reviews
75% willing to recommend

Leave a review

What is Apache HBase?

Apache HBase is a distributed, scalable, NoSQL database built on Hadoop, designed to handle large volumes of structured data across commodity servers, providing real-time access and management.

Get the NoSQL Databases Buyer's Guide and find out what your peers are saying about Apache HBase, Redis, Microsoft Azure Cosmos DB and more!

Apache HBase is the #12 ranked solution in top NoSQL Databases. PeerSpot users give Apache HBase an average rating of 7.2 out of 10. Apache HBase is most commonly compared to Redis: Apache HBase vs Redis. Apache HBase is popular among the large enterprise segment, accounting for 54% of users researching this solution on PeerSpot. The top industry researching this solution are professionals from a financial services firm, accounting for 19% of all views.

Buyer's Guide

NoSQL Databases

May 2026

Get the category report

Helped 900,644 peers since 2012

Featured Apache HBase reviews

Ephrem Sisay

Senior Software Engineer at a computer software company with 501-1,000 employees

Apache HBase could be improved by optimizing the integration with Apache Phoenix; sometimes the abstraction and lookup jobs lead to issues when there are too many requests. Resource optimization isn't always as successful as it should be, which can cause some query and lookup jobs to fail. For instance, during eligibility checks for credit, if there are many requests on the database, it might fail, and after such a failure, it doesn't allow us to run queries from the moment they stop. If there could be optimization to require less resource usage and allow those jobs and queries to pick up from where they stopped, that would be a great addition to the tool.

Read full review

YuQing Ding

Principle Network and Database Engr at Parsons Corporation

The best features of Apache HBase include being embedded, making it very fast. When it's linking, it operates with virtually no delay. With SQL, it's a bit slower from my experience, and all of the queries are very fast too. It has some optimization inside, so I think it's very sufficient and efficient. The in-memory processing for two thousand cameras is crucial, and the most important aspect is the queries. When we write the queries, we have to be very clear. The rules we have to optimize by ourselves to associate to do the queries, and to do the normal rules is very important. We do the minimum to link with the maximum.

Read full review

Sekhar Reddy B

Principal Software Engineer at Securonix

We use it for real-time data grouping The most valuable part is the column family structure. We mainly use it for real-time aggregations. That's why we prefer it as a NoSQL database. We've seen performance issues when we have more regions. The product needs improvement in that area. So we…

Read full review

Apache HBase mindshare

As of June 2026, the mindshare of Apache HBase in the NoSQL Databases category stands at 4.9%, down from 5.9% compared to the previous year, according to calculations based on PeerSpot user engagement data.

NoSQL Databases Mindshare Distribution
Product	Mindshare (%)
Apache HBase	4.9%
MongoDB Enterprise Advanced	13.2%
Redis	8.5%
Other	73.4%

NoSQL Databases

Key learnings from peers

Valuable Features

Apache HBase offers a valuable column family structure and abstraction layer via Apache Phoenix, enabling SQL queries. Users benefit from its integration with Hadoop and HDFS, enhancing data storage and transport through pipelines. It excels in in-memory processing, optimizing queries with impressive speed, ideal for concurrent jobs and real-time aggregations. Linkage operations have minimal delay, making Apache HBase highly efficient and crucial for operations involving extensive data handling.

"The best features of Apache HBase include being embedded, making it very fast; when it's linking, it operates with virtually no delay, and all of the queries are very fast too due to some internal optimization which makes it very sufficient and efficient."
"The in-memory processing lets us optimize our queries and helps us run concurrent queries and other jobs such as the lookup jobs we always use Apache HBase for."
"The most valuable part is the column family structure."

Room for Improvement

Apache HBase experiences performance issues with large data loads and increased regions, needing enhanced integration with Apache Phoenix and better resource optimization. Frequent problems occur during database requests and eligibility checks for credit, causing failure and hindering query resumption. Setup requires significant time due to network connectivity, often resulting in crashes. Improvements should reduce resource usage and ensure tasks continue seamlessly after interruptions.

"The setup of Apache HBase needs a lot of time, and the linkage is not the program itself, but the activation and connecting to the NYPD engine always takes considerable time."
"Apache HBase could be improved by optimizing the integration with Apache Phoenix; sometimes the abstraction and lookup jobs lead to issues when there are too many requests."
"We've seen performance issues."

Deployment

Users noted that deploying Apache HBase requires Apache Hadoop, which is complex due to numerous dependencies. Some found initial deployment straightforward, implementing it both on cloud platforms like AWS and on-premises. However, others encountered challenges, particularly with firewalls, leading to frequent problem-solving meetings. Participants in deployment processes highlighted varying experiences, indicating diverse levels of difficulty based on specific setup environments and requirements.

These insights are based on the in-depth reviews provided by peers to help you make a better buying decision.

Download our NoSQL Databases Buyer's Guide for additional reliable information.

Top industries

By visitors reading reviews

Financial Services Firm

19%

Comms Service Provider

Manufacturing Company

University

Educational Organization

Construction Company

Computer Software Company

Outsourcing Company

Transportation Company

Energy/Utilities Company

Performing Arts

Media Company

Wholesaler/Distributor

Government

Insurance Company

Legal Firm

Pharma/Biotech Company

Retailer

Leisure / Travel Company

Recreational Facilities/Services Company

Renewables & Environment Company

Hospitality Company

Marketing Services Firm

Sports Company

Training & Coaching Company

Logistics Company

Non Profit

Museum Or Institution

Healthcare Company

Compare Apache HBase with alternative products

Learn more about Apache HBase

Apache HBase serves as a robust tool for handling vast amounts of data because it is optimized for random access and rapidly changing workloads. Its architecture supports massive storage capacities, making it ideal for applications requiring linear scalability and low latency. It integrates seamlessly with big data ecosystems, enhancing data processing capabilities for dynamic web applications and analytic databases. Leveraging column-family-oriented storage, it ensures efficient data retrieval and management, vital for real-time computational tasks.

What are the essential features of Apache HBase?

Linear Scalability: Easily expand storage capabilities to accommodate growing data needs without compromising performance.
Strong Consistency: Ensures data accuracy and reliability across distributed clusters.
Table Sharding: Facilitates automatic data distribution for optimized load balancing and access speed.
Real-Time Queries: Supports real-time read/write access crucial for time-sensitive applications.
Fault Tolerance: Automatically manages faults and ensures high availability through data replication.

What benefits should be considered in reviews?

Cost Efficiency: Reduces costs through commodity hardware usage and open-source infrastructure.
Scalability: Capacity to scale with business needs, providing a future-proof data management strategy.
Performance: High-speed data retrieval and management enable fast, responsive applications.
Integration: Seamless compatibility with Hadoop and other big data tools enhances ecosystem capability.

Apache HBase finds widespread application in industries like finance, telecommunications, and e-commerce, where high-speed data analysis and real-time processing are critical. In finance, it analyzes transactional data for fraud detection. In telecommunications, it manages customer data for service improvement. E-commerce giants use it for personalized recommendations and inventory management, underscoring its versatility across different sectors.

Apache HBase was previously known as HBase.

Apache HBase customers

Bloomberg, Wells Fargo, Apple, Capital One, NVIDIA

Product Categories

NoSQL Databases

Popular Comparisons

Redis vs Apache HBase

Microsoft Azure Cosmos DB vs Apache HBase

MongoDB Enterprise Advanced vs Apache HBase

InfluxDB vs Apache HBase

Cloudera Distribution for Hadoop vs Apache HBase

Cassandra vs Apache HBase

Couchbase Enterprise vs Apache HBase

ScyllaDB vs Apache HBase

Neo4j Graph Database vs Apache HBase

DataStax Enterprise vs Apache HBase

Google Cloud Firestore vs Apache HBase

Aerospike Database vs Apache HBase

Oracle NoSQL vs Apache HBase

CouchDB vs Apache HBase

Accumulo vs Apache HBase

See all alternatives

Apache HBase Reviews Summary
Author info	Rating	Review Summary
Senior Software Engineer at a computer software company with 501-1,000 employees	4.0	I've used Apache HBase mainly for customer data lookups and eligibility checks. It integrates well with Hadoop and Phoenix but needs better resource optimization during high loads. Overall, it's scalable, reliable, and suits our Big Data platform needs.
Principle Network and Database Engr at Parsons Corporation	4.5	I used Apache HBase for a year-long security system project and found it fast and efficient, though setup was time-consuming and network issues caused crashes. Support was excellent, and I’d rate it nine out of ten.
Principal Software Engineer at Securonix	4.0	We use Apache HBase for real-time data grouping, primarily appreciating its column family structure for real-time aggregations. However, we encounter performance issues when the number of regions increases, particularly under heavier loads.
Cloud and Big Data Engineer \| Developer at Huawei	2.0	I use Apache HBase for managing consumer data sets due to its database capabilities. However, I face performance issues when storing large amounts of data. Despite these challenges, I have not considered other solutions or providers yet.

Ephrem Sisay

Senior Software Engineer at a computer software company with 501-1,000 employees

Aug 22, 2025

In-memory processing and integration capabilities have optimized query performance

What is our primary use case?

We are using Apache HBase as a lookup database for queries that require doing lookups on customer data and eligibility checks for different kinds of customers. The customer data is stored in the Apache HBase database where we perform the lookup jobs.

What is most valuable?

The most valuable feature of Apache HBase is its abstraction; it's not directly an SQL database, but it adds an abstraction layer. There is a tool called Apache Phoenix, which we use as an abstraction for Apache HBase because it doesn't directly allow querying SQL statements. By using Apache Phoenix integrated with Apache HBase, we can run SQL queries and some other queries that we want to do. Even the lookup jobs I mentioned earlier, we use Apache Phoenix as an abstraction over Apache HBase.

HBase's integration with Hadoop and HDFS has definitely influenced our data storage strategy because Hadoop is the base or the foundation for those tools to run on. HDFS, being the storage for Hadoop, allows the query results from our lookup jobs to be placed there and transported through data pipelines to other data sources. Basically, Hadoop is a distributed system that everything operates on, and HDFS is the storage.

The impact of in-memory processing on our data operations is significant; it makes our processes fast. The in-memory processing lets us optimize our queries and helps us run concurrent queries and other jobs such as the lookup jobs we always use Apache HBase for.

What needs improvement?

For how long have I used the solution?

I have been working with Apache HBase for one year and six months.

What was my experience with deployment of the solution?

The deployment process of Apache HBase is actually big and complex. I cannot describe it entirely in this review, but we set up Hadoop for a distributed system, along with HDFS, Hive for metastore needs, and other tools such as Apache NiFi for data pipelines, Apache Airflow for scheduling jobs, and Apache Superset for visualization. Integrating those tools to set up the whole data lake for the Big Data platform took months to configure everything to function properly. The first-time setup is complex because it involves many different tools.

What do I think about the stability of the solution?

The stability and reliability of Apache HBase can be quite good as long as you maintain a solid cluster environment and a good resource optimization process. However, issues might arise regarding compute resources such as CPU and memory, which is something to consider. Ultimately, the environment you select is a key factor in determining whether you experience good stability and reliability or not.

What do I think about the scalability of the solution?

Apache HBase is good in terms of scalability; its scalability largely depends on the type of deployment. Ours is configured using the official Helm charts for Apache HBase on a Kubernetes cluster, which makes it quite scalable.

How are customer service and support?

I don't often communicate with the technical support and customer service for Apache HBase. Most of the time, we rely on the official documentation and community forums for information about new features or to resolve the problems we encounter.

How would you rate customer service and support?

Neutral

Which solution did I use previously and why did I switch?

We did not use a different solution for these use cases before Apache HBase.

How was the initial setup?

I participate in the initial setup and deployment of Apache HBase.

Which other solutions did I evaluate?

We did not evaluate other options when choosing Apache HBase; we decided to go with Apache HBase along with the abstraction of Apache Phoenix at the time of implementation, and it's been working for us.

What other advice do I have?

I'm working for a corporate that uses Apache HBase for their Big Data platform and I'm a Big Data engineer there.

We're using a version of Apache HBase that is compatible with the other Big Data tools that we are using on the platform, but it's not the latest one.

For Apache HBase, mostly we use it as a lookup database for queries that require doing lookups on the customer data or eligibility checks that we have to do for different kinds of customers. We store customer data on the Apache HBase database, and we do lookup jobs from those databases.

I utilize the automatic sharding of Apache HBase. Sharding is a way of partitioning the data sets into readable segments to run the queries in the most optimized way. We use those sharding capabilities to optimize our queries and run them as fast as possible to utilize fewer resources because a Big Data platform uses many resources. To remove those necessities, we use sharding to partition and optimize our queries, which allows us to run our queries quickly without consuming as much CPU and memory resources.

Apache HBase processing works by using in-memory data resources and takes advantage of the in-memory utilities without relying on storage capabilities.

The documentation I used is generally good, but the visualization could improve; it seems outdated. However, since it's an open-source tool, one cannot expect everything to be perfect, and the maintainers are typically driven by passion rather than finances. Overall, it's good documentation, and I've referenced it to address various problems and implementations.

Based on my experience, I would rate Apache HBase an eight out of ten.

I wonder if there are any other options that you would recommend?

Which deployment model are you using for this solution?

On-premises

YuQing Ding

Principle Network and Database Engr at Parsons Corporation

Aug 22, 2025

Real-time query processing streamlines security system integration

What is our primary use case?

I have experience with Apache HBase for a project that lasted one year. It involved a database for cameras and security systems, with all devices associated with each other, including doors, cameras, and IP devices that needed to be linked together.

Real-time linkage is important because we need to associate it with a camera if there's any alarm. The camera has to report to the data center to associate with the NYPD so NYPD can get to the data center immediately. It's related to the subway security systems.

What is most valuable?

The in-memory processing for two thousand cameras is crucial, and the most important aspect is the queries. When we write the queries, we have to be very clear. The rules we have to optimize by ourselves to associate to do the queries, and to do the normal rules is very important. We do the minimum to link with the maximum.

What needs improvement?

The setup of Apache HBase needs a lot of time, and the linkage is not the program itself, but the activation and connecting to the NYPD engine always takes considerable time. The network connections are the most important, and it crashes often because of them.

For how long have I used the solution?

I have experience with Apache HBase for a project that lasted one year.

What was my experience with deployment of the solution?

The setup was difficult, especially with the PD having their own firewall. We had meetings after meetings on how to activate it.

What do I think about the scalability of the solution?

How are customer service and support?

Their customer service with Apache is great. Whenever we ask questions, we always get very good support, and they help us significantly.

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

We use Hadoop only in university projects. I remember many of the projects in the university utilized it, but I have to refresh myself now because I'm on other projects.

How was the initial setup?

The setup was difficult, especially with the PD having their own firewall. We had meetings after meetings on how to activate it.

What other advice do I have?

I'm very familiar with Apache SQL. Apache HBase is very similar to SQL, but it is a bit different.

The licensing of Apache HBase is not a problem; we love it. It's embedded, which is the reason we use it extensively.

I don't use API management tools such as API6.

On a scale of one to ten, I rate Apache HBase a nine.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure

Sekhar Reddy B

Principal Software Engineer at Securonix

Aug 18, 2024

Offers real-time aggregations and easy for a beginner to learn to use this

What is our primary use case?

We use it for real-time data grouping.

What is most valuable?

The most valuable part is the column family structure. We mainly use it for real-time aggregations. That's why we prefer it as a NoSQL database.

What needs improvement?

We've seen performance issues when we have more regions. The product needs improvement in that area.

So we experience performance issues sometimes when the load increases.

For how long have I used the solution?

It's one of our legacy systems. We've been using it for eight to nine years.

What do I think about the stability of the solution?

We've only seen issues when migrating from an older version to the latest one. Otherwise, it's good.

How are customer service and support?

We don't rely on Apache support. It's our own infrastructure, maintained by our Hadoop team.

How was the initial setup?

The initial deployment is easy.

We deploy both on the cloud and on-premises, depending on the customer.

We use AWS.

What's my experience with pricing, setup cost, and licensing?

The cost depends on the EC2 instances and the size of the data you're indexing.

What other advice do I have?

It's better to use AWS DynamoDB or Cassandra.

I would rate it an eight out of ten. It is easy for a beginner to learn.

Atif Tariq

Cloud and Big Data Engineer | Developer at Huawei

Nov 20, 2023

The solution has many performance issues, though it helps manage consumer data sets

What is most valuable?

Apache HBase is a database used for data storage. It's managing our data set as a consumer data set.

What needs improvement?

I don't like using Apache HBase to store huge amounts of data because of many performance issues.

For how long have I used the solution?

I have been using Apache HBase for seven years.

How was the initial setup?

You cannot deploy Apache HBase without Apache Hadoop, and Hadoop is complex for many. Apache HBase is dependent on many services, and when there are a huge number of dependencies, it's a problem.

What other advice do I have?

I would not recommend Apache HBase to other users. There are more efficient solutions available in the market that have fixed many limitations presented by Apache HBase.

Overall, I rate Apache HBase a four out of ten.

Apache HBase Reviews

What is Apache HBase?

Featured Apache HBase reviews

Apache HBase mindshare

Valuable Features

Room for Improvement

Deployment

Top industries

Compare Apache HBase with alternative products

Learn more about Apache HBase

Apache HBase customers

Related questions

Product Categories

Popular Comparisons

What is our primary use case?

What is most valuable?

What needs improvement?

For how long have I used the solution?

What was my experience with deployment of the solution?

What do I think about the stability of the solution?

What do I think about the scalability of the solution?

How are customer service and support?

How would you rate customer service and support?

Which solution did I use previously and why did I switch?

How was the initial setup?

Which other solutions did I evaluate?

What other advice do I have?

Which deployment model are you using for this solution?

What is our primary use case?

What is most valuable?

What needs improvement?

For how long have I used the solution?

What was my experience with deployment of the solution?

What do I think about the scalability of the solution?

How are customer service and support?

How would you rate customer service and support?

Which solution did I use previously and why did I switch?

How was the initial setup?

What other advice do I have?

Which deployment model are you using for this solution?

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

What is our primary use case?

What is most valuable?

What needs improvement?

For how long have I used the solution?

What do I think about the stability of the solution?

How are customer service and support?

How was the initial setup?

What's my experience with pricing, setup cost, and licensing?

What other advice do I have?

What is most valuable?

What needs improvement?

For how long have I used the solution?

How was the initial setup?

What other advice do I have?