Apache Spark vs SAP HANA comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Apache Spark
Average Rating: 8.4
Reviews Sentiment: 7.7
Number of Reviews: 66
Ranking in other categories: Hadoop (1st), Compute Service (5th), Java Frameworks (2nd)

SAP HANA
Average Rating: 8.4
Reviews Sentiment: 6.5
Number of Reviews: 85
Ranking in other categories: Data Virtualization (2nd), Embedded Database (4th), Relational Databases Tools (4th)
 

Featured Reviews

Ilya Afanasyev - PeerSpot reviewer
Reliable, able to expand, and handle large amounts of data well
We use batch processing. It works well with our formats and file versions, and there's a lot of functionality. Each hour, our pipeline copies changes from MongoDB to a specific file. Previously, the pipeline copied all of the data from every table on each run, whether or not anything had changed. The tables hold a lot of data, and the latest MongoDB version makes it possible to read only changed data. This reduced the cost and configuration of the cluster, and we saved about $150,000. The solution is scalable. It's a stable product.
Jayarami Reddy Pujeri - PeerSpot reviewer
Comprehensive system with real-time analytics for versatile industry applications
Our primary use case is working with various clients in industries such as pharmaceuticals and other services. We support clients as implementers of SAP HANA, providing expertise in functionality, finance, logistics, and processes. The solution is very user-friendly and supports all kinds of…

Quotes from Members

 

Pros

"Apache Spark provides a very high-quality implementation of distributed data processing."
"The data processing framework is good."
"We use Spark to process data from different data sources."
"The tool's most valuable feature is its speed and efficiency. It's much faster than other tools and excels in parallel data processing. Unlike tools like Python or JavaScript, which may struggle with parallel processing, it allows us to handle large volumes of data with more power easily."
"It's easy to prepare parallelism in Spark, run the solution with specific parameters, and get good performance."
"The most valuable feature of Apache Spark is its ease of use."
"Features include machine learning, real time streaming, and data processing."
"The memory processing engine is the solution's most valuable aspect. It processes everything extremely fast, and it's in the cluster itself. It acts as a memory engine and is very effective in processing data correctly."
"SAP HANA's best features are its programmability and extensibility - you can size and shape the software however you need."
"It has a very huge bandwidth and data transfer."
"The solution is easy to scale."
"The solution is stable."
"Provides us with predictive capabilities for asset maintenance, and real-time forecasts."
"The speed at which it gets the data is great."
"The most value for us was in terms of using it to issue tenders online. We host our server, but it is open to the public, so clients who want to buy those tenders were able to go online, put their tender documents up, and we could evaluate them using SAP."
"We have found the solution to be customizable and it is beneficial it comes as a bundled package. Additionally, it is user-friendly."
 

Cons

"For improvement, I think the tool could make things easier for people who aren't very technical. There's a significant learning curve, and I've seen organizations give up because of it. Making it quicker or easier for non-technical people would be beneficial."
"It would be beneficial to enhance Spark's capabilities by incorporating models that utilize features not traditionally present in its framework."
"Stream processing needs to be developed more in Spark. I have used Flink previously. Flink is better than Spark at stream processing."
"Include more machine learning algorithms and the ability to handle streaming of data versus micro batch processing."
"The main concern is the overhead of Java when distributed processing is not necessary."
"The Spark solution could improve in scheduling tasks and managing dependencies."
"Its UI can be better. Maintaining the history server is a little cumbersome, and it should be improved. I had issues while looking at the historical tags, which sometimes created problems. You have to separately create a history server and run it. Such things can be made easier. Instead of separately installing the history server, it can be made a part of the whole setup so that whenever you set it up, it becomes available."
"The management tools could use improvement. Some of the debugging tools need some work as well. They need to be more descriptive."
"The releases need to be more stable. It's surprising to still encounter significant bugs after ten years of the product being available."
"One notable issue is the difficulty in finding consultants with experience in the SuccessFactors product, a human resource management tool part of SAP's cloud-based solutions. For example, learning the Oracle database is straightforward. You can easily go to the Oracle website, download the database, install it on your laptop, and access technical resources and books."
"SAP HANA isn't user-friendly, and it's very hard to train newcomers to use it."
"The bid process needs to be improved."
"The product lacks some flexibility in its settings and configurations."
"In terms of improvement, the speed is not as good as we thought it would be. That is why we are trying different solutions that will be built with different technologies."
"If the developers were to enhance or improve the application logic while processing the transactions, that would be great."
"The support is lacking and not worth the premium price. If support was just a nominal fee, it would be perceived differently."
 

Pricing and Cost Advice

"It is an open-source platform. We do not pay for its subscription."
"They provide an open-source license for the on-premise version."
"Apache Spark is open-source. You have to pay only when you use any bundled product, such as Cloudera."
"The solution is affordable and there are no additional licensing costs."
"Spark is an open-source solution, so there are no licensing costs."
"Considering the product version used in my company, I feel that the tool is not costly since the product is available for free."
"We are using the free version of the solution."
"Apache Spark is an open-source tool."
"Price-wise, the product falls on the higher side of the spectrum. There is no need to pay for maintenance and support additionally. Support is available for bug fixes in the product."
"It is expensive."
"People who are technical will accept the cost, but financially they will assess whether this solution will bring them revenue or not. People often ask, how will I profit when the cost is so high?"
"Setup and licensing require planning and proper budgeting, as it is not cheap."
"The pricing is relatively high for both customers and partners."
"The price is high and could be a bit cheaper."
"The cost of SAP HANA is high, and I would rate the price at eight out of ten."
"The licensing could improve."
851,604 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews

Financial Services Firm: 26%
Computer Software Company: 13%
Manufacturing Company: 8%
Comms Service Provider: 6%

Manufacturing Company: 14%
Computer Software Company: 11%
Financial Services Firm: 10%
Government: 7%
 

Company Size

By reviewers: Large Enterprise, Midsize Enterprise, Small Business
 

Questions from the Community

What do you like most about Apache Spark?
We use Spark to process data from different data sources.
What is your experience regarding pricing and costs for Apache Spark?
Apache Spark is open-source, so it doesn't incur any charges.
What needs improvement with Apache Spark?
There is complexity when it comes to understanding the whole ecosystem, especially for beginners. I find it quite complex to understand how a Spark job is initiated, the roles of driver nodes, work...
What are the biggest benefits of using SAP HANA?
Based on my work with SAP HANA, the biggest benefit that it can bring to your business is total data management. This product is by SAP - a company that serves almost all needs a client may have co...
Is SAP HANA’s customer and technical support reliable?
We have been using SAP HANA for a fairly short period of time and have only taken advantage of their customer support. So far, we have not had issues that required specialized help from technical s...
Is SAP HANA difficult to set up and start using?
SAP HANA is fairly easy to set up, however, I do not think a complete beginner can do it. You certainly need some preparation - either you need to have experience with similar solutions, or with ot...
 

Also Known As

Apache Spark: No data available
SAP HANA: SAP High-Performance Analytic Appliance, HANA
 

Sample Customers

Apache Spark: NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi Solutions
SAP HANA: Unilever, NHS 24, adidas Group, CHIO Aachen, Hamburg Port Authority (HPA), Bangkok Airways Public Company Limited
Find out what your peers are saying about Apache Spark vs. SAP HANA and other solutions. Updated: April 2025.