
Apache Spark vs SAP HANA comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Apache Spark
Average Rating: 8.4
Reviews Sentiment: 7.7
Number of Reviews: 65
Ranking in other categories: Hadoop (1st), Compute Service (4th), Java Frameworks (2nd)

SAP HANA
Average Rating: 8.2
Reviews Sentiment: 6.5
Number of Reviews: 84
Ranking in other categories: Data Virtualization (2nd), Embedded Database (4th), Relational Databases Tools (4th)
 

Featured Reviews

Ilya Afanasyev - PeerSpot reviewer
Reliable, able to expand, and handle large amounts of data well
We use batch processing. It works well with our formats and file versions. There's a lot of functionality. Each hour, our pipeline copies data from MongoDB to a specific file. Previously, the pipeline copied all of the data from every table on each run, even when nothing had changed. The tables hold a lot of data, and the latest MongoDB version makes it possible to read only the changed data. This reduced the cost and configuration of the cluster, and we saved about $150,000. The solution is scalable. It's a stable product.
Jayarami Reddy Pujeri - PeerSpot reviewer
Comprehensive system with real-time analytics for versatile industry applications
Our primary use case is working with various clients in industries such as pharmaceuticals and other services. We support clients as implementers of SAP HANA, providing expertise in functionality, finance, logistics, and processes. The solution is very user-friendly and supports all kinds of…
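
The saving described in the Apache Spark featured review above comes from copying only changed records instead of every record on each hourly run. The review does not say which MongoDB feature was used, so the following is only a minimal sketch of the general idea in plain Python, using a timestamp watermark as a stand-in. The `updated_at` field, the sample documents, and both function names are hypothetical and do not come from the review.

```python
from datetime import datetime, timezone


def full_copy(source):
    """Copy every document on every run, whether or not it changed."""
    return list(source)


def incremental_copy(source, last_run):
    """Copy only documents modified after the previous run (the watermark)."""
    return [doc for doc in source if doc["updated_at"] > last_run]


# Hypothetical sample data: three documents, one modified since the last run.
last_run = datetime(2025, 3, 1, 12, 0, tzinfo=timezone.utc)
source = [
    {"_id": 1, "updated_at": datetime(2025, 3, 1, 11, 30, tzinfo=timezone.utc)},
    {"_id": 2, "updated_at": datetime(2025, 3, 1, 12, 15, tzinfo=timezone.utc)},
    {"_id": 3, "updated_at": datetime(2025, 2, 28, 9, 0, tzinfo=timezone.utc)},
]

full = full_copy(source)
changed = incremental_copy(source, last_run)
print(f"full copy: {len(full)} documents, incremental copy: {len(changed)} documents")
```

With large tables, the incremental run moves far fewer documents per hour, which is the kind of reduction in cluster load the reviewer credits for the savings.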

Quotes from Members

 

Pros

"AI libraries are the most valuable. They provide extensibility and usability. Spark has a lot of connectors, which is a very important and useful feature for AI. You need to connect a lot of points for AI, and you have to get data from those systems. Connectors are very wide in Spark. With a Spark cluster, you can get fast results, especially for AI."
"The product is useful for analytics."
"The product’s most valuable feature is the SQL tool. It enables us to create a database and publish it."
"The most valuable feature of Apache Spark is its flexibility."
"One of Apache Spark's most valuable features is that it supports in-memory processing, the execution of jobs compared to traditional tools is very fast."
"I found the solution stable. We haven't had any problems with it."
"The most significant advantage of Spark 3.0 is its support for DataFrame UDF Pandas UDF features."
"The deployment of the product is easy."
"This solution is very fast."
"SAP HANA's most valuable features are monitoring, reporting, and price and stock control."
"SAP HANA's best features are its programmability and extensibility - you can size and shape the software however you need."
"It is very flexible to integrate with SaaS components."
"I like the integration process. Also, the data is trusted by our management, and we use the data from transactions for analysis."
"The most valuable features I have found are speed, dashboard, and reporting."
"What I like best about SAP HANA is that it's faster than Microsoft SQL Server."
"Anyone currently using SAP will be transitioning to HANA."
 

Cons

"The setup I worked on was really complex."
"Apache Spark is very difficult to use. It would require a data engineer. It is not available for every engineer today because they need to understand the different concepts of Spark, which is very, very difficult and it is not easy to learn."
"This solution currently cannot support or distribute neural network related models, or deep learning related algorithms. We would like this functionality to be developed."
"Stream processing needs to be developed more in Spark. I have used Flink previously. Flink is better than Spark at stream processing."
"It should support more programming languages."
"There were some problems related to the product's compatibility with a few Python libraries."
"In data analysis, you need to take real-time data from different data sources. You need to process this in a subsecond, do the transformation in a subsecond, and all that."
"Include more machine learning algorithms and the ability to handle streaming of data versus micro batch processing."
"The user experience should be better. Its user interface is not good. I also don't like the transition concept."
"The bid process needs to be improved."
"The performance and integration with other products are areas in need of improvement."
"One notable issue is the difficulty in finding consultants with experience in the SuccessFactors product, a human resource management tool part of SAP's cloud-based solutions. For example, learning the Oracle database is straightforward. You can easily go to the Oracle website, download the database, install it on your laptop, and access technical resources and books."
"The installation process could be more straightforward."
"It would be nice to know when SAP plans to stop its maintenance of a previous version of SAP ECC ERP because, at this point, anyone utilizing SAP will have no choice but to go on S/4HANA Database."
"Per SAP, you can do both transactional and analytical processes in SAP HANA. Though that's true, the speed is slower when you combine the two functions, so this is what I'd like SAP to improve in SAP HANA. In the next release, I want to see better diagrams in SAP HANA and a more user-friendly interface."
"There's an issue in the partition. When you record more than two million records, partitioning does not work well. In Oracle it's easy. SAP must resolve this issue in order to be more competitive with Oracle."
 

Pricing and Cost Advice

"Licensing costs can vary. For instance, when purchasing a virtual machine, you're asked if you want to take advantage of the hybrid benefit or if you prefer the license costs to be included upfront by the cloud service provider, such as Azure. If you choose the hybrid benefit, it indicates you already possess a license for the operating system and wish to avoid additional charges for that specific VM in Azure. This approach allows for a reduction in licensing costs, charging only for the service and associated resources."
"Apache Spark is open-source. You have to pay only when you use any bundled product, such as Cloudera."
"The tool is an open-source product. If you're using the open-source Apache Spark, no fees are involved at any time. Charges only come into play when using it with other services like Databricks."
"The solution is affordable and there are no additional licensing costs."
"Considering the product version used in my company, I feel that the tool is not costly since the product is available for free."
"It is an open-source solution, it is free of charge."
"Apache Spark is not too cheap. You have to pay for hardware and Cloudera licenses. Of course, there is a solution with open source without Cloudera."
"They provide an open-source license for the on-premise version."
"The licensing could improve."
"SAP HANA is affordable. I rate it a seven out of ten."
"The price of this product is good."
"We are spending about 20,000 to 30,000 euros on the solution."
"The price is on the expensive side, at eight out of ten, with ten being expensive."
"Its licensing is expensive for SMEs and large enterprises alike."
"There is an annual payment needed to use the solution."
"The price of SAP HANA is very expensive and it is paid annually."
845,406 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews

Apache Spark
Financial Services Firm: 28%
Computer Software Company: 13%
Manufacturing Company: 8%
Comms Service Provider: 5%

SAP HANA
Manufacturing Company: 15%
Computer Software Company: 12%
Financial Services Firm: 10%
Government: 7%
 

Company Size

[Chart: reviewers by company size (Large Enterprise, Midsize Enterprise, Small Business)]
 

Questions from the Community

What do you like most about Apache Spark?
We use Spark to process data from different data sources.
What is your experience regarding pricing and costs for Apache Spark?
Compared to other solutions like Doc DB, Spark is more costly due to the need for extensive infrastructure. It requires significant investment in infrastructure, which can be expensive. While cloud...
What needs improvement with Apache Spark?
The Spark solution could improve in scheduling tasks and managing dependencies. Spark alone cannot handle sequential tasks, requiring environments like Airflow scheduler or scripts. For instance, o...
What are the biggest benefits of using SAP HANA?
Based on my work with SAP HANA, the biggest benefit that it can bring to your business is total data management. This product is by SAP - a company that serves almost all needs a client may have co...
Is SAP HANA’s customer and technical support reliable?
We have been using SAP HANA for a fairly short period of time and have only taken advantage of their customer support. So far, we have not had issues that required specialized help from technical s...
Is SAP HANA difficult to set up and start using?
SAP HANA is fairly easy to set up, however, I do not think a complete beginner can do it. You certainly need some preparation - either you need to have experience with similar solutions, or with ot...
 


Also Known As

Apache Spark: No data available
SAP HANA: SAP High-Performance Analytic Appliance, HANA
 


Sample Customers

Apache Spark: NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi Solutions
SAP HANA: Unilever, NHS 24, adidas Group, CHIO Aachen, Hamburg Port Authority (HPA), Bangkok Airways Public Company Limited
Find out what your peers are saying about Apache Spark vs. SAP HANA and other solutions. Updated: March 2025.