Try our new research platform with insights from 80,000+ expert users

AWS Lambda vs Apache Spark comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on May 21, 2025

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Apache Spark
Ranking in Compute Service
4th
Average Rating
8.4
Reviews Sentiment
7.4
Number of Reviews
66
Ranking in other categories
Hadoop (1st), Java Frameworks (2nd)
AWS Lambda
Ranking in Compute Service
1st
Average Rating
8.6
Reviews Sentiment
7.3
Number of Reviews
88
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of July 2025, in the Compute Service category, the mindshare of Apache Spark is 11.5%, up from 11.1% compared to the previous year. The mindshare of AWS Lambda is 20.7%, up from 19.5% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Compute Service
 

Featured Reviews

Dunstan Matekenya - PeerSpot reviewer
Open-source solution for data processing with portability
Apache Spark is known for its ease of use. Compared to other available data processing frameworks, it is user-friendly. While many choices now exist, Spark remains easy to use, particularly with Python. You can utilize familiar programming styles similar to Pandas in Python, including object-oriented programming. Another advantage is its portability. I can prototype and perform some initial tasks on my laptop using Spark without needing to be on Databricks or any cloud platform. I can transfer it to Databricks or other platforms, such as AWS. This flexibility allows me to improve processing even on my laptop. For instance, if I'm processing large amounts of data and find my laptop becoming slow, I can quickly switch to Spark. It handles small and large datasets efficiently, making it a versatile tool for various data processing needs.
Andrew-Wong - PeerSpot reviewer
Convenience in deployment process with room for code preview improvement
Having a better preview would be helpful. Sometimes, if my Lambda code is too big, it can be inconvenient as I'm unable to see my code when it exceeds a certain size. AWS has a limit, like a three-megabyte limit, beyond which I cannot view or edit the code easily.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"I found the solution stable. We haven't had any problems with it."
"ETL and streaming capabilities."
"Provides a lot of good documentation compared to other solutions."
"Spark helps us reduce startup time for our customers and gives a very high ROI in the medium term."
"It's easy to prepare parallelism in Spark, run the solution with specific parameters, and get good performance."
"AI libraries are the most valuable. They provide extensibility and usability. Spark has a lot of connectors, which is a very important and useful feature for AI. You need to connect a lot of points for AI, and you have to get data from those systems. Connectors are very wide in Spark. With a Spark cluster, you can get fast results, especially for AI."
"We use it for ETL purposes as well as for implementing the full transformation pipelines."
"The most valuable feature is the Fault Tolerance and easy binding with other processes like Machine Learning, graph analytics."
"We are building a Twitter-like application in the boot camp. I have used Lamda for the integration of the post-confirmation page in the application. This will help you get your one-time password via mail. You can log in with the help of a post-confirmation page. We didn’t want to setup an instance specifically for confirmation. We used the Lambda function so that it goes back to sleep after pushing up."
"Thanks to this solution, we do not need to worry about hardware or resource utilization. It saves us time."
"The basic feature that I like is that there is no server installation. It also has good support for various languages, such as Java, .NET, C#, and Python."
"It is a scalable solution."
"The most valuable feature of AWS Lambda, from a conceptual point, is its functions. For example, it's mathematical templates into which you can write, and create your solution. You write small pieces of a solution under given parameters."
"What I like best about AWS Lambda is that it's feature-rich, and I appreciate that. I also like that it's stable and supports many languages."
"It's a fairly easy solution to learn."
"It is easy to use."
 

Cons

"There could be enhancements in optimization techniques, as there are some limitations in this area that could be addressed to further refine Spark's performance."
"If you have a Spark session in the background, sometimes it's very hard to kill these sessions because of D allocation."
"This solution currently cannot support or distribute neural network related models, or deep learning related algorithms. We would like this functionality to be developed."
"For improvement, I think the tool could make things easier for people who aren't very technical. There's a significant learning curve, and I've seen organizations give up because of it. Making it quicker or easier for non-technical people would be beneficial."
"We use big data manager but we cannot use it as conditional data so whenever we're trying to fetch the data, it takes a bit of time."
"Include more machine learning algorithms and the ability to handle streaming of data versus micro batch processing."
"The main concern is the overhead of Java when distributed processing is not necessary."
"The solution needs to optimize shuffling between workers."
"AWS Lambda could be improved by increasing the size of the payload. Also, sometimes Lambda doesn't implement well for bigger solutions."
"It currently requires manual user maintenance to upgrade and evaluate, and an automated provision for this would be beneficial."
"Security needs to be improved."
"Lamba functions have cold-starts that can cause some delay."
"I would like to see the five zero four AWS Lambda invocation fixed. This is basically a time-out error."
"The runtime could be improved. There are certain use cases where I need a Lambda function to run longer."
"Lambda has limitations on the amount of memory you can use and is not a good solution for long running processes."
"AWS Lambda could improve by having no-code or low-code options because currently, you need to be able to write code well to use it."
 

Pricing and Cost Advice

"The product is expensive, considering the setup."
"The solution is affordable and there are no additional licensing costs."
"Apache Spark is not too cheap. You have to pay for hardware and Cloudera licenses. Of course, there is a solution with open source without Cloudera."
"It is an open-source platform. We do not pay for its subscription."
"Apache Spark is an open-source solution, and there is no cost involved in deploying the solution on-premises."
"Apache Spark is an expensive solution."
"Spark is an open-source solution, so there are no licensing costs."
"Licensing costs can vary. For instance, when purchasing a virtual machine, you're asked if you want to take advantage of the hybrid benefit or if you prefer the license costs to be included upfront by the cloud service provider, such as Azure. If you choose the hybrid benefit, it indicates you already possess a license for the operating system and wish to avoid additional charges for that specific VM in Azure. This approach allows for a reduction in licensing costs, charging only for the service and associated resources."
"AWS Lambda cost is pretty decent."
"We don't need to pay for licensing to use Lambda."
"The solution is part of the AWS subscription model that is paid annually."
"AWS Lambda is inexpensive."
"AWS Lambda is a very inexpensive solution. They charge for the number of times we run it. If you run AWS Lambda for one time, they charge around 50 cents or 25 cents for the use. I don't know the exact price, but it's less than a dollar."
"It computes by the cycle, and it's very cheap."
"The pricing varies based on the specific solution you're implementing, and in comparison to the value it provides, the overall cost is reasonable."
"We only need to pay for the compute time our code consumes."
report
Use our free recommendation engine to learn which Compute Service solutions are best for your needs.
860,168 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
27%
Computer Software Company
12%
Manufacturing Company
7%
Comms Service Provider
6%
Educational Organization
47%
Financial Services Firm
12%
Computer Software Company
7%
Manufacturing Company
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Apache Spark?
We use Spark to process data from different data sources.
What is your experience regarding pricing and costs for Apache Spark?
Apache Spark is open-source, so it doesn't incur any charges.
What needs improvement with Apache Spark?
There is complexity when it comes to understanding the whole ecosystem, especially for beginners. I find it quite complex to understand how a Spark job is initiated, the roles of driver nodes, work...
Which is better, AWS Lambda or Batch?
AWS Lambda is a serverless solution. It doesn’t require any infrastructure, which allows for cost savings. There is no setup process to deal with, as the entire solution is in the cloud. If you use...
What do you like most about AWS Lambda?
The tool scales automatically based on the number of incoming requests.
What is your experience regarding pricing and costs for AWS Lambda?
The pricing of AWS Lambda is reasonable. It's beneficial and cost-effective for users regardless of the number of instances used.
 

Comparisons

 

Overview

 

Sample Customers

NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi Solutions
Netflix
Find out what your peers are saying about AWS Lambda vs. Apache Spark and other solutions. Updated: June 2025.
860,168 professionals have used our research since 2012.