Try our new research platform with insights from 80,000+ expert users

AWS Data Pipeline [EOL] vs AWS Glue comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

AWS Data Pipeline [EOL]
Average Rating
8.0
Number of Reviews
2
Ranking in other categories
No ranking in other categories
AWS Glue
Average Rating
7.8
Reviews Sentiment
7.0
Number of Reviews
48
Ranking in other categories
Cloud Data Integration (1st)
 

Featured Reviews

Geoffrey Leigh - PeerSpot reviewer
A stable, scalable, and reliable solution for moving and processing data
We're only considering enhancing the presentation layer to give a more multidimensional OLAP view that AWS seems to have decided on. Redshift with the data mart structure is like an OLAP cube. Oracle Analytics Cloud is an over-code killer and is not what we need. I was looking at Mondrian, which used to be part of the open-source stack from another vendor that works. Still, I am also looking at some of the other OLAP environments like Kaiser and perhaps decided to go to Azure with Microsoft Azure analysis cloud, but that's not multidimensional either as SSAS used to be. We tried the Mondrian, and that didn't perform how we expected. So, we are looking at resetting something to perform as an OLAP in the cloud, particularly AWS, so that we might consider an Azure solution.
Muthuvel Sivaraman - PeerSpot reviewer
Handles a huge volume of data and is serverless, but it can be considered costly by some users
We use Amazon's services to provide technical support for the product. If you want to have support, Oracle and others offer a single support, and other tools have a direct support window. For Amazon, we need to pay 10 percent of my billing amount for the tool to get support services. Whether to raise a support ticket or not is an issue since ten percent is a huge amount. My company ends up using all the options without help from support. It is very difficult for any common man to understand why there is a need to pay ten percent for support. If I find an issue in the product, and I need to get support from AWS to fix it, then I need to pay ten percent of the tool's bill amount to Amazon. AWS is a very tricky tool because everything is evolving nowadays. AWS engineers are getting hired from other places, and even after that, if I am not getting any technical support, then things will be very nasty. There are some good engineers who help users outside the normal support cycle, but it doesn't meet their needs. I rate the technical support a four out of ten.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable feature of the solution is that orchestration and development capabilities are easier with the tool."
"It is a stable solution...It is a scalable solution."
"The most valuable feature of AWS Glue is scalability."
"Transformations are valuable because you can modify or override complex data logic from an open source or Spark to solve issues."
"Data catalog and triggers are the two best features for me. AWS Glue has its own data catalog, which makes it great and really easy to use. Triggers are also really good for scheduling the ETL process."
"The most valuable features currently are glue studio, jobs, and triggers."
"One aspect that I would like to highlight is the Glue Crawler, which we utilize when working with large datasets to ensure the schema updates seamlessly without requiring end-team knowledge."
"If I'm working with big data, common languages like Python work quite nicely, which is advantageous."
"It is a stable and scalable solution."
"The best thing about AWS Glue is its scalability and how easy it is to process a large amount of data."
 

Cons

"The user-defined functions have shortcomings in AWS Data Pipeline."
"It's almost semi-automatic because you must review and approve code push, which works well. Still, we had many problems getting there during the deployment process, but we got there."
"Currently, it supports only two languages in the background: Python and Scala. From our customization point of view, it would be helpful if it can also support Java in the background."
"AWS Glue would be improved by making it easier to switch from single to multi-cloud."
"The mapping area and the use of the data catalog from Glue could be better."
"Only people who can code, either in Java or Python, can use the product freely. Those who don't know Java or Python might find using AWS Glue difficult."
"The solution's visual ETL tool is of no use for actual implementation."
"The drawbacks associated with the product stem from the fact that, based on the data volume, it can become very costly."
"The start-up time is really high right now. For instance, when you start up a new job, you have to wait for five or eight minutes before it starts. If the start-up time is reduced to one or two minutes, it will be great. It will be better to have a direct linkage to Redshift in AWS. If we can use data catalogs from Redshift, it will be so easy to create some data catalogs. Currently, we can only use data catalogs from S3."
"It is not clear how the partition discovery would have been affected by more data coming in."
 

Pricing and Cost Advice

"I rate the pricing between six to eight on a scale from one to ten, where one is low price, and ten is high price."
"The way we use it, I think it is fair as we're getting a good value for money compared to having a server or some other data pipeline."
"I would rate the solution a six or seven on a scale of one to ten, with ten being very expensive. Specifically, I rate its pricing a six out of ten."
"This solution is affordable and there is an option to pay for the solution based on your usage."
"It is an expensive product. I rate its pricing a nine out of ten."
"AWS Glue is a paid service that doesn't come under the free trial of AWS."
"AWS Glue uses a pay-as-you-go approach which is helpful. The price of the overall solution is low and is a great advantage."
"I rate the tool an eight on a scale of one to ten, where one is expensive, and ten is expensive."
"AWS Glue follows a pay-as-you-go model, wherein the cost of the data you use will be counted as a monthly bill."
"The solution's pricing is based on DPUs so it is a good idea to optimize use or it can get expensive."
report
Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
845,485 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Computer Software Company
24%
Financial Services Firm
22%
Government
6%
Insurance Company
5%
Financial Services Firm
22%
Computer Software Company
13%
Manufacturing Company
8%
Government
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about AWS Data Pipeline?
The most valuable feature of the solution is that orchestration and development capabilities are easier with the tool.
What is your experience regarding pricing and costs for AWS Data Pipeline?
I rate the pricing between six to eight on a scale from one to ten, where one is low price, and ten is high price.
What needs improvement with AWS Data Pipeline?
The user-defined functions have shortcomings in AWS Data Pipeline. The user-defined functions could be one of the areas where I can write a custom function and embed it as a part of AWS Data Pipeli...
How do you select the right cloud ETL tool?
AWS Glue and Azure Data factory for ELT best performance cloud services.
How does Talend Open Studio compare with AWS Glue?
We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in...
What are the most common use cases for AWS Glue?
AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or ma...
 

Comparisons

 

Overview

 

Sample Customers

bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
Find out what your peers are saying about Amazon Web Services (AWS), Informatica, Salesforce and others in Cloud Data Integration. Updated: March 2025.
845,485 professionals have used our research since 2012.