I use AWS Glue for data processing. Some of my colleagues have data for software, and I use AWS Glue to transform and inspect this data.
Data Engineer at a tech services company with 501-1,000 employees
Offers good documentation, stability but error handling is difficult
Pros and Cons
- "It's very good to manage."
- "AWS Glue's error handling is difficult."
What is our primary use case?
What is most valuable?
It's very good to manage. It is easy to integrate other products with AWS.
Glue integrates with other AWS processes and networks. So, it's quite easy to integrate.
I've worked with AI integration but I haven't gone into much depth on that topic.
What needs improvement?
AWS Glue's error handling is difficult.
The errors in AWS are very hard to handle. The screen is very hard to understand.
I have to use CloudWatch, but whatever our error was, the new ones, and so on. I would test this with someone. It's not so easy for me, and there are more things related to this.
For how long have I used the solution?
I have been using it for a year and a half.
Buyer's Guide
AWS Glue
February 2026
Learn what your peers think about AWS Glue. Get advice and tips from experienced pros sharing their opinions. Updated: February 2026.
881,757 professionals have used our research since 2012.
What do I think about the stability of the solution?
I would rate the stability a nine out of ten.
What do I think about the scalability of the solution?
I would rate the scalability a seven out of ten.
My data is small, so we need to consider more days. We need to deal with what we have, but I understand the documentation.
Some people find it hard, but I rated it a seven. In my company, TechOps uses AWS with about 1,200 users.
Which solution did I use previously and why did I switch?
I worked with Databricks. In my opinion, Databricks is improving and is easier to use. It's more user-friendly, and I think it's better overall.
How was the initial setup?
I work with a big company, and most of it is already quickly done, like using something that is a blueprint. This configuration stuff is already working in another place. The only thing I have to do with the cloud is the remote configuration.
What's my experience with pricing, setup cost, and licensing?
AWS can be expensive.
What other advice do I have?
Overall, I would rate it a seven out of ten. I would recommend it.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Senior Developer for cloud services at a tech consulting company with 10,001+ employees
Efficiently handles and moves data around, contributing to streamlined operations
Pros and Cons
- "One aspect that I would like to highlight is the Glue Crawler, which we utilize when working with large datasets to ensure the schema updates seamlessly without requiring end-team knowledge."
- "The point for improvement in AWS Glue would be the dynamic allocation of resources while utilizing Lambda functions."
What is our primary use case?
I use AWS Glue mostly for data ingestion or data extraction from multiple sources.
How has it helped my organization?
AWS Glue plays a central role in AI-based solutions and machine learning workflows by efficiently handling and moving data around, contributing to streamlined operations.
What is most valuable?
One aspect that I would like to highlight is the Glue Crawler, which we utilize when working with large datasets to ensure the schema updates seamlessly without requiring end-team knowledge.
Additionally, the pipeline orchestration and scheduling of ETL pipelines in AWS Glue are also highly effective. AWS Glue supports AI-driven projects and DAG transformations by facilitating efficient data handling required for machine learning workflows.
What needs improvement?
The point for improvement in AWS Glue would be the dynamic allocation of resources while utilizing Lambda functions. Currently, Lambda functions encounter a time-out error after fifteen minutes, necessitating an improvement in this area. Moreover, more practical examples and resources for advanced features would be beneficial for users.
For how long have I used the solution?
With AWS Glue, I have one year of experience.
What do I think about the stability of the solution?
In terms of stability, there are areas that could be improved, particularly with Lambda functions and step functions. There should be better scaling for parallel computation. Despite this, AWS's overall stability is quite good.
What do I think about the scalability of the solution?
AWS Glue receives a nine out of ten for scalability, although we feel certain functions should scale better for enhanced parallel computation.
How are customer service and support?
I would rate the technical support for AWS Glue as ten out of ten. The support is high-quality.
How would you rate customer service and support?
Positive
How was the initial setup?
The initial setup of AWS Glue could be straightforward for new users of AWS services, but there should be more practical examples and guides for advanced features to assist newcomers in learning complex concepts.
What about the implementation team?
Currently, I work on the implementation within our development team, focusing on cloud services.
What's my experience with pricing, setup cost, and licensing?
I find the pricing for AWS Glue quite affordable. For students or new users, AWS offers free credits, and as usage increases, the pay-as-you-go model provides flexibility without being expensive.
What other advice do I have?
I would recommend AWS Glue to others and rate the overall solution nine out of ten.
Which deployment model are you using for this solution?
Hybrid Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer. Implementer
Buyer's Guide
AWS Glue
February 2026
Learn what your peers think about AWS Glue. Get advice and tips from experienced pros sharing their opinions. Updated: February 2026.
881,757 professionals have used our research since 2012.
Site Reliability Engineer (AWS) at a financial services firm with 5,001-10,000 employees
Boosts efficiency with enhanced data processing and seamless integration
Pros and Cons
- "The AWS Glue Data Catalog provides metadata management and schema discovery. AWS Glue simplifies data transformation with automatic schema detection, incremental data updates, and integration with other AWS services."
- "AWS Glue should be more reliable and faster in processing. Enhancing the speed of data processing would be beneficial."
What is our primary use case?
We use AWS Glue for handling data-intensive tasks such as data lake creation, log analysis, machine learning pipelines, data warehouse population for analytics, and real-time data integration with AWS Lambda.
How has it helped my organization?
AWS Glue has increased efficiency and time saving. It simplifies and automates data pipeline processes, enabling faster data processing and analysis.
What is most valuable?
The AWS Glue Data Catalog provides metadata management and schema discovery. AWS Glue simplifies data transformation with automatic schema detection, incremental data updates, and integration with other AWS services.
It enables us to analyze data stored in Amazon S3 using SQL, which is manageable and cost-effective.
What needs improvement?
AWS Glue should be more reliable and faster in processing. Enhancing the speed of data processing would be beneficial.
For how long have I used the solution?
I have been using AWS Glue for more than one year.
What do I think about the stability of the solution?
AWS Glue is generally considered stable and reliable for data integration, especially for larger scale production environments. Its serverless architecture and integration with other AWS services contribute to its stability.
What do I think about the scalability of the solution?
AWS Glue is scalable because of its serverless nature, which allows for easy scaling without needing to manage any infrastructure.
How are customer service and support?
The technical support from AWS is very reliable. I would rate it nine out of ten.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
I did not use any other cloud data integration solutions before AWS Glue.
How was the initial setup?
The initial setup process for AWS Glue involved setting up Glue resources, creating roles and permissions, and developing ETL scripts and jobs. It took about half an hour.
What about the implementation team?
The deployment process required three developers.
What was our ROI?
While specific data on time and cost savings was not provided, AWS Glue's benefits include increased efficiency and time-saving.
What's my experience with pricing, setup cost, and licensing?
The approximate cost for ETL jobs is about 0.44 USD, which is mostly covered by the company. Employees do not purchase AWS Glue solutions individually.
What other advice do I have?
AWS Glue is highly recommended for data engineers due to its ability to build and maintain data pipelines, ensure data quality and integrity, and its integration with UI tools. It offers data preparation, machine learning integration, and governance.
I would rate it a ten out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Senior Vice President & Global Head AWS BU at a tech services company with 10,001+ employees
Boosts data integration with serverless architecture and advanced compatibility
Pros and Cons
- "Its ease of use, cost-effectiveness, and highly secure architecture are some of the most valuable features."
- "There could be an enhanced way of managing pure metadata management or data cataloging."
What is our primary use case?
In my role as the global lead for AWS solutions and offerings, we work with various clients, including large-scale clients, to adopt and implement AWS cloud offerings.
Our primary focus revolves around cloud lift-and-shift migration, modernization, re-platforming, rehosting, data architecture, design strategy, and implementing generative AI-specific solutions across different industries such as banking, capital insurance, energy utilities, manufacturing, automotive, semiconductor, and aerospace and defense.
For example, we have implemented AWS Glue at several client locations, utilizing its serverless data integration capabilities during the data discovery process, enterprise transformation, cleansing, transforming, and centralizing data.
How has it helped my organization?
AWS Glue has significantly improved our data quality, enhancing the data by removing duplicates and providing timely and efficient insights.
It also aids in real-time data processing, reducing effort and cost due to its serverless architecture. These features ensure we maintain the highest level of scalability, reliability, and security compliance.
What is most valuable?
AWS Glue is fully managed, providing an easy-to-use integration environment to create, run, and monitor ETL jobs. It's broadly compatible and seamlessly integrates with other AWS services like Amazon S3, Redshift, and Athena. It's flexible with data integration, manages various data formats (JSON, ORC, CSV, etc.), and is serverless, eliminating the need for infrastructure management.
Its ease of use, cost-effectiveness, and highly secure architecture are some of the most valuable features.
What needs improvement?
There could be an enhanced way of managing pure metadata management or data cataloging.
Additionally, while it covers a wide range of integrations with AWS services, integrating with certain additional or legacy products is not seamless and can be complex.
Increasing support for more programming languages and improving advanced analytics capabilities could also be beneficial.
For how long have I used the solution?
We have been working with AWS Glue for almost three-plus years now.
What do I think about the stability of the solution?
We haven't faced any stability issues with AWS Glue. It is a scalable solution, provided that the right design principles and workload management are implemented.
What do I think about the scalability of the solution?
AWS Glue is a scalable solution due to its serverless architecture and efficient design.
How are customer service and support?
My team handles interactions with AWS for technical support, ensuring our design architectures are scalable, flexible, and well-integrated. We often reach out to the AWS team to double-check our implementation mechanisms and guidelines.
How would you rate customer service and support?
Positive
How was the initial setup?
The initial setup of AWS Glue is straightforward due to its serverless architecture and fully managed nature. Specific prerequisites need to be followed, such as setting up data sources, configuring IAM permissions, creating crawlers, and running ETL jobs.
What about the implementation team?
My team escalates technical questions to AWS support, ensuring our design architectures are optimal. We have a partnership with AWS, and the technical team frequently reaches out to AWS for guidance on scalability, flexibility, and integration mechanisms.
What was our ROI?
We have seen an efficient process with AWS Glue, providing the right return on investment at the right time. It ensures efficiency for our clients, giving them the desired ROI within their expected timelines.
What other advice do I have?
Follow the right design principles and involve AWS at the right time to leverage the most current features and offerings from AWS Glue. Ensuring the right architecture will mitigate any issues. I'd rate the solution eight out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
Senior Software Developer at a computer software company with 10,001+ employees
Provides serverless mechanism, easy data transformation and automated infrastructure management
Pros and Cons
- "We no longer had to worry much about infrastructure management because AWS Glue is serverless, and Amazon takes care of the underlying infrastructure."
- "In terms of performance, if they can further optimize the execution time for serverless jobs, it would be a welcome improvement."
What is our primary use case?
I had the source data, which was unstructured and non-fixable, and my responsibility was to convert it into structured data. For this task, I used PySpark as the programming language. With Python, I implemented the creation of a data frame using Glue jobs. Since Glue jobs are a serverless mechanism, I deployed my code into the Glue job, and that's how I got the job done.
How has it helped my organization?
We no longer had to worry much about infrastructure management because AWS Glue is serverless, and Amazon takes care of the underlying infrastructure. This allowed us to focus on the code and application logic without concerns about scaling, CPU management, or handling fluctuations in flow. The serverless nature of Glue jobs relieved us from these infrastructure-related worries.
What is most valuable?
The most valuable feature of AWS Glue for me is the ability to write PySpark code and execute it within Glue. I use Glue functions to publish parameters and interact with Glue.
What needs improvement?
In terms of performance, if they can further optimize the execution time for serverless jobs, it would be a welcome improvement. Faster code execution would be beneficial. If AWS could enhance the serverless execution capabilities, like increasing CPU, RAM, and processing speed, that would be great.
For how long have I used the solution?
I have been using AWS Glue for a year.
What do I think about the stability of the solution?
In terms of stability, I haven't experienced any issues with AWS Glue. It has been quite reliable in my usage, and I would rate it around nine on a scale of ten.
What do I think about the scalability of the solution?
AWS Glue is highly scalable and reliable. The serverless nature ensures that scaling is automatically managed by AWS. It's one of the strengths of Glue, and there is no doubt about its scalability.
How are customer service and support?
The customer service and support were responsive and helpful in resolving the issues I had at that time. The response time was quick. So, overall, the support was okay.
How would you rate customer service and support?
Positive
What about the implementation team?
Our admin team was responsible for setting up the solution.
What's my experience with pricing, setup cost, and licensing?
The pricing is a bit higher than EMR. EMR is a managed Hadoop and Spark platform. AWS Glue has a higher price.
Which other solutions did I evaluate?
I found that AWS Lambda could be a possible alternative, but since AWS Lambda has a 15-minute execution limit, we opted for AWS Glue, which does not have any execution limits. I don't see any other service that can do what AWS Glue does.
Other services that could potentially do the same job as AWS Glue would require your application to be deployed in Elastic Beanstalk, EC2, or Container Services, which is another task altogether. Since AWS Glue is already doing its job perfectly, I don't think any other service can do the same job as AWS Glue.
What other advice do I have?
I would recommend that new users refer to the AWS documentation. The documentation is very well-written and easy to understand. Even new users with no prior experience with AWS should be able to get up and running quickly. I would also recommend that new users learn Python.
Overall, I would rate the solution a nine out of ten.
Disclosure: PeerSpot contacted the reviewer to collect the review and to validate authenticity. The reviewer was referred by the vendor, but the review is not subject to editing or approval by the vendor.
Developer-Data Engineer at a computer software company with 51-200 employees
Good large data processing and scalable but must overcome pipeline challenges
Pros and Cons
- "The best thing about AWS Glue is its scalability and how easy it is to process a large amount of data."
- "Setting up pipelines is challenging, especially with version control and testing requirements."
What is our primary use case?
I use AWS Glue primarily for ETL jobs. In my organization, it's just me using it as we are a small company. The IT team consists of four people, and I am the data engineering specialist.
What is most valuable?
The best thing about AWS Glue is its scalability and how easy it is to process a large amount of data. It integrates well with Redshift, S3, and AWS Glue catalog.
For processing extensive data, having a managed Spark service fulfills that role. If you're already working on AWS and you need to process a lot of data that can't be handled on a single node or server, AWS Glue will serve you well. While it's quite expensive, it's valuable for large data processing needs.
What needs improvement?
Setting up pipelines is challenging, especially with version control and testing requirements. While the initial setup is easy, it doesn't accommodate more complex development needs. You might feel hesitant about changing pipelines that are already running and processing business-critical data due to limited versioning and testing capabilities.
For how long have I used the solution?
I've been using AWS Glue since 2022, so for two years.
What do I think about the stability of the solution?
The stability of AWS Glue is fine. I haven't had any problems with it.
What do I think about the scalability of the solution?
The scalability of AWS Glue is commendable.
Which solution did I use previously and why did I switch?
Previously, in different jobs, I have worked with Databricks for ETL processes. I've also utilized Lambda functions for handling smaller data. I didn’t switch to AWS Glue, but used it in a different context.
How was the initial setup?
The initial setup of AWS Glue is easy, yet not adequate for more complex requirements. If you need to do something robust, like creating a notebook, it is straightforward.
However, when dealing with complex pipelines handling critical business data, it's hard to set up versioning and testing.
What other advice do I have?
AWS Glue receives a hesitant five out of ten from me. I recommend it if you're already on AWS and need to process large data sets. However, for smaller data volumes, I would suggest Airflow because AWS Glue can be quite expensive.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Associate Director - Delivery (Technology DWH & Data Engineer) at a media company with 1,001-5,000 employees
Efficiently integrates and transforms data but lacks in scalability
Pros and Cons
- "I like its integration and ability to handle all data-related tasks."
- "We face performance issues when using AWS Glue for data transformation and integration."
What is our primary use case?
Our primary use cases include pulling data from multiple sources and loading it into the central capacity for data transformation, integration, and processing.
What is most valuable?
I like its performance, integration, and its ability to handle all data-related tasks.
What needs improvement?
We face performance issues when using AWS Glue for data transformation and integration. It takes almost three to four hours to execute single transformations, which is a lot. We want to improve the performance to meet customer requirements.
Mainly, I am focused on improving the performance aspect because the customer is keen on this improvement.
For how long have I used the solution?
I have been using AWS Glue for five years. I am using the latest version.
What do I think about the stability of the solution?
I would rate the stability of AWS Glue a seven out of ten. There are some performance issues.
What do I think about the scalability of the solution?
I would rate the scalability of AWS Glue a six out of ten. There are over 25 users in my company.
How are customer service and support?
The customer service and support team is okay.
How would you rate customer service and support?
Neutral
How was the initial setup?
It is a cloud-based solution; there is no such installation procedure required. One DevOps is required for the maintenance of AWS Glue.
What other advice do I have?
Based on the customer scenario, I have previously recommended AWS Glue. Sometimes, customers directly request either Azure RapidAPI or AWS Glue. It depends on the specific business use case. Both tools have limitations, so it's hard to say which is best. If a customer already uses Microsoft products, I suggest going with Azure. As for a general rating, I would give AWS Glue a seven out of ten.
Overall, I would rate AWS Glue a seven out of ten because it's not about performance. It's because of how the tool is used.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Technology Specialist at a consultancy with 10,001+ employees
An interpreted language that does not need compilation, but it is very difficult to learn
Pros and Cons
- "You do not need many frameworks to run Glue."
- "It is very difficult to learn the tool and remember the syntaxes comparatively."
What is our primary use case?
We have a lot of microservices written in Glue, which are responsible for triggering based on certain events. The solution will be responsible for another container to containerize them and run over the cloud. We use the solution for different purposes, including data computing.
What is most valuable?
You do not need many frameworks to run Glue. It's an interpreted language that does not require to be compiled at all.
What needs improvement?
It is very difficult to learn the tool and remember the syntaxes comparatively. Sometimes, I face issues integrating the solution with some third-party services or services that are not a part of Glue. Such integrations take a lot of time, and not much content is available over the internet for the same.
For how long have I used the solution?
I have been using the solution for three to four years.
What do I think about the scalability of the solution?
We have eight developers on our team. My team works on almost four to five Glue services. We have four team members working on Glue, including me.
How are customer service and support?
I once faced an issue with Glue. There was a scenario where I wanted Glue to pick certain images, containerize them, and run over the code. That containerization integration wasn't happening successfully. I dropped a couple of messages in the community channel. I got good support from them, which helped me resolve my issue as quickly as possible. The community is very small, but the people are very helpful.
Which solution did I use previously and why did I switch?
I previously worked in Java, .NET, and Python. I have extensive experience with Python and .NET. Since my organization is language-independent, we have microservices written in almost all the languages, including Glue, Python, Java, and .NET.
How was the initial setup?
I'm not handling the solution's end-to-end deployment, but we have a CI/CD pipeline set up for that. The CI/CD pipeline will remain the same. It's all about how you containerize your Glue application. That is the only challenge we have faced while setting up the deployment. The rest of the configuration was pretty smooth.
What other advice do I have?
Glue is not a must-have tool. You can choose Glue if you have the capability to learn Glue as quickly as possible. There are other alternatives where you will find a lot of articles, study material, and certificates over the internet apart from Glue. If you do not have any other option, go for Glue.
If Glue is not mandatory for you, go for something else because it is difficult to learn Glue and remember the syntaxes. You will need support whenever you have a bigger integration or connectivity with third-party libraries or services. You will not receive many articles or help over the internet. Although the community is available, you need to spend some time with them to make them understand the issue.
It is not easy for a beginner to learn to use the solution for the first time. There are a few videos and courses available, but it's difficult. It's not as easy as other languages in terms of content. It's hard, but you can use it once you understand the concept.
Overall, I rate the solution seven and a half out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Download our free AWS Glue Report and get advice and tips from experienced pros
sharing their opinions.
Updated: February 2026
Product Categories
Cloud Data IntegrationPopular Comparisons
Informatica Intelligent Data Management Cloud (IDMC)
MuleSoft Anypoint Platform
webMethods.io
Palantir Foundry
Qlik Talend Cloud
Elastic Search
AWS Database Migration Service
Fivetran
Denodo
Matillion Data Productivity Cloud
SnapLogic
Zapier
Jitterbit Harmony
IBM Cloud Pak for Integration
Rivery
Buyer's Guide
Download our free AWS Glue Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Which is the best choice for cloud integration: AWS Glue or Informatica Intelligent Cloud Services (IICS)?
- Is AWS Glue a difficult solution to use if you are a complete beginner?
- Is AWS Glue effective for AWS-related products only?
- Why would you choose AWS Glue over other tools?
- What are the most common use cases for AWS Glue?
- How does Talend Open Studio compare with AWS Glue?
- Does AWS Glue offer more flexibility than other ETL (Extract, Transform, Load) tools in terms of data loading?
- Oracle ICS vs ODI
- When evaluating Cloud Data Integration, what aspect do you think is the most important to look for?
- What is data lake storage?






















