Data Engineer at a tech services company with 501-1,000 employees
Great for ETL and batch processing
Pros and Cons
- "AWS Glue's most valuable features are the data catalog, including crawlers and tables, and Glue Studio, which means you don't have to use custom code."
- "If there's a cluster-related configuration, we have to configure worker nodes, which is quite a headache when processing a large amount of data."
What is our primary use case?
I mainly use AWS Glue for ETL purposes and batch processing of data.
What is most valuable?
AWS Glue's most valuable features are the data catalog, including crawlers and tables, and Glue Studio, which means you don't have to use custom code.
What needs improvement?
There are a couple of issues with AWS Glue. First, the AWS Console randomly logs off, which disrupts coding. Second, if there's a cluster-related configuration, we have to configure worker nodes, which is quite a headache when processing a large amount of data. In the next release, AWS Glue should include more transformations in Glue Studio.
For how long have I used the solution?
I've been using AWS Glue for around eight months.
Buyer's Guide
AWS Glue
January 2026
Learn what your peers think about AWS Glue. Get advice and tips from experienced pros sharing their opinions. Updated: January 2026.
881,082 professionals have used our research since 2012.
What do I think about the stability of the solution?
AWS Glue is stable.
How are customer service and support?
AWS' technical support responds within an hour on email.
How was the initial setup?
The initial setup was very easy, with only some minimal configuration. However, one drawback is that once a user name has been set, it can't be changed.
What's my experience with pricing, setup cost, and licensing?
AWS Glue is quite costly, especially for small organizations. The licensing fee is around $200 per year.
What other advice do I have?
Glue supports Spark, so if you have a team that's good with Spark, definitely go with Glue. I would rate AWS Glue as eight out of ten.
Which deployment model are you using for this solution?
Private Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
Data Engineer | Developer at a tech services company with 51-200 employees
Data integration solution that hosts metadata before the roll out of actual data
Pros and Cons
- "The key role for Glue is that it hosts our metadata before rolling out our actual data. This is the major advantage of using this solution and our clients have been very satisfied with it."
- "The technical support for this solution could be improved. In future, we would like to connect more services like Athena or Kinesis to help control more loads of data."
What is our primary use case?
The key role of Glue is that it hosts our metadata before rolling out our actual data. This is the major advantage of using this solution and our clients have been very satisfied with it.
What is most valuable?
The most valuable aspect of this solution is its automation and ability to sync data from the source to the solution phase.
What needs improvement?
The technical support for this solution could be improved. In future, we would like to connect more services like Athena or Kinesis to help control more loads of data.
For how long have I used the solution?
I have been using this solution for three years.
What do I think about the stability of the solution?
This is a stable solution. We have isolated the environment using containerization so that if anything goes wrong, we have higher levels of scalability and availability. To achieve this, we have configured multiple servers for testing, UAT and development.
What do I think about the scalability of the solution?
This is a scalable solution which is supported in our organization by Docker and Kubernetes. We have 2,000 users.
How are customer service and support?
We used a vendor with an internal IT team who provided us with architecture so that we could leverage those services and reach a solution. They have 50 people in the IT team, who continuously help us and monitor the things that we are working on.
How would you rate customer service and support?
Positive
How was the initial setup?
The initial setup was straightforward and took approximately one month. For deployment, we worked in two teams. One person handled all the scripting which we are developing for automation. Two other members handled the database and servers.
What's my experience with pricing, setup cost, and licensing?
This solution is affordable and there is an option to pay for the solution based on your usage.
What other advice do I have?
I would rate this solution a seven out of ten.
Which deployment model are you using for this solution?
Hybrid Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
ECM CONSULTANT/ARCHITECT/SOFTWARE DEVELOPER, DELUXE MN at a tech services company with 5,001-10,000 employees
Easy to perform ETL on multiple data sources, and easy to use after you learn it
Pros and Cons
- "Glue is a NoSQL-based data ETL tool that has some advantages over IIS and ISAs."
- "There is a learning curve to this tool."
What is our primary use case?
Glue is a NoSQL-based data ETL tool that has some advantages over IIS and ISAs. It is tailored and customized to use with SQL Server, which works very well in that platform.
If you want to use other data sources, the NoSQL concept makes it very easy, because missing data can be inserted as a new column or with null values.
That is not the case with many other tools. If you have on-premises tools, such as IIS, they don't manage missing data well.
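The null-filling behavior described above can be illustrated in plain Python. This is only a sketch of the idea, not how Glue implements it; the function name `align_records` and the sample records are hypothetical.

```python
from typing import Any


def align_records(records: list[dict[str, Any]]) -> list[dict[str, Any]]:
    """Give every record the same columns, filling missing values with None."""
    # Collect column names in first-seen order so the output layout is stable.
    columns: list[str] = []
    for record in records:
        for key in record:
            if key not in columns:
                columns.append(key)
    # dict.get returns None for any column a record does not have.
    return [{col: record.get(col) for col in columns} for record in records]


# Hypothetical input: the second record lacks "name" and introduces "region".
rows = [
    {"id": 1, "name": "a"},
    {"id": 2, "region": "eu"},
]
aligned = align_records(rows)
```

Each output record ends up with the columns `id`, `name`, and `region`, with `None` wherever the source record had no value.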
What is most valuable?
If you want extremely high-performance functionality, you have to use either AWS Glue or Data Lake to store the data in a temporary table. First, you will have to do some cleaning of the data, then if you need performance and speed, you have to use IIS with an IBM tool.
You have to use the right tool in the right places. For example, if you're using Oracle, you have got to use the Oracle tools. If you are using SQL, you have to use the SQL tools. There is no other tool that provides the performance.
It's context-based and project-based. In the projects that I have used, it has worked well.
What needs improvement?
There is a learning curve to this tool.
For how long have I used the solution?
I have been working with AWS Glue for four years.
Everything runs on AWS, even if it belongs to a third party. For example, if you have a Netflix subscription, it runs on AWS. We have other products or vendor subscriptions that run on AWS.
What do I think about the stability of the solution?
Undoubtedly, the cloud is built to handle failure. If you have your devices, and your resources configured correctly, you won't have any issues. I haven't seen a problem.
How are customer service and support?
You have to pay for their technical support, and depending on which level of subscription, you will receive a call within an hour; otherwise, you will have to wait for days.
Which solution did I use previously and why did I switch?
We also use Azure's Data Lake, and I worked with TIBCO in the past, though it's been a few years since we used it.
You should select the best tool for the job or the projects that are currently being worked on. TIBCO was heavily used in the previous project we worked on.
How was the initial setup?
It takes some time to learn, but once you get the hang of it, you'll be fine. It's like any other IT tool: nobody starts out as an expert, and it comes down to how much exposure you have had to the tool.
You've chosen the right tool if you understand how the data works and what it needs to do. It's like going to Home Depot to get the right tool. You can purchase a set of tools, and it will work for you, but you will still need to purchase something else.
It's one of those tools in which someone must be an expert. After that, all tools and platforms become secondary.
What's my experience with pricing, setup cost, and licensing?
With AWS Glue, you pay more, but if you want to process the data, with speed and performance, you need the correct EC2 instances.
There is a price to pay. It doesn't come free.
Technical support is a paid service, and the level of support you receive depends on your subscription. You must pay for one of the tiers, which range from $15,000 to $25,000 per year.
You sign up for a level of service, and it does not come for free. As previously stated, everything is based on performance, ELAs.
It was very expensive, at that time. If a company wants to pay the money, it makes my job easier. However, if the company or enterprise does not have the funds to pay for it, then it is a hassle.
What other advice do I have?
In that environment, there is a lot going on. There are some things that you can get for free, and there are some add-ons that you can develop or use that have been tested. It's all about convenience and service. You will get what you pay for if you pay for what you want.
I'm not a fan of any tools; it all depends on the organization I work for, where their data is, what they want to do with it, how quickly they want to get there, and what their budget is, and you work around that. For me, I would not choose one over the other, unless I know the details of the project.
I would rate AWS Glue a nine out of ten.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
CEO and Founder at a computer software company with 201-500 employees
Improved our time to implement a new ETL process and has a good price and scalability, but only works with AWS
Pros and Cons
- "The facility to integrate with S3 and the possibility to use Jupyter Notebook inside the pipeline are the most valuable features."
- "The crucial problem with AWS Glue is that it only works with AWS. It is not an agnostic tool like Pentaho. In PowerCenter, we can install the forms from Google and other vendors, but in the case of AWS Glue, we can only use AWS."
What is our primary use case?
It is a good tool for us. All the implementation in our company is done with AWS Glue. We use it to execute all the ETL processes. We have collected more or less five terabytes of information from the internet by now. We process all this data in our cloud platform and normalize the information. We first put it on a data lake that we have here on the AWS tool. After that, we use AWS Glue to transform all the information collected around the internet and put the normalized information into a data warehouse.
How has it helped my organization?
It has improved the time to implement a new ETL process by 30%. We have also seen a big improvement in the data science area.
What is most valuable?
The facility to integrate with S3 and the possibility to use Jupyter Notebook inside the pipeline are the most valuable features.
What needs improvement?
The crucial problem with AWS Glue is that it only works with AWS. It is not an agnostic tool like Pentaho. In PowerCenter, we can install the forms from Google and other vendors, but in the case of AWS Glue, we can only use AWS.
For how long have I used the solution?
I have been using this solution for two years.
What do I think about the stability of the solution?
In terms of stability, we had some problems in the past, but now, it is okay. AWS provides SLA, and the integration of the tools is good.
What do I think about the scalability of the solution?
Scalability is a very strong point of this solution as compared to other solutions like PowerCenter and Pentaho. In Pentaho, you need to install a lot of machines, but in AWS Glue, you just need to find out how many instances you need. You just put this information in a form and click okay. Magically, you have the scaled processes.
We have 35 users of this solution, and they are engineers, DevOps, and data scientists. We have a lot of plans to increase the usage of AWS Glue in 2021.
How are customer service and technical support?
In the first year of using it, we had a lot of problems with the solution. Our team found more or less five bugs if I remember correctly. Our experience with AWS support was very good. The team in the US helped us to resolve the problems and fix the bugs. We are AWS partners.
Which solution did I use previously and why did I switch?
Before AWS Glue, we worked with Talend, PowerCenter, and Pentaho. In the case of PowerCenter, the biggest problem for us was the plugins because they were too expensive. That was the negative point of PowerCenter.
In the case of Talend, the problem was that in Brazil, we didn't have professionals with the skills to work with Talend. In addition, we had to use the command-line interface, which was a terrible thing because it took more time as compared to other solutions.
In the case of Pentaho, we had the same problem as Talend. We didn't have a lot of professionals. Of course, we have some courses to train people in Pentaho. We work with the biggest companies in Brazil, and we need professionals every day, but we don't have professionals with experience in Pentaho.
How was the initial setup?
The initial setup process is totally easy. You just need to put some information in the forms, and then you just need to click some buttons, and it is complete. The process to provide a new infrastructure with AWS Glue takes from 10 minutes to an hour.
What about the implementation team?
We have all the professionals inside the company.
What's my experience with pricing, setup cost, and licensing?
Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is also good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients.
In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend.
What other advice do I have?
I would rate AWS Glue a seven out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
Senior Software Engineer at a consumer goods company with 10,001+ employees
It comes with its own data catalog and supports triggers for scheduling the ETL process
Pros and Cons
- "Data catalog and triggers are the two best features for me. AWS Glue has its own data catalog, which makes it great and really easy to use. Triggers are also really good for scheduling the ETL process."
- "The start-up time is really high right now. For instance, when you start up a new job, you have to wait for five or eight minutes before it starts. If the start-up time is reduced to one or two minutes, it will be great. It will be better to have a direct linkage to Redshift in AWS. If we can use data catalogs from Redshift, it will be so easy to create some data catalogs. Currently, we can only use data catalogs from S3."
What is our primary use case?
We are collecting some TV audience data and analyzing it.
What is most valuable?
Data catalog and triggers are the two best features for me.
AWS Glue has its own data catalog, which makes it great and really easy to use. Triggers are also really good for scheduling the ETL process.
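As a rough illustration of scheduling with triggers, the sketch below builds the arguments for a scheduled trigger as accepted by boto3's Glue `create_trigger` call. The helper `scheduled_trigger_request`, the trigger name, and the job name are hypothetical; the actual API call is left commented out because it needs AWS credentials and an existing job.

```python
def scheduled_trigger_request(trigger_name: str, job_name: str, cron: str) -> dict:
    """Build keyword arguments for a scheduled Glue trigger."""
    return {
        "Name": trigger_name,
        "Type": "SCHEDULED",
        # Glue schedules use the AWS six-field cron syntax wrapped in cron(...).
        "Schedule": f"cron({cron})",
        "Actions": [{"JobName": job_name}],
        "StartOnCreation": True,
    }


# Hypothetical names: start the job every day at 02:00 UTC.
request = scheduled_trigger_request(
    "nightly-audience-trigger", "audience-etl", "0 2 * * ? *"
)

# With AWS credentials configured, the request could be sent like this:
# import boto3
# boto3.client("glue").create_trigger(**request)
```

Keeping the request as a plain dictionary makes it easy to inspect or log before anything is created in the account.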
What needs improvement?
The start-up time is really high right now. For instance, when you start up a new job, you have to wait for five or eight minutes before it starts. If the start-up time is reduced to one or two minutes, it will be great.
It will be better to have a direct linkage to Redshift in AWS. If we can use data catalogs from Redshift, it will be so easy to create some data catalogs. Currently, we can only use data catalogs from S3.
For how long have I used the solution?
We have been using AWS Glue for approximately one and a half years.
What do I think about the stability of the solution?
There is no problem related to stability.
What do I think about the scalability of the solution?
Scalability is good. I can reduce or increase the number of DPUs, which I find very useful.
We are trying to increase the usage of AWS Glue because of customer needs. When the data increases, our application needs some more analyzers and user interfaces. We will increase our data analyzer and user interfaces.
How are customer service and technical support?
I didn't use technical support because I didn't have a big problem or issue. I just used some information from various communities and forums about the maintenance.
What's my experience with pricing, setup cost, and licensing?
The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied.
There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes.
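The arithmetic behind these two points can be sketched as follows. The rate of $0.44 per DPU-hour is the figure quoted in this review, and the 10-minute billing block is an assumption based on the time steps the reviewer describes; actual AWS rates and billing granularity vary by region and Glue version, so treat this as an illustration only. The function name `estimate_job_cost` is hypothetical.

```python
import math

RATE_PER_DPU_HOUR = 0.44    # figure quoted by the reviewer
BILLING_BLOCK_MINUTES = 10  # assumption: runtime is billed in 10-minute steps


def estimate_job_cost(dpus: int, runtime_minutes: float) -> float:
    """Estimate the cost of one job run, rounding runtime up to whole blocks."""
    # Even a very short run is billed for at least one block.
    blocks = max(1, math.ceil(runtime_minutes / BILLING_BLOCK_MINUTES))
    billed_hours = blocks * BILLING_BLOCK_MINUTES / 60
    return round(dpus * billed_hours * RATE_PER_DPU_HOUR, 2)


one_dpu_hour = estimate_job_cost(1, 60)      # a single DPU for one hour
ten_dpu_hour = estimate_job_cost(10, 60)     # cost scales linearly with DPUs
twelve_minutes = estimate_job_cost(5, 12)    # billed as two 10-minute blocks
twenty_minutes = estimate_job_cost(5, 20)    # same bill as the 12-minute run
```

Under these assumptions a 12-minute run costs the same as a 20-minute run, which is why the reviewer would prefer billing by the minute.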
What other advice do I have?
I would recommend AWS Glue. It is a great choice.
I would rate this solution a nine out of ten.
Which deployment model are you using for this solution?
Private Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
Cloud Solution Architect at a tech services company with 1-10 employees
Cost-effective and stable
Pros and Cons
- "I appreciate AWS Glue for its cost-effectiveness."
- "In terms of improvement, the performance of AWS Glue could be faster."
What is our primary use case?
AWS Glue is a versatile tool and we mostly use it for "lift and shift" server migrations.
What is most valuable?
I appreciate AWS Glue for its cost-effectiveness. The service provides a good balance between its capabilities and the cost associated with using it.
What needs improvement?
In terms of improvement, the performance of AWS Glue could be faster.
For how long have I used the solution?
I have been using AWS Glue for five years.
What do I think about the stability of the solution?
It is a stable product.
What do I think about the scalability of the solution?
It is fairly scalable.
How are customer service and support?
The partner program support is very good.
How was the initial setup?
The initial setup is not too complex. To deploy and maintain a data platform, a general data team of around four to five skilled individuals is typically required.
What's my experience with pricing, setup cost, and licensing?
For AWS Glue, there is no separate license fee. It is part of the AWS service, and you pay for its usage as part of your overall AWS bill.
What other advice do I have?
Overall, I would rate AWS Glue as an eight out of ten.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Consultant Data junior at a computer software company with 51-200 employees
User-friendly visual interface, but only a few built-in transformations
Pros and Cons
- "The most valuable feature for me is the visual interface of AWS Glue."
- "The product has only a few built-in transformations."
What is our primary use case?
The primary use cases of AWS Glue in our organization are for implementing ETL processes and for data flow.
What is most valuable?
The most valuable feature for me is the visual interface of AWS Glue. It is user-friendly and it is not complicated. Moreover, the coding part of AWS Glue allows users to upload their scripts after dropping some components. The product has flexibility and scalability, which is common in most cloud tools.
What needs improvement?
The product has only a few built-in transformations; support for building additional custom transformations could be improved in the next release.
For additional features, I would like documentation that maps the features of legacy ETL tools to their equivalents in AWS, to make it easier for users to migrate their ETL processing to the cloud. It would save time and help users find the best transformation or solution to satisfy their new business needs.
For how long have I used the solution?
I have been using this solution for three months, and I am using the latest version.
What do I think about the stability of the solution?
The stability is good; I have not faced any crashes so far.
What do I think about the scalability of the solution?
I would rate its scalability a seven out of ten.
Which solution did I use previously and why did I switch?
I used a product called SysTrack. For me, it was just a switch from SysTrack to AWS Glue.
What's my experience with pricing, setup cost, and licensing?
The pricing depends on the usage, such as the number of users, computers, and the time jobs run.
What other advice do I have?
Overall, I would rate this product a seven out of ten. It is a good product, but I have not experienced all the additional features.
Which deployment model are you using for this solution?
Private Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Net Full-Stack developer at a tech services company with 201-500 employees
A stable solution which can easily integrate with other AWS services
Pros and Cons
- "One of the best features of the solution is its ability to easily integrate with other AWS services."
- "Overall, I consider the technical support to be fine, although the response time could be faster in certain cases."
What is our primary use case?
We use the solution as a layer for loading data from the source systems.
What is most valuable?
One of the best features of the solution is its ability to easily integrate with other AWS services. Since we use AWS as our main cloud provider, it's easy to put everything together. It is very flexible when it comes to compute features. We find the solution very useful when we make use of certain scripts. In some cases, it allows us to get rid of duplicates.
What needs improvement?
Configuring connections to different database sources and targets should be easier, particularly when it comes to roles. Setting up a connection for a different target currently takes time, because you have to configure the VPC and check that the required roles have been created correctly. It would save time if this process were made easier and more user-friendly.
The technical support depends on the type of question, whether there is a need to understand additional inter-related information on multiple levels. Overall, I consider the technical support to be fine, although the response time could be faster in certain cases.
For how long have I used the solution?
I have been using AWS Glue for about two years.
What do I think about the stability of the solution?
The solution is stable.
How are customer service and support?
While the technical support can vary with the type of question, I feel that, overall, it is okay, although receipt of information could be faster in certain cases.
Which solution did I use previously and why did I switch?
We previously had experience with Database Migration Service at AWS. I recommend it over AWS Glue if one needs to do full database migration from on-premises deployment or in cases involving large volumes of data.
How was the initial setup?
I handled the installation on my own.
What's my experience with pricing, setup cost, and licensing?
I consider the price to be standard-plus when it comes to optimal usage.
What other advice do I have?
I rate AWS Glue as an eight out of ten.
Which deployment model are you using for this solution?
Hybrid Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.