AWS Glue is a serverless data integration service offering seamless integration with AWS services like S3, Redshift, and Athena. Known for its flexibility with data formats and automation of ETL tasks, AWS Glue enhances data management and transformation.
| Product | Mindshare (%) |
|---|---|
| AWS Glue | 7.6% |
| Informatica Intelligent Data Management Cloud (IDMC) | 6.8% |
| AWS Database Migration Service | 6.4% |
| Other | 79.2% |
| Type | Title | Date |
|---|---|---|
| Category | Cloud Data Integration | May 8, 2026 |
| Product | Reviews, tips, and advice from real users | May 8, 2026 |
| Comparison | AWS Glue vs Informatica Intelligent Data Management Cloud (IDMC) | May 8, 2026 |
| Comparison | AWS Glue vs AWS Database Migration Service | May 8, 2026 |
| Comparison | AWS Glue vs MuleSoft Anypoint Platform | May 8, 2026 |
| Title | Rating | Mindshare | Recommending | Interviews |
|---|---|---|---|---|
| Informatica Intelligent Data Management Cloud (IDMC) | 4.0 | 6.8% | 92% | 214 |
| MuleSoft Anypoint Platform | 4.0 | 5.6% | 92% | 61 |
| Company Size | Count |
|---|---|
| Small Business | 11 |
| Midsize Enterprise | 6 |
| Large Enterprise | 29 |
| Company Size | Count |
|---|---|
| Small Business | 242 |
| Midsize Enterprise | 122 |
| Large Enterprise | 585 |
AWS Glue handles data extraction, transformation, and loading (ETL) and integrates with key AWS services, which lets businesses automate their data pipelines. Users value its user-friendly GUI, scalability, and cost-effectiveness, along with PySpark support for complex datasets, a robust data catalog, real-time backup capabilities, and code generation. Reviewers also call for improvements in documentation, training, and broader programming language support. Some users find the interface complex and integration with non-AWS products difficult, which drives demand for better usability and performance.
What are AWS Glue's most important features?

Businesses across industries use AWS Glue for ETL processes, data integration, and transformation. It is used to optimize data lake or data warehouse integration and to improve data cataloging and real-time integration. Its serverless design enables efficient data processing in sectors such as finance and healthcare, where handling complex, data-intensive tasks is crucial.
bp, Cerner, Expedia, FINRA, Hess, Intuit, Kellogg's, Philips, TIME, Workday
| Author info | Rating | Review Summary |
|---|---|---|
| application security engineer at Hyperspace IT India | 4.0 | I consistently use AWS Glue for ETL, finding it efficient, scalable, and well-integrated. While job logs and workflows could improve, it reduces effort significantly, offers reasonable pricing, and I highly recommend it. |
| Principal Consultant at a retailer with 1,001-5,000 employees | 4.0 | I use AWS Glue for ETL processes, including data transformation and cleansing for our data warehouse. Its serverless nature and excellent performance are beneficial, though the UI and version upgrades need improvement. Despite some challenges, Glue remains my preferred solution. |
| Data Architect at a financial services firm with 10,001+ employees | 2.5 | I use AWS Glue primarily for data ingestion, curation, and transformation, benefiting from its compatibility with Python for big data. Though clunky and code-heavy, it suits limited pipelines well, especially within AWS, although I prefer GUI-based tools. |
| Python AWS & AI Expert at a tech consulting company | 4.0 | I use AWS Glue primarily for serverless integration across various services. Its valuable features include robust transformation capabilities and seamless data preparation, though the deployment process could be simplified. It's integrated with AWS for comprehensive data workflow management. |
| AVP at a manufacturing company with 10,001+ employees | 3.0 | I use AWS Glue in my company for building data lakes and processing data from various sources like Oracle and MongoDB. It's valuable for managing large data volumes serverlessly, but its high cost, especially if systems are poorly designed, poses significant challenges. |
| Data Engineer at a tech services company with 501-1,000 employees | 3.5 | I use AWS Glue for data processing and find it easy to integrate with other AWS products. However, error handling is challenging, and I find Databricks to be more user-friendly and generally a better solution. |
| Senior Developer for cloud services at Coforge Growth Agency | 4.5 | I primarily use AWS Glue for data ingestion and extraction from multiple sources. The Glue Crawler efficiently updates schemas for large datasets, and the orchestration of ETL pipelines is effective. Improvements are needed in Lambda functions' resource allocation and timeout management. |
| Offshore Delivery \| AWS architect \| Manager - Projects at Cognizant | 4.0 | I use AWS Glue for efficient data transformation and integration with Apache Airflow, enabling smooth orchestration without cold starts like AWS Lambda. Although managing environment variables could improve, Glue's extended session capability suits our needs better than other solutions. |