

AWS Glue and Stitch are competitors in the data integration market. AWS Glue has the upper hand due to its strong integration with other AWS services, lower cost for large-scale operations, and flexibility in handling data processing tasks.
Features: AWS Glue stands out with its built-in data catalog, which enhances metadata management efficiency and supports seamless integration with AWS services like S3. The serverless architecture ensures cost-effective scaling for Spark jobs. Users benefit from its ability to execute PySpark code, making it attractive for large-scale data processing. Stitch is known for its straightforward integration capabilities and efficient data handling without extensive coding. Users find its ability to manage deleted records and its ease of use without requiring a dedicated data engineering team particularly valuable.
Room for Improvement: AWS Glue could enhance integration with databases like Redshift and reduce startup times. The setup process is complex for those lacking advanced knowledge, and better documentation is suggested. Stitch would benefit from more connectors and enhanced error explanations. Users also face challenges in licensing, indicating a need for simplified processes.
Ease of Deployment and Customer Service: AWS Glue offers robust support services, though users note the high cost of premium support and slow response times for basic queries. Stitch is praised for quick deployment and basic maintenance information, but its support could be improved for complex use cases.
Pricing and ROI: AWS Glue's pay-as-you-go model is cost-effective for scalable needs but can be expensive for smaller operations. Despite some users reporting high annual expenses, it provides good ROI in scenarios with limited pipelines. Stitch, perceived as costlier due to its licensing process, remains appealing because of its simplicity and fair pricing, offering effective ROI over time.
I advocate using Glue in such cases.
We've got a project at the moment that we estimated the integration was going to be around $200,000 to $300,000, and we've been able to achieve the integration for less than a tenth of that, doing it in-house using Stitch.
I think I have seen a return on investment with Stitch in terms of time saved.
Upgrades occur every four months, and new developments coincide with version updates.
For complex Glue-related problems such as job failures or permission issues, their documentation is good, but having direct access to support helps cut down troubleshooting time significantly.
The best skill set they've got is that they know when the issue is outside of their knowledge, and they escalate really quickly so that we get to the right people when we need them.
It is beneficial to upgrade jobs, and we conduct extensive testing in development before migrating to production.
It can easily handle data from one terabyte to 100 terabytes or more, scaling nicely with larger datasets.
I would advise that you should not use Stitch if you are going to build a big number of screens or a heavy UI application with complex designs because it is not ready for that kind of work.
We just spin up a new server and add it into a cluster, and then it pretty much manages the load balancing across all the servers in the cluster.
Stitch can handle a massive amount of data, so I do not think that is a problem.
AWS Glue is highly stable, and I would rate its stability as nine.
Stitch is really stable.
I didn't notice any explicit crashes or bugs with Stitch, as it is actually stable.
Migrating jobs from version 3.0 to 4.0 can present compatibility issues.
With AWS, I gather data from multiple sources, clean it up, normalize it, de-duplicate it, and make it presentable.
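The clean, normalize, and de-duplicate steps this reviewer describes can be sketched in a few lines of plain Python. This is only an illustration of the idea, not Glue's API or the reviewer's actual job: the `clean_records` function, the `email` key, and the sample rows are all hypothetical, and a real Glue job would more likely express the same steps in PySpark.

```python
def clean_records(records, key="email"):
    """Clean, normalize, and de-duplicate a list of dict records.

    `key` names the (hypothetical) field used to identify duplicates.
    """
    seen = set()
    cleaned = []
    for record in records:
        # Clean: strip surrounding whitespace from every string value.
        row = {k: v.strip() if isinstance(v, str) else v for k, v in record.items()}
        # Drop rows where the identifying field is missing or empty.
        if not row.get(key):
            continue
        # Normalize: compare identifiers case-insensitively.
        row[key] = str(row[key]).lower()
        # De-duplicate: keep only the first row seen for each identifier.
        if row[key] in seen:
            continue
        seen.add(row[key])
        cleaned.append(row)
    return cleaned


sample = [
    {"email": " A@x.com ", "n": 1},
    {"email": "a@x.com", "n": 2},   # duplicate after normalization
    {"email": "  ", "n": 3},        # empty after cleaning
    {"n": 4},                       # missing identifier
]
rows = clean_records(sample)
```

Keeping the first occurrence of each identifier is one possible policy; a pipeline that prefers the most recent record would sort by a timestamp before de-duplicating.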
A more user-friendly and simpler process would help speed up the deployment process.
Stitch cannot connect to all databases or third-party apps, such as Amazon Seller.
I saved a lot of time getting from having no design inspiration to having full-fledged designs.
I suggest developing a featured interface that is easier to use.
Costing depends on resource usage, and cost optimization may involve redesigning jobs for flexibility.
AWS charges based on runtime, which can be quite pricey.
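To make the runtime-based billing in these comments concrete, here is a rough cost estimate, assuming Glue jobs are billed per DPU-hour. The function name, the default rate of 0.44 per DPU-hour, and the one-minute billing minimum are placeholder assumptions for illustration; actual rates and minimums vary by region and Glue version, so check the current pricing page before relying on any figure.

```python
def estimate_job_cost(dpus, runtime_minutes, rate_per_dpu_hour=0.44, min_billed_minutes=1):
    """Estimate the cost of one job run.

    rate_per_dpu_hour and min_billed_minutes are assumed placeholder
    values, not authoritative pricing.
    """
    # Runs shorter than the billing minimum are charged for the minimum.
    billed_minutes = max(runtime_minutes, min_billed_minutes)
    # Cost = workers (DPUs) x billed hours x hourly rate per DPU.
    return round(dpus * (billed_minutes / 60) * rate_per_dpu_hour, 2)


# Hypothetical job: 10 DPUs running for 30 minutes.
single_run = estimate_job_cost(10, 30)
# The same job scheduled once a day for 30 days.
monthly = round(single_run * 30, 2)
```

Under these assumptions, cost grows linearly with both worker count and runtime, which is why reviewers mention redesigning jobs as a cost optimization: halving the runtime or the DPU count halves the bill.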
The smallest cost for a project is around €700, while the largest can reach up to €7,000 based on the scale of the usage.
My experience with pricing, setup cost, and licensing is that it is pretty easy, pretty straightforward, and the cheapest of them all.
The cost of the seats is actually cheaper by the amount of value that you're adding to the business.
My experience with pricing, setup cost, and licensing for Stitch shows that it is a bit costlier.
For ETL, I feel the performance is excellent. If I create jobs in a standard way, the performance is great, and maintenance is also seamless.
AWS Glue's most valuable features include its transformation capabilities, which provide data quality and shape for processing in ML or AI models.
AWS Glue has reduced efforts by 60%, which is the main benefit.
The image to HTML conversion helps me in my projects because it allows you to acquire professional designs without starting from scratch.
We take one week of time to design an application, but now we can design that application within two days, which is 16 hours.
We can easily move and do time-to-market for a new pipeline and new integration, positively impacting our organization.

| Product | Mindshare (%) |
|---|---|
| AWS Glue | 8.8% |
| Stitch | 1.3% |
| Other | 89.9% |

| Company Size | Count |
|---|---|
| Small Business | 11 |
| Midsize Enterprise | 6 |
| Large Enterprise | 32 |

| Company Size | Count |
|---|---|
| Small Business | 4 |
| Midsize Enterprise | 3 |
| Large Enterprise | 4 |

AWS Glue is a serverless cloud data integration tool that facilitates the discovery, preparation, movement, and integration of data from multiple sources for machine learning (ML), analytics, and application development. The solution includes additional productivity and data ops tooling for running jobs, implementing business workflows, and authoring.
AWS Glue allows users to connect to more than 70 diverse data sources and manage data in a centralized data catalog. The solution facilitates visual creation, running, and monitoring of extract, transform, and load (ETL) pipelines to load data into users' data lakes. This Amazon product seamlessly integrates with other native applications of the brand and allows users to search and query cataloged data using Amazon EMR, Amazon Athena, and Amazon Redshift Spectrum.
The solution also utilizes application programming interface (API) operations to transform users' data, create runtime logs, store job logic, and create notifications for monitoring job runs. The console of AWS Glue connects all of these services into a managed application, facilitating the monitoring and operational processes. The solution also performs provisioning and management of the resources required to run users' workloads in order to minimize manual work time for organizations.
AWS Glue Features
AWS Glue groups its features into four categories - discover, prepare, integrate, and transform. Within those groups are the following features:
AWS Glue Benefits
AWS Glue offers a wide range of benefits for its users. These benefits include:
Reviews from Real Users
Mustapha A., a cloud data engineer at Jems Groupe, likes AWS Glue because it is a product that is great for serverless data transformations.
Liana I., CEO at Quark Technologies SRL, describes AWS Glue as highly scalable and reliable, with a beneficial pay-as-you-go pricing model.
Stitch is a cloud-based ETL service designed to synchronize data between a variety of sources and destinations, offering robust and scalable data integration capabilities.
Stitch facilitates seamless data integration, providing users with real-time data movement across their tech stack. Its flexible architecture allows easy connectivity between diverse systems and ensures data consistency. With its user-friendly setup, Stitch empowers data teams to efficiently manage complex data workflows, enhancing decision-making and operational efficiency.
What are Stitch's most important features?
In industries like e-commerce and finance, Stitch is instrumental in integrating data from sales platforms and financial systems to analytics tools. Retailers can combine online and offline sales data, while financial firms streamline data into centralized repositories, ensuring comprehensive analysis and reporting.