AWS Glue vs Pentaho Data Integration and Analytics comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

ROI

AWS Glue (sentiment score 5.9): Organizations find AWS Glue efficient and cost-effective despite overhead costs, though some consider alternatives due to budget constraints.
Pentaho (sentiment score 7.9): Pentaho offers cost-effective integration, reducing ETL time, lowering expenses, and enhancing competitiveness with open-source flexibility and efficiency.
I advocate using Glue in such cases.
 

Customer Service

AWS Glue (sentiment score 6.5): AWS Glue customer service is praised for responsiveness and effectiveness, with mixed feedback on support speed, costs, and consistency.
Pentaho (sentiment score 5.2): Users rely on community support over customer service due to mixed experiences, despite responsive technical support and Hitachi's involvement.
For complex Glue-related problems such as job failures or permission issues, their documentation is good, but having direct access to support helps cut down troubleshooting time significantly.
AWS's documentation is reliable, and careful reference often resolves missed upgrade details.
Communication with the vendor is challenging.
 

Scalability Issues

AWS Glue (sentiment score 7.8): AWS Glue is highly scalable and serverless, praised for easy resource management, but needs better parallel computation.
Pentaho (sentiment score 7.3): Pentaho excels in scalability and efficient data handling but faces challenges with exceptionally large data and complex growth scenarios.
It is beneficial to upgrade jobs, and we conduct extensive testing in development before migrating to production.
It can easily handle data from one terabyte to 100 terabytes or more, scaling nicely with larger datasets.
Pentaho Data Integration handles larger datasets better.
 

Stability Issues

AWS Glue (sentiment score 7.9): AWS Glue is stable and reliable with minor issues, scaling well, and efficient due to serverless architecture and tool integration.
Pentaho (sentiment score 7.1): Pentaho Data Integration offers reliability for small to midsize operations but may lag and freeze with complex uses.
As a managed service, it reduces management burdens.
It's pretty stable; however, it struggles when dealing with smaller amounts of data.
 

Room For Improvement

AWS Glue faces challenges with startup times, interface complexity, language limitations, cost, performance, integration, and multi-cloud compatibility.
Pentaho needs improvements in big data performance, error handling, UI, scheduling, backward compatibility, cloud integration, and Python support.
A more user-friendly and simpler process would help speed up the deployment process.
With AWS, I gather data from multiple sources, clean it up, normalize it, de-duplicate it, and make it presentable.
Learning the latest functionalities is crucial, and while challenging, it is a vital part of staying current and ensuring an efficient ETL process.
Pentaho Data Integration is very friendly, but it is not very useful when there isn't a lot of data to handle.
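The gather, clean, normalize, and de-duplicate workflow that one reviewer describes above can be sketched in a few lines of plain Python. This is only an illustration of the general pattern, not AWS Glue or Pentaho code; the record fields (`name`, `email`), the sample sources, and the helper names are all hypothetical.

```python
def normalize(record):
    """Normalize one raw record: collapse whitespace, title-case the name, lowercase the email."""
    return {
        "name": " ".join(record.get("name", "").split()).title(),
        "email": record.get("email", "").strip().lower(),
    }


def clean_and_dedupe(*sources):
    """Merge several record lists, drop rows without an email, keep the first row per email."""
    seen = set()
    result = []
    for source in sources:
        for raw in source:
            rec = normalize(raw)
            if not rec["email"]:
                continue  # cleaning step: discard incomplete rows
            if rec["email"] in seen:
                continue  # de-duplication step: email is the (assumed) unique key
            seen.add(rec["email"])
            result.append(rec)
    return result


# Two hypothetical sources that overlap on one customer.
crm = [{"name": "  ada   lovelace ", "email": "Ada@Example.com "}]
billing = [
    {"name": "Ada Lovelace", "email": "ada@example.com"},
    {"name": "No Email", "email": ""},
]
rows = clean_and_dedupe(crm, billing)
```

Normalizing before comparing is what lets the two spellings of the same customer collapse into one row; in a real pipeline the choice of key would depend on the data.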
 

Setup Cost

AWS Glue offers flexible, efficient serverless architecture but can be costly and unpredictable, especially for smaller organizations.
Pentaho offers a cost-effective solution with its free Community Edition and affordable subscription-based Enterprise Edition for varying needs.
AWS charges based on runtime, which can be quite pricey.
Costing depends on resource usage, and cost optimization may involve redesigning jobs for flexibility.
The smallest cost for a project is around €700, while the largest can reach up to €7,000 based on the scale of the usage.
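Because reviewers note that AWS charges based on runtime and resource usage, a rough estimate can be worked out as capacity × hours × rate. The sketch below assumes billing in Data Processing Units (DPUs) per hour; the function name is hypothetical and the rate passed in is a placeholder, so check the current AWS pricing page for your region before relying on any figure.

```python
def estimate_job_cost(dpus, runtime_minutes, rate_per_dpu_hour):
    """Rough runtime-based cost: capacity units x hours x hourly rate.

    Ignores minimum billing durations and any other charges.
    """
    return dpus * (runtime_minutes / 60) * rate_per_dpu_hour


# Placeholder inputs: 10 DPUs, a 30-minute run, an assumed rate of 0.44 per DPU-hour.
single_run = estimate_job_cost(dpus=10, runtime_minutes=30, rate_per_dpu_hour=0.44)

# The same job run once a day for 30 days.
monthly = single_run * 30
```

Halving either the runtime or the capacity halves the estimate, which is why reviewers mention redesigning jobs as a cost-optimization step.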
 

Valuable Features

AWS Glue excels with its easy interface, scalable ETL processing, seamless AWS integration, affordability, and serverless architecture.
Pentaho provides an intuitive, open-source platform for efficient ETL development and data integration with minimal coding and broad compatibility.
AWS Glue has reduced efforts by 60%, which is the main benefit.
AWS Glue also enhances job scheduling and orchestration capabilities, integrating with AWS Glue Studio for comprehensive data workflow management.
For ETL, I feel the performance is excellent. If I create jobs in a standard way, the performance is great, and maintenance is also seamless.
I find the drag and drop feature in Pentaho Data Integration very useful for integration.
 

Categories and Ranking

AWS Glue
Average Rating: 7.8
Reviews Sentiment: 6.9
Number of Reviews: 50
Ranking in other categories: Cloud Data Integration (1st)

Pentaho Data Integration an...
Average Rating: 8.0
Reviews Sentiment: 6.9
Number of Reviews: 53
Ranking in other categories: Data Integration (18th)
 

Featured Reviews

Saurabh Jaiswal - PeerSpot reviewer
Enables seamless integration and data preparation with robust transformation capabilities
AWS Glue's most valuable features include its transformation capabilities, which provide data quality and shape for processing in ML or AI models. It offers transformation options on canvas or through ETL pipelines, notebooks, and code. Additionally, it supports data preparation, cleaning, and filtering seamlessly. AWS Glue also enhances job scheduling and orchestration capabilities, integrating with AWS Glue Studio for comprehensive data workflow management.
Aqeel UR Rehman - PeerSpot reviewer
Transform data efficiently with rich features, but there are challenges with large datasets
Currently, I am using Pentaho Data Integration for transforming data and then loading it into different platforms. Sometimes, I use it in conjunction with AWS, particularly S3 and Redshift, to execute the copy command for data processing. Pentaho Data Integration is easy to use, especially when…
865,295 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
AWS Glue
Financial Services Firm: 21%
Computer Software Company: 12%
Manufacturing Company: 8%
Government: 6%

Pentaho Data Integration and Analytics
Financial Services Firm: 17%
Computer Software Company: 12%
Government: 7%
Manufacturing Company: 6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

How do you select the right cloud ETL tool?
AWS Glue and Azure Data Factory are the best-performing cloud services for ELT.
How does Talend Open Studio compare with AWS Glue?
We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in...
What are the most common use cases for AWS Glue?
AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or ma...
Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes, here is the feature comparison between the community and enterprise editions: https://www.hitachivantara.com/en-us/pdf/brochure/leverage-open-source-benefits-with-assurance-of-hita...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, ...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could us...
 

Also Known As

AWS Glue: No data available
Pentaho Data Integration and Analytics: Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
 

Sample Customers

AWS Glue: bp, Cerner, Expedia, FINRA, Hess, Intuit, Kellogg's, Philips, TIME, Workday
Pentaho Data Integration and Analytics: 66Controls, Providential Revenue Agency of Río Negro, NOAA Information Systems, Swiss Real Estate Institute
Find out what your peers are saying about AWS Glue vs. Pentaho Data Integration and Analytics and other solutions. Updated: July 2025.