No more typing reviews! Try our Samantha, our new voice AI agent.

Pentaho Data Integration and Analytics vs Spring Cloud Data Flow comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Jun 3, 2026

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Pentaho Data Integration an...
Ranking in Data Integration
8th
Average Rating
8.0
Reviews Sentiment
6.7
Number of Reviews
61
Ranking in other categories
No ranking in other categories
Spring Cloud Data Flow
Ranking in Data Integration
31st
Average Rating
7.8
Reviews Sentiment
6.8
Number of Reviews
9
Ranking in other categories
Streaming Analytics (16th)
 

Mindshare comparison

As of June 2026, in the Data Integration category, the mindshare of Pentaho Data Integration and Analytics is 1.7%, down from 1.7% compared to the previous year. The mindshare of Spring Cloud Data Flow is 1.0%, down from 1.1% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration Mindshare Distribution
ProductMindshare (%)
Pentaho Data Integration and Analytics1.7%
Spring Cloud Data Flow1.0%
Other97.3%
Data Integration
 

Featured Reviews

Michelle Lawson - PeerSpot reviewer
Principal Software Engineer at a tech vendor with 10,001+ employees
Streamlines complex data workflows and has supported automated customer payment notifications
I haven't used Pentaho Data Integration and Analytics in a couple of years, so I don't know how it can be improved. I was pretty pleased with it and was self-taught on it, working a lot with their team at various times, but they were surprised that I was able to learn it all by myself. The documentation is not bad, and documentation is the main thing that any product can do to make themselves better because the easier it is to find examples of what you're trying to do improves the learning curve. I think it took me the longest to learn how to do the asynchronous processing and have things wait for other things to finish processing before continuing on in the workflow. I choose 8 out of 10 because the one reason that it's been rejected at T-Mobile is that everything has to go through a provisioning process and has to get approved, meaning the actual code base has to be investigated by T-Mobile before they'll allow us to use tools of that nature. For whatever reason, we just haven't been able to get that approval; I don't know if it's on Pentaho Data Integration and Analytics' side or if it's on our side. The more you can make it easier for companies to feel comfortable that your product is secure, robustly tested and bug-free, and free of any other kind of negative hacks, the more quickly it will get accepted.
NitinGoyal - PeerSpot reviewer
Engineering Lead at Naukri.com
Has a plug-and-play model and provides good robustness and scalability
The solution's community support could be improved. I don't know why the Spring Cloud Data Flow community is not very strong. Community support is very limited whenever you face any problem or are stuck somewhere. I'm not sure whether it has improved in the last six months because this pipeline was set up almost two years ago. I struggled with that a lot. For example, there was limited support whenever I got an exception and sought help from Stack Overflow or different forums. Interacting with Kubernetes needs a few certificates. You need to define all the certificates within your application. With the help of those certificates, your Java application or Spring Cloud Data Flow can interact with Kubernetes. I faced a lot of hurdles while placing those certificates. Despite following the official documentation to define all the replicas, readiness, and liveliness probes within the Spring Cloud Data Flow application, it was not working. So, I had to troubleshoot while digging in and debugging the internals of Spring Cloud Data Flow at that time. It was just a configuration mismatch, and I was doing nothing weird. There was a small spelling difference between how Spring Cloud Data Flow was expecting it and how I passed it. I was just following the official documentation.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Using the solution we were able to reduce our ETL deployment time by between 10 and 20 percent, and when it comes to personnel costs, we have gained 10 percent."
"Pentaho Kettle has a very intuitive and easy to use graphical user interface (GUI) and it is possible to understand how to develop an ETL solution even when using it for the first time."
"We've seen a 50 percent reduction in our ETL development time using the free version of Pentaho, saving about 1,000 euros per week and at least 50,000 euros annually."
"When we get a question from our CEO that needs a response and that requires a little bit of legwork of pulling in from various market data, our own in-house repositories, and everything else, it allows me to arrive at the solutions much faster than having to do it through scripting in Python, coding, or anything else."
"Lumada has allowed us to interact with our employees more effectively and compensate them properly. One of the cool things is that we use it to generate commissions for our salespeople and bonuses for our warehouse people. It allows us to get information out to them in a timely fashion. We can also see where they're at and how they're doing."
"Pentaho Data Integration and Analytics has positively impacted my organization by saving costs, managing large data sets, and integrating multiple sources."
"The graphical nature of the development interface is most useful because we've got people with quite mixed skills in the team. We've got some very junior, apprentice-level people, and we've got support analysts who don't have an IT background. It allows us to have quite complicated data flows and embed logic in them. Rather than having to troll through lines and lines of code and try and work out what it's doing, you get a visual representation, which makes it quite easy for people with mixed skills to support and maintain the product. That's one side of it."
"The way it has improved our product is by giving our users the ability to do ad hoc reports, which is very important to our users. We can do predictive analysis on trends coming in for contracts, which is what our product does. The product helps users decide which way to go based on the predictive analysis done by Pentaho. Pentaho is not doing predictions, but reporting on the predictions that our product is doing. This is a big part of our product."
"The dashboards in Spring Cloud Dataflow are quite valuable."
"Overall, Spring Cloud Data Flow is a really good solution and a lot cheaper than a lot of infrastructure provided by big companies like Google or Amazon."
"The solution's most valuable feature is that it allows us to use different batch data sources, retrieve the data, and then do the data processing, after which we can convert and store it in the target."
"The ease of deployment on Kubernetes, the seamless integration for orchestration of various pipelines, and the visual dashboard that simplifies operations even for non-specialists such as quality analysts."
"There are a lot of options in Spring Cloud. It's flexible in terms of how we can use it. It's a full infrastructure."
"The best thing I like about Spring Cloud Data Flow is its plug-and-play model."
"The product is very user-friendly."
"This product will assist us in saving costs in many ways: No longer need to continue paying high fees for proprietary software, reduce the number of software engineers needed to support the product, and achieve faster time to market by using this product for our middleware."
 

Cons

"The testing and quality could really improve. Every time that there is a major release, we are very nervous about what is going to get broken. We have had a lot of experience with that, as even the latest one was broken. Some basic things get broken. That doesn't look good for Hitachi at all. If there is one place I would advise them to spend some money and do some effort, it is with the quality. It is not that hard to start putting in some unit tests so basic things don't get broken when they do a new release. That just looks horrible, especially for an organization like Hitachi."
"Pentaho Dashboard Designer - needs an improvement on the various features of the Dashboards, since there are CTools available and which help to fulfil the gaps, but it needs developers involvement."
"In terms of the flexibility to deploy in any environment, such as on-premise or in the cloud, we can do the cloud deployment only through virtual machines. We might also be able to work on different environments through Docker or Kubernetes, but we don't have an Azure app or an AWS app for easy deployment to the cloud. We can only do it through virtual machines, which is a problem, but we can manage it. We also work with Databricks because it works with Spark. We can work with clustered servers, and we can easily do the deployment in the cloud. With a right-click, we can deploy Databricks through the app on AWS or Azure cloud."
"To improve Pentaho Data Integration and Analytics, I suggest developing capabilities for cloud-based solutions instead of being solely on-premises."
"In the Community edition, it would be nice to have more modules that allow you to code directly within the application. It could have R or Python completely integrated into it, but this could also be because I'm using an older version."
"The support from Hitachi is not the greatest, the fixing of bugs can take a really long time."
"Since Hitachi took over, I don't feel that the documentation is as good within the solution. It used to have very good help built right in."
"The technical support does not reply in a timely manner. The support they have in place does not work very well."
"There were instances of deployment pipelines getting stuck, and the dashboard not always accurately showing the application status, requiring manual intervention such as rerunning applications or refreshing the dashboard."
"On the tool's online discussion forums, you may get stuck with an issue, making it an area where improvements are required."
"The solution's community support could be improved."
"The configurations could be better. Some configurations are a little bit time-consuming in terms of trying to understand using the Spring Cloud documentation."
"I would improve the dashboard features as they are not very user-friendly."
"The documentation on offer is not that good."
"The visual user interface could use some help; it needs improvement."
"Some of the features, like the monitoring tools, are not very mature and are still evolving."
 

Pricing and Cost Advice

"It does seem a bit expensive compared to the serverless product offering. Tools, such as Server Integration Services, are "almost" free with a database engine. It is comparable to products like Alteryx, which is also very expensive."
"We did a two or three-year deal the last time we did it. As compared to other solutions, at least so far in our experience, it has been very affordable. The licensing is by component. So, you need to make sure you only license the components that you really intend to use. I am not sure if we have relicensed after the Hitachi acquisition, but previously, multi-year renewals resulted in a good discount. I'm not sure if this is still the case. We've had the full suite for a lot of years, and there is just the initial cost. I am not aware of any additional costs."
"I believe the pricing of the solution is more affordable than the competitors"
"The solution reduced our ETL development time by a lot because a whole project used to take about a month to get done previously. After having Lumada, it took just a week. For a big company in Brazil, it saves a team at least $10,000 a month."
"You don't need the Enterprise Edition, you can go with the Community Edition. That way you can use it for free and, for free, it's a pretty good tool to use."
"I primarily work on the Community Version, which is available to use free of charge."
"Sometimes we provide the licenses or the customer can procure their own licenses. Previously, we had an enterprise license. Currently, we are on a community license as this is adequate for our needs."
"There was a cost analysis done and Pentaho did favorably in terms of cost."
"The solution provides value for money, and we are currently using its community edition."
"This is an open-source product that can be used free of charge."
"If you want support from Spring Cloud Data Flow there is a fee. The Spring Framework is open-source and this is a free solution."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
900,644 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
16%
Educational Organization
8%
Construction Company
8%
Government
7%
Financial Services Firm
18%
Computer Software Company
10%
Retailer
8%
Manufacturing Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business18
Midsize Enterprise17
Large Enterprise32
By reviewers
Company SizeCount
Small Business3
Midsize Enterprise1
Large Enterprise5
 

Questions from the Community

Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes here is the feature comparison between the community and enterprise edition : https://www.hitachivantara.com/en-us/pdf/brochure/leverage-open-source-benefits-with-assurance-of-hita...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, ...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could us...
What needs improvement with Spring Cloud Data Flow?
There were instances of deployment pipelines getting stuck, and the dashboard not always accurately showing the application status, requiring manual intervention such as rerunning applications or r...
What is your primary use case for Spring Cloud Data Flow?
We had a project for content management, which involved multiple applications each handling content ingestion, transformation, enrichment, and storage for different customers independently. We want...
What advice do you have for others considering Spring Cloud Data Flow?
I would definitely recommend Spring Cloud Data Flow. It requires minimal additional effort or time to understand how it works, and even non-specialists can use it effectively with its friendly docu...
 

Also Known As

Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
No data available
 

Overview

 

Sample Customers

66Controls, Providential Revenue Agency of Ro Negro, NOAA Information Systems, Swiss Real Estate Institute
Information Not Available
Find out what your peers are saying about Pentaho Data Integration and Analytics vs. Spring Cloud Data Flow and other solutions. Updated: June 2026.
900,644 professionals have used our research since 2012.