No more typing reviews! Try our Samantha, our new voice AI agent.

Pentaho Data Integration and Analytics vs dbt comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 19, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

dbt
Ranking in Data Integration
9th
Average Rating
8.0
Reviews Sentiment
6.6
Number of Reviews
10
Ranking in other categories
Data Quality (5th)
Pentaho Data Integration an...
Ranking in Data Integration
7th
Average Rating
8.0
Reviews Sentiment
6.7
Number of Reviews
60
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of May 2026, in the Data Integration category, the mindshare of dbt is 1.4%, down from 1.5% compared to the previous year. The mindshare of Pentaho Data Integration and Analytics is 1.7%, up from 1.6% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration Mindshare Distribution
ProductMindshare (%)
Pentaho Data Integration and Analytics1.7%
dbt1.4%
Other96.9%
Data Integration
 

Featured Reviews

Harshwardhan Gullapalli - PeerSpot reviewer
AI Engineer at a educational organization with 51-200 employees
Data pipelines have improved financial accuracy and now build transparent audit-ready reports
As for something I wish we had, dbt's native support for Python transformations came later, and we did some complex financial classification calculations that felt clunky in pure SQL. We ended up writing Python in our n8n workflows and then fed the results back into dbt, which created a bit of a split-brain situation. If we would have had dbt Python models earlier, we could have kept that logic unified. Managing multiple reporting standards was our biggest operational pain point with dbt. We were running UAE corporate tax compliance and IFRS disclosure workflows simultaneously for different clients, and dbt does not have a native concept of multi-tenant or multi-standard project organization. Everything lives in one flat structure, so we had to build more conventions: separate schema folders for IFRS models versus UACT models, custom macros to tag models by compliance regime, and environment variables to control which set of transformations run for which client.
Michelle Lawson - PeerSpot reviewer
Principal Software Engineer at a tech vendor with 10,001+ employees
Streamlines complex data workflows and has supported automated customer payment notifications
I haven't used Pentaho Data Integration and Analytics in a couple of years, so I don't know how it can be improved. I was pretty pleased with it and was self-taught on it, working a lot with their team at various times, but they were surprised that I was able to learn it all by myself. The documentation is not bad, and documentation is the main thing that any product can do to make themselves better because the easier it is to find examples of what you're trying to do improves the learning curve. I think it took me the longest to learn how to do the asynchronous processing and have things wait for other things to finish processing before continuing on in the workflow. I choose 8 out of 10 because the one reason that it's been rejected at T-Mobile is that everything has to go through a provisioning process and has to get approved, meaning the actual code base has to be investigated by T-Mobile before they'll allow us to use tools of that nature. For whatever reason, we just haven't been able to get that approval; I don't know if it's on Pentaho Data Integration and Analytics' side or if it's on our side. The more you can make it easier for companies to feel comfortable that your product is secure, robustly tested and bug-free, and free of any other kind of negative hacks, the more quickly it will get accepted.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The product is developer-friendly."
"Since we migrated from SSIS to dbt model architecture, it takes around four hours only to complete a full refresh, and the client is now happy because our downtime was drastically reduced when we perform a complete refresh of the data."
"It is very convenient because at the end, I have the opportunity to orchestrate all my transformations in just one single place, rather than having them spread out."
"Overall, I find dbt to be optimized compared to other tools."
"The most concrete outcome was a significant reduction in data errors reaching our downstream AI models, and after implementing dbt's testing layer, we caught roughly 70% of those issues at the transformation stage itself, before they ever touched the model."
"From a developer point of view, I find the ease of development and the code to be the most useful capabilities of dbt."
"I would say the best feature or the most desirable feature for dbt is the ability to write everything in code."
"There is operational efficiency achieved, and data quality and governance have also been achieved with modular SQL and version controlling, which reduced duplication of data and data errors."
"Previously, I had three people to collect all the data and integrate all Excel spreadsheets, and it took them a day or two, but now I can do this work in about 15 minutes."
"Using the solution we were able to reduce our ETL deployment time by between 10 and 20 percent, and when it comes to personnel costs, we have gained 10 percent."
"It has a really friendly user interface, which is its main feature. The process of automating or combining SQL code with some databases and doing the automation is great and really convenient."
"My advice for anybody who is considering this product is if they're looking for any kind of custom transformation, or they're gleaning data from multiple sources and sending it to multiple destinations, I definitely recommend this tool."
"Pentaho Data Integration and Analytics positively impacted my organization by organizing data that was scattered across many different databases and APIs, reducing a lot of manual work and centralizing it into one data warehouse."
"As a result of one of the projects that we did in the Middle East, we achieved the main goal of fully digitalizing their population census."
"Lumada Data Integration definitely helps with decision-making for our deans and upper executives, and the fact that we're able to reuse some of the ETLs with the metadata injection saves us time and costs while making it a pretty quick process for our developers to learn and pick up ETLs from each other."
"The way it has improved our product is by giving our users the ability to do ad hoc reports, which is very important to our users."
 

Cons

"Since dbt has a license cost, if a company is small and does not have much budget, they can explore other tools because there are other tools that provide the same functionality at a lower cost."
"The solution must add more Python-based implementations."
"If you compare the cost of those packages with dbt alone, it is more expensive to use dbt alone."
"The initial setup of dbt is somewhat complex."
"If I needed to name a few areas for improvement, I would mention the migration of code to Git and GitHub, which sometimes fails and can be confusing for developers during handover."
"Managing multiple reporting standards was our biggest operational pain point with dbt."
"Dbt is not as stable as preferred, as it has had a few outages in the current year itself, so improvement should be made in the outages section as it is not stable."
"Every upgrade is a little bit of a risk for us because we do not know if the workarounds that we developed will be available for the next version."
"I was not happy with the Pentaho Report Designer because of the way it was set up."
"Lumada could have more native connectors with other vendors, such as Google BigQuery, Microsoft OneDrive, Jira systems, and Facebook or Instagram."
"I also found, in my case, that the statistical data input wasn't working (.sas7bdat input wasn't working)."
"I would like to see improvements made for real-time data processing."
"Pentaho Dashboard Designer - needs an improvement on the various features of the Dashboards, since there are CTools available and which help to fulfil the gaps, but it needs developers involvement."
"The price of the regular version is not reasonable and it should be lower."
"In terms of the flexibility to deploy in any environment, such as on-premise or in the cloud, we can do the cloud deployment only through virtual machines. We might also be able to work on different environments through Docker or Kubernetes, but we don't have an Azure app or an AWS app for easy deployment to the cloud. We can only do it through virtual machines, which is a problem, but we can manage it. We also work with Databricks because it works with Spark. We can work with clustered servers, and we can easily do the deployment in the cloud. With a right-click, we can deploy Databricks through the app on AWS or Azure cloud."
"The more you can make it easier for companies to feel comfortable that your product is secure, robustly tested and bug-free, and free of any other kind of negative hacks, the more quickly it will get accepted."
 

Pricing and Cost Advice

"The solution’s pricing is affordable."
"You need to go through the paid version to have Hitachi Lumada specialized support. However, if you are using the free version, then you will have only the community support. You will depend on the releases from Hitachi to solve some problem or questions that you have, such as bug fixes. You will need to wait for the newest versions or releases to solve these types of problems."
"I mostly used the open-source version. I didn't work with a license."
"When we first started with it, it was much cheaper. It has gone up drastically, especially since Hitachi bought out Pentaho."
"The pricing has been pretty good. I'm used to using everything open-source or freeware-based. I understand that organizations need to make sure that the solutions are secure, and that's basically where I hit a roadblock in my current organization. They needed to ensure that we had a license and we had a secure way of accessing it so that no outside parties could get access to our data, but in terms of pricing, considering how much other teams are spending on cloud solutions or even their existing solutions, its price point is pretty good. At this time, there are no additional costs. We just have the licensing fees."
"It does seem a bit expensive compared to the serverless product offering. Tools, such as Server Integration Services, are "almost" free with a database engine. It is comparable to products like Alteryx, which is also very expensive."
"If a company is looking for an ETL solution and wants to integrate it with their tech stack but doesn't want to spend a bunch of money, Pentaho is a good solution"
"You don't need the Enterprise Edition, you can go with the Community Edition. That way you can use it for free and, for free, it's a pretty good tool to use."
"I believe the pricing of the solution is more affordable than the competitors"
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
893,221 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
16%
Insurance Company
8%
Manufacturing Company
8%
Comms Service Provider
7%
Financial Services Firm
15%
Educational Organization
8%
Government
7%
Manufacturing Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business2
Midsize Enterprise3
Large Enterprise5
By reviewers
Company SizeCount
Small Business18
Midsize Enterprise17
Large Enterprise31
 

Questions from the Community

What is your experience regarding pricing and costs for dbt?
I mentioned the cost as one of the advantages, specifically the license cost.
What needs improvement with dbt?
With AI, everything is advancing so fast, so I would say that the most important thing is to try to integrate with more platforms. As of now, dbt has a strong integration with AWS and with Snowflak...
What is your primary use case for dbt?
I am currently working with dbt and use dbt's modular SQL models.
Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes here is the feature comparison between the community and enterprise edition : https://www.hitachivantara.com/en-us/pdf/brochure/leverage-open-source-benefits-with-assurance-of-hita...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, ...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could us...
 

Also Known As

No data available
Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
 

Overview

 

Sample Customers

Information Not Available
66Controls, Providential Revenue Agency of Ro Negro, NOAA Information Systems, Swiss Real Estate Institute
Find out what your peers are saying about Pentaho Data Integration and Analytics vs. dbt and other solutions. Updated: April 2026.
893,221 professionals have used our research since 2012.