Try our new research platform with insights from 80,000+ expert users

Pentaho Data Integration and Analytics vs dbt comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 19, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

dbt
Ranking in Data Integration
18th
Average Rating
7.8
Reviews Sentiment
7.2
Number of Reviews
5
Ranking in other categories
Data Quality (8th)
Pentaho Data Integration an...
Ranking in Data Integration
10th
Average Rating
8.0
Reviews Sentiment
6.7
Number of Reviews
59
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of February 2026, in the Data Integration category, the mindshare of dbt is 1.7%, up from 0.8% compared to the previous year. The mindshare of Pentaho Data Integration and Analytics is 1.5%, up from 1.3% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration Market Share Distribution
ProductMarket Share (%)
Pentaho Data Integration and Analytics1.5%
dbt1.7%
Other96.8%
Data Integration
 

Featured Reviews

reviewer2780388 - PeerSpot reviewer
Senior Data Engineer at a pharma/biotech company with 10,001+ employees
Streamlined Data engineering and built-in lineages
The best features of dbt include lineage and Jinja templating languages that make it easy for creating pipelines. The built-in lineage feature provides a good understanding of the several layers where data is being loaded in dbt, allowing visibility from different layers into the end product. dbt has positively impacted version controlling as it has different version control steps involved. The specific improvements seen with version control in dbt are that it has helped trace the data lineage, enabled faster trace and rollbacks, and enabled safe collaboration at every scale, which has improved data quality. A return on investment has been seen from using dbt as the time has reduced while utilizing dbt in the form of data pipelines and ETL scripting. There is operational efficiency achieved, and data quality and governance have also been achieved with modular SQL and version controlling, which reduced duplication of data and data errors.
Michelle Lawson - PeerSpot reviewer
Principal Software Engineer at a tech vendor with 10,001+ employees
Streamlines complex data workflows and has supported automated customer payment notifications
I haven't used Pentaho Data Integration and Analytics in a couple of years, so I don't know how it can be improved. I was pretty pleased with it and was self-taught on it, working a lot with their team at various times, but they were surprised that I was able to learn it all by myself. The documentation is not bad, and documentation is the main thing that any product can do to make themselves better because the easier it is to find examples of what you're trying to do improves the learning curve. I think it took me the longest to learn how to do the asynchronous processing and have things wait for other things to finish processing before continuing on in the workflow. I choose 8 out of 10 because the one reason that it's been rejected at T-Mobile is that everything has to go through a provisioning process and has to get approved, meaning the actual code base has to be investigated by T-Mobile before they'll allow us to use tools of that nature. For whatever reason, we just haven't been able to get that approval; I don't know if it's on Pentaho Data Integration and Analytics' side or if it's on our side. The more you can make it easier for companies to feel comfortable that your product is secure, robustly tested and bug-free, and free of any other kind of negative hacks, the more quickly it will get accepted.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"There is operational efficiency achieved, and data quality and governance have also been achieved with modular SQL and version controlling, which reduced duplication of data and data errors."
"The product is developer-friendly."
"dbt has positively impacted my organization by allowing us to create our data pipelines much faster, going from ingestion of data to creating a data product in weeks instead of months, and we can do it in-house with the skillset we already have."
"Since we migrated from SSIS to dbt model architecture, it takes around four hours only to complete a full refresh, and the client is now happy because our downtime was drastically reduced when we perform a complete refresh of the data."
"I find the drag and drop feature in Pentaho Data Integration very useful for integration."
"We can schedule job execution in the BA Server, which is the front-end product we're using right now. That scheduling interface is nice."
"Pentaho Data Integration and Analytics has had a notably positive impact on my organization, enabling us to complete several projects efficiently."
"We use Lumada’s ability to develop and deploy data pipeline templates once and reuse them. This is very important. When the entire pipeline is automated, we do not have any issues in respect to deployment of code or with code working in one environment but not working in another environment. We have saved a lot of time and effort from that perspective because it is easy to build ETL pipelines."
"I can use Python, which is open-source, and I can run other scripts, including Linux scripts. It's user-friendly for running any object-based language. That's a very important feature because we live in a world of open-source."
"Pentaho Data Integration and Analytics has positively impacted my organization by saving costs, managing large data sets, and integrating multiple sources."
"The graphical nature of the development interface is most useful because we've got people with quite mixed skills in the team. We've got some very junior, apprentice-level people, and we've got support analysts who don't have an IT background. It allows us to have quite complicated data flows and embed logic in them. Rather than having to troll through lines and lines of code and try and work out what it's doing, you get a visual representation, which makes it quite easy for people with mixed skills to support and maintain the product. That's one side of it."
"Pentaho Data Integration and Analytics positively impacted my organization by organizing data that was scattered across many different databases and APIs, reducing a lot of manual work and centralizing it into one data warehouse."
 

Cons

"The solution must add more Python-based implementations."
"dbt can be improved as I find the co-pilot in dbt is not very good, and my team has tried using it but opted to move off it and use other co-pilots such as GitHub."
"Since dbt has a license cost, if a company is small and does not have much budget, they can explore other tools because there are other tools that provide the same functionality at a lower cost."
"Dbt is not as stable as preferred, as it has had a few outages in the current year itself, so improvement should be made in the outages section as it is not stable."
"The product needs more plugins."
"Its basic functionality doesn't need a whole lot of change. There could be some improvement in the consistency of the behavior of different transformation steps. The software did start as open-source and a lot of the fundamental, everyday transformation steps that you use when building ETL jobs were developed by different people. It is not a seamless paradigm. A table input step has a different way of thinking than a data merge step."
"I would like to see improvement when it comes to integrating structured data with text data or anything that is unstructured. Sometimes we get all kinds of different files that we need to integrate into the warehouse."
"The more you can make it easier for companies to feel comfortable that your product is secure, robustly tested and bug-free, and free of any other kind of negative hacks, the more quickly it will get accepted."
"Lumada could have more native connectors with other vendors, such as Google BigQuery, Microsoft OneDrive, Jira systems, and Facebook or Instagram. We would like to gather data from modern platforms using Lumada, which is a better approach. As a comparison, if you open Power BI to retrieve data, then you can get data from many vendors with cloud-native connectors, such as Azure, AWS, Google BigQuery, and Athena Redshift. Lumada should have more native connectors to help us and facilitate our job in gathering information from these new modern infrastructures and tools."
"Parallel execution could be better in Pentaho. It's very simple but I don't think it works well."
"I would like to see more improvements with AS400 DB2."
"​I work with the Community Edition, therefore I do not have support. There was an issue that I could not resolve with community support.​"
 

Pricing and Cost Advice

"The solution’s pricing is affordable."
"You need to go through the paid version to have Hitachi Lumada specialized support. However, if you are using the free version, then you will have only the community support. You will depend on the releases from Hitachi to solve some problem or questions that you have, such as bug fixes. You will need to wait for the newest versions or releases to solve these types of problems."
"I believe the pricing of the solution is more affordable than the competitors"
"For most development tasks, the Enterprise edition should be sufficient. It depends on the type of support that you require for your production environment."
"I think Lumada's price is fair compared to some of the others, like BusinessObjects, which is was the other thing that I used at my previous job. BusinessObject's price was more reasonable before SAP acquired it. They jacked the price up significantly. Oracle's OBIEE tool was also prohibitively expensive."
"I primarily work on the Community Version, which is available to use free of charge."
"I use it because it is free. I download from their page for free. I don't have to pay for a license. With other tools, I have to pay for the licenses. That is why I use Pentaho."
"If a company is looking for an ETL solution and wants to integrate it with their tech stack but doesn't want to spend a bunch of money, Pentaho is a good solution"
"When we first started with it, it was much cheaper. It has gone up drastically, especially since Hitachi bought out Pentaho."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
881,733 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
14%
Insurance Company
9%
Computer Software Company
7%
Manufacturing Company
7%
Financial Services Firm
13%
Educational Organization
9%
Computer Software Company
8%
Manufacturing Company
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
By reviewers
Company SizeCount
Small Business18
Midsize Enterprise17
Large Enterprise30
 

Questions from the Community

What is your experience regarding pricing and costs for dbt?
The pricing, setup cost, and licensing cost are managed by our infrastructure teams. As data engineers, we are not familiar with these details. I need to check with my infrastructure team on whethe...
What needs improvement with dbt?
I am not very familiar with dbt's version control system. I cannot identify any improvements in dbt because I am still exploring more functionality. I have been working with dbt for only three year...
What is your primary use case for dbt?
I am currently working with dbt and Snowflake together. We use dbt for data transformation purposes. We obtain the data and store the raw data directly into Snowflake, then perform all transformati...
Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes here is the feature comparison between the community and enterprise edition : https://www.hitachivantara.com/en-us/pdf/brochure/leverage-open-source-benefits-with-assurance-of-hita...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, ...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could us...
 

Also Known As

No data available
Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
 

Overview

 

Sample Customers

Information Not Available
66Controls, Providential Revenue Agency of Ro Negro, NOAA Information Systems, Swiss Real Estate Institute
Find out what your peers are saying about Pentaho Data Integration and Analytics vs. dbt and other solutions. Updated: February 2026.
881,733 professionals have used our research since 2012.