

IBM Cloud Pak for Data and dbt both cater to data management needs. While dbt leads in speed and transformation features, IBM Cloud Pak for Data offers superior integrations and AI capabilities.
Features: IBM Cloud Pak for Data offers comprehensive data management, seamless integration, and AI functionalities with tools like Watson Studio and Watson Machine Learning. It enhances data governance through cognitive capabilities. dbt stands out for its speedy, efficient data transformations using ELT architecture, making it user-friendly for complex data environments with SQL-based tools.
Room for Improvement: IBM Cloud Pak for Data could improve performance and ease of use, and offer more out-of-the-box integrations and analytics enhancements. More responsive customer support and a simpler installation process are also needed. dbt, while benefiting from being open-source, needs better copilot features, better Python integration, and fewer outages. Its migration processes and package management also have room for improvement.
Ease of Deployment and Customer Service: IBM Cloud Pak for Data offers flexible deployment across hybrid, public cloud, and on-premises environments. Its customer support is generally good but could be more responsive. dbt, easily deployed in public cloud environments, benefits greatly from community resources, though it could improve in official support channels.
Pricing and ROI: IBM Cloud Pak for Data is considered pricey but provides significant ROI through time savings and data management improvements. dbt, being open-source, is highly affordable and offers great ROI through efficient data transformation and management.
We achieved operational efficiency, and modular SQL and version control improved data quality and governance, which reduced data duplication and data errors.
I have seen a return on investment as it means we don't have to employ as many people.
Since we migrated from SSIS to a dbt model architecture, a full refresh takes only around four hours to complete.
We have been able to drive responsible, transparent, and explainable AI workflows to operationalize AI, mitigate risk, and meet regulatory compliance requirements easily.
It is easy to collect, organize, and analyze data no matter where it is, which lets us make data-driven decisions.
It has given my teams an edge in data management through automation while adhering to compliance regulations.
If you type your question, you will likely find that someone has already asked it, so we do not need to contact their support directly.
I would rate the technical support a nine out of ten.
We ran dbt Core, which is open-source, so there is no direct vendor support.
I rate the technical support from IBM a nine out of ten because the support has been very top-notch, unparalleled, and also very professional.
Cloud Pak is a complicated system, and it's often difficult to find the right resource in IBM to help with specific issues.
The customer support for IBM Cloud Pak for Data is great and responsive.
The bottlenecks that we have are not coming from dbt; they are coming from Snowflake.
We were processing large volumes of financial documents, hundreds of trial balances, balance sheets, and invoice sets, and dbt handled the transformation layer without issues.
dbt is quite scalable since it has its own feature set for incorporating business logic.
I have not noticed any downtime or lag, especially when dealing with large data, so it is quite scalable.
IBM Cloud Pak for Data's scalability is very good; it can be used by any size of organization.
For scalability, I rate it a nine out of ten because it is a very scalable solution that has been able to handle my organization's growth efficiently.
Comparing it to tools I have seen in the past, such as Informatica and Alteryx, dbt can easily match up to that rating, specifically for stability.
Every upgrade is a little bit of a risk for us because we do not know if the workarounds that we developed will be available for the next version.
When I conduct dbt tests, the data processed in the data warehouse performs exactly as expected.
The overall performance of IBM Cloud Pak for Data, particularly with IBM DataStage for ETL processes, is very good.
IBM Cloud Pak for Data is stable.
The tool itself needs improvement in its copilot, in handling outages, in testing, and in quality aspects related to governance and collaboration.
The whole data testing field is not very mature. Unlike software testing, where you have test suites, test tools, and profilers, data testing is not yet that advanced.
dbt does not have a native concept of multi-tenant or multi-standard project organization.
Setting up hybrid and multi-cloud environments is a lengthy job.
IBM Cloud Pak for Data can be improved because processing speeds are sometimes slow.
To improve IBM Cloud Pak for Data, I suggest more out-of-the-box integration.
The course content that dbt provides is free and excellent for anyone starting out.
dbt is open source for its core modules.
I mentioned the cost as one of the advantages, specifically the license cost.
The setup cost is very expensive.
Regarding my experience with pricing, setup cost, and licensing, for a small organization, the price might be relatively high, but for huge enterprises such as ours, the price is relatively affordable.
The list price is high, but the flexibility in pricing is adequate.
dbt has positively impacted my organization by allowing us to create our data pipelines much faster, going from ingestion of data to creating a data product in weeks instead of months.
There are the benefits of having code, so you have a software development lifecycle; you can use version control, testing, and documentation.
The tests, especially custom tests for financial data like validating that debits equal credits, caught a lot of our data quality issues early.
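As an illustration of the kind of check this reviewer describes, here is a minimal sketch in Python using an in-memory SQLite database. It follows the convention of a dbt singular test, where the check passes when the query returns zero rows. The table `journal_lines`, its columns, and the sample data are hypothetical and not taken from the review; amounts are stored as integer cents to avoid floating-point comparison issues.

```python
import sqlite3


def build_sample_ledger():
    """Create a hypothetical in-memory ledger with one balanced and one unbalanced entry."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE journal_lines ("
        "entry_id INTEGER, account TEXT, debit INTEGER, credit INTEGER)"
    )
    conn.executemany(
        "INSERT INTO journal_lines VALUES (?, ?, ?, ?)",
        [
            (1, "cash", 10000, 0),       # entry 1 balances
            (1, "revenue", 0, 10000),
            (2, "inventory", 5000, 0),   # entry 2 is off by 500 cents
            (2, "payables", 0, 4500),
        ],
    )
    return conn


def unbalanced_entries(conn):
    """Return (entry_id, total_debit, total_credit) for entries that do not balance.

    An empty result means the check passes.
    """
    query = """
        SELECT entry_id,
               SUM(debit)  AS total_debit,
               SUM(credit) AS total_credit
        FROM journal_lines
        GROUP BY entry_id
        HAVING SUM(debit) <> SUM(credit)
        ORDER BY entry_id
    """
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    failures = unbalanced_entries(build_sample_ledger())
    for entry_id, total_debit, total_credit in failures:
        print(f"entry {entry_id}: debits={total_debit} credits={total_credit}")
```

In a dbt project, the `SELECT` statement alone would typically live in a SQL file under the tests directory; the Python wrapper here only makes the sketch runnable on its own.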
From there, I can work my way into a more granular level, applying all of that information on top of my actual data to understand what my data looks like, where it came from, and where it went wrong, managing it throughout the cycle.
The benefits of choosing IBM Cognos, in addition to saving on cost, include having institutional knowledge about maintaining this infrastructure and enough people who have developed on Cognos in the past, which creates comfort in its use.
We have been able to save approximately 80 percent of our time. We are not doing data analysis manually, so this relieves our data department of dealing with data.
| Product | Mindshare (%) |
|---|---|
| dbt | 1.4% |
| IBM Cloud Pak for Data | 1.1% |
| Other | 97.5% |


| Company Size | Count |
|---|---|
| Small Business | 2 |
| Midsize Enterprise | 3 |
| Large Enterprise | 6 |

| Company Size | Count |
|---|---|
| Small Business | 10 |
| Large Enterprise | 20 |
dbt is a transformational tool that empowers data teams to quickly build trusted data models, providing a shared language for analysts and engineering teams. Its flexibility and robust feature set make it a popular choice for modern data teams seeking efficiency.
Designed to integrate seamlessly with the data warehouse, dbt enables analytics engineers to transform raw data into reliable datasets for analysis. Its SQL-centric approach reduces the learning curve for users familiar with it, allowing powerful transformations and data modeling without needing a custom backend. While widely beneficial, dbt could improve in areas like version management and support for complex transformations out of the box.
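The in-warehouse, SQL-centric transformation pattern described above can be sketched roughly as follows. This is not dbt itself: it is a small Python illustration that uses SQLite as a stand-in warehouse, loads raw rows unchanged, and then derives a cleaned dataset with a single `SELECT`. The names `raw_orders` and `completed_orders`, and the sample rows, are assumptions made for the example.

```python
import sqlite3

# Hypothetical raw records, loaded as-is (all text, inconsistent formatting).
RAW_ORDERS = [
    ("1001", " Alice ", "19.99", "completed"),
    ("1002", "bob", "5.00", "cancelled"),
    ("1003", "Carol", "42.50", "completed"),
]


def load_raw(conn, rows):
    """Load step: copy source rows into the warehouse without transforming them."""
    conn.execute(
        "CREATE TABLE raw_orders ("
        "order_id TEXT, customer TEXT, amount TEXT, status TEXT)"
    )
    conn.executemany("INSERT INTO raw_orders VALUES (?, ?, ?, ?)", rows)


def build_model(conn):
    """Transform step: define a cleaned dataset as a SQL SELECT over the raw table."""
    conn.execute(
        """
        CREATE VIEW completed_orders AS
        SELECT CAST(order_id AS INTEGER)                          AS order_id,
               LOWER(TRIM(customer))                              AS customer,
               CAST(ROUND(CAST(amount AS REAL) * 100) AS INTEGER) AS amount_cents
        FROM raw_orders
        WHERE status = 'completed'
        """
    )


def run_pipeline():
    """Run load then transform, and return the rows of the cleaned dataset."""
    conn = sqlite3.connect(":memory:")
    load_raw(conn, RAW_ORDERS)
    build_model(conn)
    return conn.execute(
        "SELECT order_id, customer, amount_cents "
        "FROM completed_orders ORDER BY order_id"
    ).fetchall()


if __name__ == "__main__":
    for row in run_pipeline():
        print(row)
```

The point of the sketch is the ordering: data lands in the warehouse first, and the cleaning logic is expressed as SQL that runs where the data already lives.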
What are the most valuable features of dbt?
What benefits should you expect from using dbt?
In the finance industry, dbt helps in cleansing and preparing transactional data for analysis, leading to more accurate financial reporting. In e-commerce, it empowers teams to rapidly integrate and analyze customer behavior data, optimizing marketing strategies and improving user experience.
IBM Cloud Pak for Data is a comprehensive platform integrating data management, AI, and machine learning capabilities tailored for hybrid environments. It's renowned for enhancing productivity through efficient data analytics and management.
This platform offers data virtualization, robust analytics, and AI-driven processes. Its integration capabilities, including IBM MQ and App Connect, facilitate seamless data connections. Users benefit from containerization, data governance, and compatibility with hybrid systems, improving decision-making and management productivity. However, the requirement of extensive infrastructure and performance challenges can impact scalability for small businesses.
What are the key features of IBM Cloud Pak for Data?
In the financial and banking sectors, IBM Cloud Pak for Data is utilized for data management tasks like spend analytics and contract leakage analysis. It's used for data integration, machine learning, and AI-driven analytics to transform data into valuable insights in industries such as FinTech and consultancy.