

Teradata and IBM InfoSphere DataStage compete in the data management category. Teradata is favored for its superior handling of large-scale data tasks and pricing flexibility, while IBM InfoSphere DataStage is appreciated for being affordable for small to medium enterprises.
Features: Teradata excels with massively parallel processing capabilities, stability, and scalability, making it ideal for large data operations. Its robust security and redundancy further enhance its appeal for critical missions. DataStage is known for strong data integration capabilities, versatile transformation options, and effective metadata management, offering ease of integration with other toolsets and leveraging its ETL strengths.
Room for Improvement: Teradata could improve cost-effectiveness and adaptability to unstructured data and cloud environments. Users find its on-premises scalability and documentation needing enhancements. DataStage requires better cloud connectivity and simpler setup and management of complex ETL processes. Its documentation and accessibility also need improvement for ease of use.
Ease of Deployment and Customer Service: Teradata supports diverse environments, including on-premises, cloud, and hybrid, and offers commendable customer support. However, it faces challenges in initial setup complexity and documentation quality. DataStage predominantly serves on-premises with hybrid capabilities, praised for its support but needing a more user-friendly interface and improved documentation.
Pricing and ROI: Teradata is often seen as costly but justified by its output, suitable for large enterprises with significant ROIs. DataStage is an economical option, appealing for tighter budgets, delivering solid data management, although it may lack some advanced functionalities compared to competitors.
At least fifteen to twenty percent of our time has been saved using Teradata, which has positively affected team productivity and business outcomes.
Independent research showed that Teradata VantageCloud users achieved an average ROI of 427% across three years with payback under a year, demonstrating the platform's ability to deliver a strong financial return.
We have realized a return on investment, with a reduction of staff from 27 to eight, and our current return on investment is approximately 14%.
We also have the flexibility to submit a feature request to be included as part of the wishlist, potentially becoming a product feature in subsequent releases.
I rate their support as nine on a scale from one to ten.
IBM tech support has allocated dedicated resources, making it satisfactory.
The customer support for Teradata has been great.
They are responsive and knowledgeable, and the documentation is very helpful.
Customer support is very good, rated eight out of ten under our essential agreement.
If the job provided suggestions about running this kind of parallel processing and how many virtual nodes are required, it would help.
Whenever we need more resources, we can add that in Teradata, and when not needed, we can scale it down as well.
This flexibility allows organizations to scale according to their needs, balancing performance, cost, and compliance requirements.
This expansion can occur without incurring downtime or taking systems offline.
Its massively parallel process architecture allows the platform to distribute workload efficiently, enabling organizations to run heavy analytic queries without compromising speed or stability.
I find the stability to be almost a ten out of ten.
The workload management and software maturity provide a reliable system.
If the job itself gave some guidance, such as running this parallel processing with this many nodes, it would help; I think that is missing.
I wonder if it supports other areas, such as cloud environments with open source support, or EdgeShift.
The solution needs improvement in connectivity with big data technologies such as Spark.
I want to highlight two features for improvement: first, storing data in various formats without requiring a tabular structure, accommodating unstructured data; and second, adding AI ML features to better integrate Gen AI, LLM concepts, and user-friendly experiences such as text-to-SQL capabilities.
Unlike SQL and Oracle, which have in-built replication capabilities, we don't have similar functionality with Teradata.
The most challenging aspect is finding Teradata resources, so we are focusing on internal training and looking for more Teradata experts.
Pricing for IBM InfoSphere DataStage is moderate and not much expensive.
Teradata is much more expensive than SQL, which is well-performed and cheaper.
Initially, it may seem expensive compared to similar cloud databases, however, it offers significant value in performance, stability, and overall output once in use.
Role-based access control (RBAC), strong audit and compliance features, high availability, fault tolerance, and encrypted data at rest and in-transit are key features.
It is straightforward from a design and development perspective, and also for deployment.
As we are a financial organization, security is our main concern, so we prefer enterprise tools.
I have leveraged IBM InfoSphere DataStage's integration with IBM's Information Server suite, and it is indeed beneficial.
Teradata's security helps our organization meet compliance requirements such as GDPR and IFRS, and it is particularly essential for revenue contracting or revenue recognition.
Its architecture allows information to be processed efficiently while maintaining stable performance, even in highly demanding environments.
It facilitates data integration, where we integrate and analyze data from various sources, making it a powerful and high-quality reliable solution for the company.
| Product | Mindshare (%) |
|---|---|
| IBM InfoSphere DataStage | 1.9% |
| Teradata | 0.9% |
| Other | 97.2% |

| Company Size | Count |
|---|---|
| Small Business | 23 |
| Midsize Enterprise | 4 |
| Large Enterprise | 26 |
| Company Size | Count |
|---|---|
| Small Business | 28 |
| Midsize Enterprise | 13 |
| Large Enterprise | 52 |
IBM InfoSphere DataStage is a high-quality data integration tool that aims to design, develop, and run jobs that move and transform data for organizations of different sizes. The product works by integrating data across multiple systems through a high-performance parallel framework. It supports extended metadata management, enterprise connectivity, and integration of all types of data.
The solution is the data integration component of IBM InfoSphere Information Server, providing a graphical framework for moving data from source systems to target systems. IBM InfoSphere DataStage can deliver data to data warehouses, data marts, operational data sources, and other enterprise applications. The tool works with various types of patterns - extract, transform and load (ETL), and extract, load, and transform (ELT). The scalability of the platform is achieved by using parallel processing and enterprise connectivity.
The solution has various versions, catering to different types of companies, which include the Server Edition, the Enterprise Edition, and the MVS Edition. Depending on which version a company has bought, different goals can be achieved. They include the following:
IBM InfoSphere DataStage can be deployed in various ways, including:
IBM InfoSphere DataStage Features
The tool has various features through which users can integrate and utilize their data effectively. The components of IBM InfoSphere DataStage include:
IBM InfoSphere DataStage Benefits
This solution offers many benefits for the companies that utilize it for data integration. Some of these benefits include:
Reviews from Real Users
A data/solution architect at a computer software company says the product is robust, easy to use, has a simple error logging mechanism, and works very well for huge volumes of data.
Tirthankar Roy Chowdhury, team leader at Tata Consultancy Services, feels the tool is user-friendly with a lot of functionalities, and doesn't require much coding because of its drag-and-drop features.
Teradata is a powerful tool for handling substantial data volumes with its parallel processing architecture, supporting both cloud and on-premise environments efficiently. It offers impressive capabilities for fast query processing, data integration, and real-time reporting, making it suitable for diverse industrial applications.
Known for its robust parallel processing capabilities, Teradata effectively manages large datasets and provides adaptable deployment across cloud and on-premise setups. It enhances performance and scalability with features like advanced query tuning, workload management, and strong security. Users appreciate its ease of use and automation features which support real-time data reporting. The optimizer and intelligent partitioning help improve query speed and efficiency, while multi-temperature data management optimizes data handling.
What are the key features of Teradata?
What benefits and ROI do users look for?
In the finance, retail, and government sectors, Teradata is employed for data warehousing, business intelligence, and analytical processing. It handles vast datasets for activities like customer behavior modeling and enterprise data integration. Supporting efficient reporting and analytics, Teradata enhances data storage and processing, whether deployed on-premise or on cloud platforms.
We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.