IBM InfoSphere DataStage is a data integration tool for designing, developing, and running jobs that move and transform data for organizations of different sizes. It integrates data across multiple systems through a high-performance parallel framework, and it supports extended metadata management, enterprise connectivity, and integration of all types of data.
Product | Market Share (%) |
---|---|
IBM InfoSphere DataStage | 4.6% |
Informatica PowerCenter | 8.1% |
Azure Data Factory | 7.4% |
Other | 79.9% |
Type | Title | Date |
---|---|---|
Category | Data Integration | Aug 28, 2025 |
Product | Reviews, tips, and advice from real users | Aug 28, 2025 |
Comparison | IBM InfoSphere DataStage vs Azure Data Factory | Aug 28, 2025 |
Comparison | IBM InfoSphere DataStage vs SSIS | Aug 28, 2025 |
Comparison | IBM InfoSphere DataStage vs Informatica PowerCenter | Aug 28, 2025 |
Title | Rating | Mindshare | Recommending | Interviews |
---|---|---|---|---|
Informatica Intelligent Data Management Cloud (IDMC) | 4.0 | 4.3% | 93% | 186 |
Azure Data Factory | 4.0 | 7.4% | 92% | 92 |
Company Size | Count |
---|---|
Small Business | 15 |
Midsize Enterprise | 4 |
Large Enterprise | 20 |
Company Size | Count |
---|---|
Small Business | 213 |
Midsize Enterprise | 134 |
Large Enterprise | 902 |
The solution is the data integration component of IBM InfoSphere Information Server and provides a graphical framework for moving data from source systems to target systems. IBM InfoSphere DataStage can deliver data to data warehouses, data marts, operational data stores, and other enterprise applications. It supports both extract, transform, and load (ETL) and extract, load, and transform (ELT) patterns. The platform scales through parallel processing and enterprise connectivity.
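The difference between the ETL and ELT patterns mentioned above can be illustrated with a minimal sketch. This is plain Python with the standard-library `sqlite3` module, not DataStage code or any IBM API; the sample rows, the table names (`orders_etl`, `orders_raw`, `orders_elt`), and the helper functions are all hypothetical and chosen only for illustration. In the ETL path the rows are cleaned before they reach the target; in the ELT path the raw rows are loaded first and the target database does the cleaning with SQL.

```python
import sqlite3

# Hypothetical source rows: (customer name, order amount as text).
SOURCE_ROWS = [(" alice ", "10.50"), ("BOB", "3"), ("carol", "7.25")]


def clean(name, amount):
    """Normalize one raw row: trim and capitalize the name, parse the amount."""
    return name.strip().capitalize(), float(amount)


def run_etl(conn):
    """ETL: transform rows outside the target, then load the cleaned result."""
    conn.execute("CREATE TABLE orders_etl (name TEXT, amount REAL)")
    transformed = [clean(name, amount) for name, amount in SOURCE_ROWS]
    conn.executemany("INSERT INTO orders_etl VALUES (?, ?)", transformed)


def run_elt(conn):
    """ELT: load raw rows as-is, then transform inside the target with SQL."""
    conn.execute("CREATE TABLE orders_raw (name TEXT, amount TEXT)")
    conn.executemany("INSERT INTO orders_raw VALUES (?, ?)", SOURCE_ROWS)
    conn.execute(
        """
        CREATE TABLE orders_elt AS
        SELECT UPPER(SUBSTR(TRIM(name), 1, 1)) || LOWER(SUBSTR(TRIM(name), 2)) AS name,
               CAST(amount AS REAL) AS amount
        FROM orders_raw
        """
    )


def compare():
    """Run both patterns against one in-memory database and return both results."""
    conn = sqlite3.connect(":memory:")
    try:
        run_etl(conn)
        run_elt(conn)
        etl_rows = conn.execute(
            "SELECT name, amount FROM orders_etl ORDER BY name"
        ).fetchall()
        elt_rows = conn.execute(
            "SELECT name, amount FROM orders_elt ORDER BY name"
        ).fetchall()
        return etl_rows, elt_rows
    finally:
        conn.close()


if __name__ == "__main__":
    etl_rows, elt_rows = compare()
    print("ETL result:", etl_rows)
    print("ELT result:", elt_rows)
```

Both paths end with the same cleaned rows; what differs is where the transformation work runs, which is the main consideration when choosing between the two patterns for a given target system.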
The solution comes in several editions that cater to different types of companies: the Server Edition, the Enterprise Edition, and the MVS Edition. The goals a company can achieve depend on the edition it has bought. They include the following:
IBM InfoSphere DataStage can be deployed in various ways, including:
IBM InfoSphere DataStage Features
The tool has various features through which users can integrate and utilize their data effectively. The components of IBM InfoSphere DataStage include:
IBM InfoSphere DataStage Benefits
This solution offers many benefits for the companies that utilize it for data integration. Some of these benefits include:
Reviews from Real Users
A data/solution architect at a computer software company says the product is robust, easy to use, has a simple error logging mechanism, and works very well for huge volumes of data.
Tirthankar Roy Chowdhury, team leader at Tata Consultancy Services, feels the tool is user-friendly with a lot of functionalities, and doesn't require much coding because of its drag-and-drop features.
Sample customers: Dubai Statistics Center, Etisalat Egypt
Author info | Rating | Review Summary |
---|---|---|
Senior Officer at State Bank of India | 3.5 | We use IBM InfoSphere DataStage for ETL, valuing its stability, scalability, and strong support crucial for our financial organization's security needs. However, improvements are needed in connectivity with big data technologies like Spark, compared to older RDBMS systems. |
Sr Product Manager at a computer software company with 501-1,000 employees | 4.0 | No summary available |
Manager - Business Technology Solutions at a consultancy with 1,001-5,000 employees | 4.5 | IBM InfoSphere DataStage is a leading ETL tool, known for being cost-effective and user-friendly, transforming large volumes of data efficiently. However, it needs improved logging and troubleshooting features. Alternatives like Informatica are more expensive, while newer tools have emerged. |
Associate Manager at a consultancy with 10,001+ employees | 4.0 | I use IBM InfoSphere DataStage for ETL processes and data quality. Its valuable connectors and excellent debugging capabilities stand out, though I heard support will end by 2026, prompting a move to Cloud Pak for Data for continued support. |
Industrial IoT Architect at Xignux SA de CV | 3.5 | We use IBM InfoSphere DataStage for ETL tasks, valuing its ability to manage large record volumes. It excels in batch data integration but could improve in integrating with modern data sources. We considered Azure Data Factory before selecting DataStage. |
Senior Data Architect at Anadolu Sigorta | 4.0 | I used IBM InfoSphere DataStage for data integration and management across various sectors, appreciating its robust capabilities and unified interface. However, deployment is complex, and high costs impact ROI, despite a 200% performance improvement after optimization. |
BI Architect at a healthcare company with 10,001+ employees | 4.0 | No summary available |
Solution Architect - Data Engineering at Tenx | 3.5 | I integrated multiple data sources into a single data warehouse using IBM DataStage. The Transformer is highly valuable for complex transformations, but it lacks some features like custom code integration seen in Talend, which needs improvement. |