Try our new research platform with insights from 80,000+ expert users

Informatica Intelligent Data Management Cloud (IDMC) vs StreamSets comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 19, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Informatica Intelligent Dat...
Ranking in Data Integration
3rd
Average Rating
7.8
Reviews Sentiment
6.8
Number of Reviews
182
Ranking in other categories
Data Quality (1st), Business Process Management (BPM) (13th), Business-to-Business Middleware (4th), API Management (8th), Cloud Data Integration (3rd), Data Governance (2nd), Test Data Management (3rd), Cloud Master Data Management (MDM) Solutions (1st), Data Management Platforms (DMP) (1st), Data Masking (2nd), Metadata Management (1st), Test Data Management Services (3rd), Product Information Management (PIM) (1st), Data Observability (2nd)
StreamSets
Ranking in Data Integration
15th
Average Rating
8.4
Reviews Sentiment
7.0
Number of Reviews
21
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of April 2025, in the Data Integration category, the mindshare of Informatica Intelligent Data Management Cloud (IDMC) is 4.6%, down from 7.5% compared to the previous year. The mindshare of StreamSets is 1.6%, up from 1.3% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration
 

Featured Reviews

Raj Sethupathi - PeerSpot reviewer
Offers profiling and address standardization but can be complicated
Informatica Data Quality has its data warehouse, primarily using Oracle and some SQL databases. You need a database to host the data. The cleansed version of the data is stored in the data warehouse. It integrates with PowerCenter and other Informatica tools. The integration details can be complex, but a regional setup is involved in this process. Profiling smaller datasets, such as 10,000-50,000 records, worked fine. However, unexpected issues could arise with larger datasets, such as thousands of records or more, especially with tables containing many columns. Handling tables with fifty or more columns can be challenging, even in Excel. A mismatch in data types could cause the entire system to crash. Continual enhancements are being made to address these issues, which can be unique to specific industries like finance and healthcare.
Nantabo Jackie - PeerSpot reviewer
Simplified pipelines and helped us break down data silos within our organization
The design experience when implementing batch streaming or ECL pipelines is very easy and straightforward. When we initially attempted to integrate StreamSets with Kafka, it was somewhat challenging until we consulted the documentation, after which it became straightforward. We use StreamSets to move data into modern analytics platforms. Moving the data into modern analytics platforms is still complex. It requires a lot of understanding of logic. StreamSets enables us to build data pipelines without knowing how to code. StreamSets' ability to build data pipelines without requiring us to know complex programming is very important, as it allows us to focus on our projects without spending time writing code. StreamSets' Transformer for Snowflake is simple to use for designing both simple and complex transformation logic. StreamSets' Transformer for Snowflake is extremely important to me as it helps me to connect external data sources and keep my internal workflow organized. Transformer for Snowflake's functionality is a perfect ten out of ten. It is important and cost-effective that Transformer for Snowflake is a serverless engine embedded within the platform, as without this feature, it would be very expensive. This feature helps us to sell at lower budget costs, which would otherwise be at a high cost with other servers. StreamSets has helped improve our organization. StreamSets simplified pipelines for our organization. It is easier to complete a project when we know where and how to start, and working with the team remotely makes it more efficient. This helps us to save time and be more organized when creating data pipelines. Being a structured company that produces reliable resources for our application benefits both our clients and contacts. StreamSets' built-in data drift resilience plays a part in our ETL operations. With prior knowledge, the built-in data drift resilience is very effective, but it can be challenging to implement without the preexisting knowledge. The built-in data drift resilience reduced the time it takes us to fix data drift breakages by 45 percent. StreamSets helped us break down data silos within our organization. The use of StreamSets to break down data silos enabled us to be confident in the services and products we provide, as well as the real-time streaming we offer. This has had a positive impact on our business, as it allowed us to accurately determine the analytics we need to present to stakeholders, clients, and our sources while ensuring that the process is secure and transparent. StreamSets saved us time because anyone can use StreamSets not just developers. We can save around 40 percent of our time. StreamSets' reusable assets helped us reduce workload by around 25 percent. StreamSets saved us money by not having to hire developers with specialized skills. We saved around $2,000 US. StreamSets helped us scale our data operations. Since StreamSets makes it easy to scale our data operations, it enabled us to know exactly where to start at any time. We are aware of the timeline for completing the project, and depending on our familiarity with the software, we can come up with a solution quickly.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Informatica MDM has a defined data model we can customize with user and developer options."
"It's good for tool management, maintaining the golden record of customer status."
"In the latest version, I like the outlay of the business roles creation. I like seeing that visualization as you're building it, as opposed to going through metatables or XML mappings. We liked that piece, and it makes it more intuitive for the business."
"The most valuable features are the structure masking and platform masking."
"The most valuable features are data quality, data integrate transformations, match-merge, and a few MDM solutions we build into data quality transformation."
"A great product enrichment tool."
"I think the integration feature is probably one of the key features in Informatica MDM...Stability-wise, I rate the solution a ten out of ten."
"The profiling feature in Informatica Data Quality is incredibly effective for data governance."
"StreamSets Transformer is a good feature because it helps you when you are developing applications and when you don't want to write a lot of code. That is the best feature overall."
"The entire user interface is very simple and the simplicity of creating pipelines is something that I like very much about it. The design experience is very smooth."
"StreamSets is the leader in the market."
"The ETL capabilities are very useful for us. We extract and transform data from multiple data sources, into a single, consistent data store, and then we put it in our systems. We typically use it to connect our Apache Kafka with data lakes. That process is smooth and saves us a lot of time in our production systems."
"The best feature that I really like is the integration."
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
"The scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy."
"I really appreciate the numerous ready connectors available on both the source and target sides, the support for various media file formats, and the ease of configuring and managing pipelines centrally."
 

Cons

"I would like to see support for more data sources."
"Inserting the GenAI into the master data management will reduce the overall effort of operational activities."
"I would rate my experience with the initial setup a two out of ten, with one being difficult and ten being easy."
"Interoperability is one area where EDC has room for improvement. It was challenging when the faculty took over the data world and had specific vendors they wanted to use, and some were not particularly open platforms."
"I think they should work really hard on UI."
"Informatica MDM could improve the interdependency with integration. The solution sometimes becomes a bit difficult to change considering a lot of interdependency with the integration. There can be some improvement in the workflows and they can introduce more artificial intelligence."
"I would also like to have profiling functionalities and quality transformations in the cloud."
"The ability of the product to leverage the power of big data could be improved."
"One area for improvement could be the cloud storage server speed, as we have faced some latency issues here and there."
"StreamSet works great for batch processing but we are looking for something that is more real-time. We need latency in numbers below milliseconds."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"The documentation is inadequate and has room for improvement because the technical support does not regularly update their documentation or the knowledge base."
"There aren't enough hands-on labs, and debugging is also an issue because it takes a lot of time. Logs are not that clear when you are debugging, and you can only select a single source for a pipeline."
"One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infrastructure."
"Sometimes, it is not clear at first how to set up nodes. A site with an explanation of how each node works would be very helpful."
"Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."
 

Pricing and Cost Advice

"Its pricing model can be improved."
"I rate Informatica MDM's price a six on a scale of one to ten, where one is a low price, and ten is a high price."
"The price is neither too high nor too low."
"I rate the product's pricing a five on a scale of one to ten, where one is cheap and ten is expensive."
"Informatica MDM is a costly solution because it comes as a bundle. They are also globally positioning themselves and are definitely working on very upgraded technologies. If someone wanted to do it on the cloud, they have a lot of flexibility because they upgrade themselves according to the current needs. It definitely comes with a lot of features and that's the reason why it's costly. The licensing cost should be approximately one million dollars. It's about four to five times that of other vendors."
"Cost-wise, I think it is on the higher side, and that is why we are looking for some better options. Licensing costs are huge compared to other players in the market and for my company."
"The pricing is high compared to other tools on the market."
"Our customers sometimes are able to negotiate a much better price for Informatica Cloud Data Integration based on their relationship with the vendor."
"The pricing is affordable for any business."
"StreamSets is an expensive solution."
"There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
"The overall cost is very flexible so it is not a burden for our organization... However, the cost should be improved. For small and mid-size organizations it might be a challenge."
"We are running the community version right now, which can be used free of charge."
"StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
"We use the free version. It's great for a public, free release. Our stance is that the paid support model is too expensive to get into. They should honestly reevaluate that."
"It's not so favorable for small companies."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
845,406 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
17%
Computer Software Company
12%
Manufacturing Company
10%
Government
6%
Financial Services Firm
14%
Computer Software Company
11%
Manufacturing Company
10%
Insurance Company
8%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

How does Azure Data Factory compare with Informatica Cloud Data Integration?
Azure Data Factory is a solid product offering many transformation functions; It has pre-load and post-load transformations, allowing users to apply transformations either in code by using Power Q...
Which Informatica product would you choose - PowerCenter or Cloud Data Integration?
Complex transformations can easily be achieved using PowerCenter, which has all the features and tools to establish a real data governance strategy. Additionally, PowerCenter is able to manage huge...
What are the biggest benefits of using Informatica Cloud Data Integration?
When it comes to cloud data integration, this solution can provide you with multiple benefits, including: Overhead reduction by integrating data on any cloud in various ways Effective integration ...
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which ...
What is your primary use case for StreamSets?
StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data...
 

Also Known As

ActiveVOS, Active Endpoints, BPM, Address Verification, Persistent Data Masking, Cloud Test Data Management, PIM, , Enterprise Data Catalog, Data Integration Hub, Cloud Data Integration, Data Quality, Cloud API and App Integration
No data available
 

Overview

 

Sample Customers

The Travel Company, Carbonite
Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about Informatica Intelligent Data Management Cloud (IDMC) vs. StreamSets and other solutions. Updated: February 2025.
845,406 professionals have used our research since 2012.