Try our new research platform with insights from 80,000+ expert users
Katarzyna Palikowska - PeerSpot reviewer
ETL Developer at Det Norske Veritas
Real User
Stable, scalable solution that's great for copying data
Pros and Cons
  • "Data Factory's most valuable feature is Copy Activity."
  • "Data Factory's cost is too high."

What is our primary use case?

I mainly use Data Factory to load data for ETL processes or to Azure Storage and for testing purposes in our business unit.

What is most valuable?

Data Factory's most valuable feature is Copy Activity.

For how long have I used the solution?

I've been using Data Factory for around two years.

What do I think about the stability of the solution?

Data Factory is stable.

Buyer's Guide
Azure Data Factory
September 2025
Learn what your peers think about Azure Data Factory. Get advice and tips from experienced pros sharing their opinions. Updated: September 2025.
868,759 professionals have used our research since 2012.

What do I think about the scalability of the solution?

We've had no problems with Data Factory's scalability.

How are customer service and support?

Microsoft's technical support is responsive and quick to help.

What about the implementation team?

We used consultants to implement Data Factory.

What's my experience with pricing, setup cost, and licensing?

Data Factory's cost is too high.

What other advice do I have?

I would advise anybody thinking of implementing Data Factory to calculate their costs at the initial stage in order to have some knowledge about future costs for the whole project. I would rate Data Factory as eight out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Charles Nordine - PeerSpot reviewer
Senior Partner at Collective Intelligence
Real User
Visual, works very well, and makes data ingestion easier
Pros and Cons
  • "The data mapping and the ability to systematically derive data are nice features. It worked really well for the solution we had. It is visual, and it did the transformation as we wanted."
  • "For some of the data, there were some issues with data mapping. Some of the error messages were a little bit foggy. There could be more of a quick start guide or some inline examples. The documentation could be better."

What is our primary use case?

We created data ingestion solutions. We have built interpreters, and we have data factories that pull data from our clients. They submit data via Excel spreadsheets, and we process them into a common homogeneous format.

How has it helped my organization?

It has helped with some automation. Instead of individual people reviewing these files, we were able to automate the ingestion process, which saved a bunch of time. It saved hours of repeated manual work.

What is most valuable?

The data mapping and the ability to systematically derive data are nice features. It worked really well for the solution we had. It is visual, and it did the transformation as we wanted.

What needs improvement?

I couldn't quite grasp it at first because it has a Microsoft footprint on it. Some of the nomenclature around sync and other things is based on how SSRS or SSIS works, which works fine if you know these products. I didn't know them. So, some of the language and some of the settings were obtuse for me to use. It could be a little difficult if you're coming from the Java or AWS platform, but if you are coming from a Microsoft background, it would be very familiar.

For some of the data, there were some issues with data mapping. Some of the error messages were a little bit foggy. There could be more of a quick start guide or some inline examples. The documentation could be better.

There were some latency and performance issues. The processing time took slightly longer than I was hoping for. I wasn't sure if that was a licensing issue or construction of how we did the product. It wasn't super clear to me why and how those occurred. There was think time between steps. I am not sure if they can reduce the latency there. 

For how long have I used the solution?

I have been using this solution for a year and a half.

What do I think about the stability of the solution?

It is very stable.

What do I think about the scalability of the solution?

It is very scalable. It is a cloud product. It is being used by business analysts, business managers, and Azure cloud architects. We have just one developer/integrator for deployment and maintenance purposes.

We have plans to increase its usage. We'll be rolling it out for other clients.

How are customer service and support?

Microsoft has these things well-documented. There were videos. I was able to find answers when I needed them. To the uninitiated, it was a little difficult, but we got there.

How was the initial setup?

It was of medium complexity. Because it goes to the cloud, the duration was short. The deployment was minutes and hours.

What other advice do I have?

We are a consultant and integrator. You can use our company for its implementation.

I would rate this solution a nine out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure
Disclosure: My company has a business relationship with this vendor other than being a customer. Consultant/Integrator
PeerSpot user
Buyer's Guide
Azure Data Factory
September 2025
Learn what your peers think about Azure Data Factory. Get advice and tips from experienced pros sharing their opinions. Updated: September 2025.
868,759 professionals have used our research since 2012.
reviewer1286736 - PeerSpot reviewer
IT Analyst at a tech vendor with 10,001+ employees
Real User
Improved data resilience, in the way that we move data from on-prem to the cloud and vice versa
Pros and Cons
  • "The most important feature is that it can help you do the multi-threading concepts."
  • "There should be a way that it can do switches, so if at any point in time I want to do some hybrid mode of making any data collections or ingestions, I can just click on a button."

What is our primary use case?

It's a PaaS service. It's a hybrid solution. The cloud provider is Microsoft.

We are not using Azure Data Factory as for users. Rather, we're using it as a process base. We're just using it for orchestration, not for any kind of ETL stuff.

We have plans to increase usage. It's going to take a major role in any kind of traditional data warehousing. It has big potential, especially as a PaaS offering.

How has it helped my organization?

There has been improvement in data resilience, in the way that we're moving the data from on-prem to cloud and vice versa.

What is most valuable?

The most important feature is that it can help you do the multi-threading concepts. It's in Informatica, but the resourcing is quite robust. You can scale up and scale down as per your needs.

What needs improvement?

There should be a way that it can do switches, so if at any point in time I want to do some hybrid mode of making any data collections or ingestions, I can just click on a button. I can change a switch and make sure a batch can be a streaming process.

For how long have I used the solution?

I've been using Azure Data Factory for more than two years.

What do I think about the stability of the solution?

The stability of Azure as a PaaS could be improved.

What do I think about the scalability of the solution?

It's scalable.

How are customer service and support?

I would rate their technical support 3 out of 5. It's not great, but it isn't bad.

How was the initial setup?

The setup is complex. It has nothing to do with the technology but with the design. We were wondering how to leverage the orchestration layer where we are having the Azure Data Factory and how to integrate with the Databricks. That's where we had some challenges in terms of choosing the right product.

What about the implementation team?

You can do deployment in-house. 

What other advice do I have?

I would rate this solution 8 out of 10.

For someone who is looking to use this solution, my advice is to do proper due diligence of your current application, know where your application is fitting, and look for the requirements. It all depends upon the current use case that you have currently in your system.

Which deployment model are you using for this solution?

Hybrid Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Richard Griffin - PeerSpot reviewer
Manager Data & Analytics at Fletcher Building
Real User
Simple to use, good performance, and competitive pricing
Pros and Cons
  • "The most valuable feature of this solution would be ease of use."
  • "It does not appear to be as rich as other ETL tools. It has very limited capabilities."

What is our primary use case?

I am a manager of a team that uses this solution.

Azure Data Factory is primarily used for data integration, which involves moving data from sources into a data lake house called Delta Lake.

What is most valuable?

It's fairly simple to use. The most valuable feature of this solution would be ease of use.

What needs improvement?

It does not appear to be as rich as other ETL tools. It has very limited capabilities. It simply moves data around. It's not very good after that because it's taking the data to the next level and modeling it.

For how long have I used the solution?

I have been working with Azure Data Factory for less than a year.

I would say that we are working with the latest version.

What do I think about the stability of the solution?

The stability of Azure Data Factory is good. The performance is good.

What do I think about the scalability of the solution?

I haven't had to scale this solution as of yet.

We have six people in our company who use this solution.

Increasing the usage is not on our strategy pathway.

How are customer service and support?

I have not contacted technical support. I have not required any yet.

I have had very little contact with Microsoft support, but it's been good.

Which solution did I use previously and why did I switch?

I have also worked with Talend. I didn't switch products, but rather companies.

Talend is a more robust enterprise-wide solution that can handle everything from start to finish, whereas Azure Data Factory is more of an ingestion tool.

How was the initial setup?

I was not involved with the initial setup.

What about the implementation team?

We are an enterprise that uses an integrator.

It does not require any maintenance, it's simple.

What's my experience with pricing, setup cost, and licensing?

I don't see a cost; it appears to be included in general support. I have been told that you have to be very careful because it can blow out. I have not experienced it yet, but I've been warned that as Azure ingestion increases, the costs can rise.

In my opinion, the price is competitive.

What other advice do I have?

It's a good tool, a good product that does what it's supposed to do well, which is ingesting data from a source to your target, to another cloud, to another source. Just be conscious to monitor your costs.

I would rate Azure Data Factory an eight out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
reviewer1404414 - PeerSpot reviewer
IT Functional Analyst at a energy/utilities company with 1,001-5,000 employees
Real User
Is easy to use and is highly scalable
Pros and Cons
  • "The two most valuable features of Azure Data Factory are that it's very scalable and that it's also highly reliable."
  • "One area for improvement is documentation. At present, there isn't enough documentation on how to use Azure Data Factory in certain conditions. It would be good to have documentation on the various use cases."

What is our primary use case?

We are currently using it as an ETL (Extract, Transform, and Load) tool. We are using it to connect to various information providers or, in general, to various sources, to extract data, and then to insert it to our storage devices, databases, or data warehouses.

What is most valuable?

The two most valuable features of Azure Data Factory are that it's very scalable and that it's also highly reliable.

What needs improvement?

One area for improvement is documentation. At present, there isn't enough documentation on how to use Azure Data Factory in certain conditions. It would be good to have documentation on the various use cases.

Sometimes, it's really difficult to find the answers to very technical questions regarding certain conditions.

For how long have I used the solution?

I've been using Azure Data Factory since 2019.

What do I think about the stability of the solution?

It has been stable so far.  

What do I think about the scalability of the solution?

Azure Data Factory is a very scalable solution. Including internal developers and external consultants working for us, we have about 10-15 people using this solution.

Which solution did I use previously and why did I switch?

We had been using various ETL tools during the years before moving to the cloud. We picked Azure Data Factory because we were moving towards the Azure cloud.

How was the initial setup?

The initial setup is very easy.

What about the implementation team?

We used a consultant as it was a big project. We had five to six specialists, including both internal and external employees, working on it. It took about about three to six months to complete.

What other advice do I have?

Azure Data Factory is a very easy to use tool. If you want to extract, manipulate, and load data to any type of Azure repository, I recommend this solution. However, I would not recommend it if the manipulation of data is very deep and complicated.

I would rate this solution at eight on a scale from one to ten.

Which deployment model are you using for this solution?

Private Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Technical Director, Senior Cloud Solutions Architect (Big Data Engineering & Data Science) at NorthBay Solutions
Consultant
Great for gathering data and pipeline orchestration; much improved monitoring feature
Pros and Cons
  • "An excellent tool for pipeline orchestration."
  • "The solution needs to be more connectable to its own services."

What is our primary use case?

We generally implement this product for data transformation for our clients. We create the pipelines and provide training before handing it over to them. We generally deal with large-scale organizations. I'm a senior solutions architect. 

How has it helped my organization?

I think the main benefit of this solution is the ease of use, especially for companies that have come from an SSIS type of background where they are used to Microsoft tools. 

What is most valuable?

If you have a very simple pipeline you can use Data Factory for transformations, but it's really for serious analytics. This is an excellent tool for pipeline orchestration; connecting the different components and activities as well as gathering data. It's an orchestration tool, not a transformation tool. The monitoring feature has drastically improved.

What needs improvement?

Data Factory is embedded in the new Synapse Analytics. The problem is if you're using the core Data Factory, you can't call a notebook within Synapse. It's possible to call Databricks from Data Factory, but not the Spark notebook and I don't understand the reason for that restriction. To my mind, the solution needs to be more connectable to its own services.

There is a list of features I'd like to see in the next release, most of them related to oversight and security. AWS has a lake builder, which basically enforces the whole oversight concept from the start of your pipeline but unfortunately Microsoft hasn't yet implemented a similar feature.

For how long have I used the solution?

I've been using this solution for five years. 

What do I think about the stability of the solution?

From what I've seen this is a stable solution. 

What do I think about the scalability of the solution?

The solution is easy to scale keeping in mind that Data Factory doesn't do any computations. We use it mainly to push the computations to Databricks or Synapse. Projects with our clients generally last a few months and only until they go into production. I believe the ability to increase is always there.

How are customer service and support?

We typically do not use customer support, but there were a few cases several years ago as the product was moving to the cloud that things were not so stable and we contacted support services - they were very good. 

Which solution did I use previously and why did I switch?

When I first started in this field, everything was basically Hadoop on-premise and Hadoop infrastructure. With the increase in cloud integrations, things have changed. Once the big data services got introduced, we were probably one of the few companies in North America that were actually into analytics and big data and we were the first to implement related Microsoft products in Canada.

How was the initial setup?

The initial setup is straightforward. I'm a huge fan and user of CI/CD pipelines and never do deployments manually. It's all automated and deployment takes a few minutes.

What's my experience with pricing, setup cost, and licensing?

Licensing costs of Data Factory are reasonable. The cost is mainly on the Synapse and Databricks side of things because they are the tools where the computations are done and where you need more nodes and servers.

What other advice do I have?

It's important to study the solution before purchasing it. The problem in this market is that because most users are generally not very knowledgeable, they typically fall for services that are not compatible with their use case. Data Factory comes with all the transformations but that doesn't work for serious analytics customers who generally need to resort to Databricks or Synapse which involves training and education. Since it's a new field and everything has just blasted off, it's very hard for people to catch on.

In my opinion, Airflow still ranks as number one but I would rate Data Factory an eight out of 10. 

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Chief Technology Officer at cornerstone defense
Real User
Easy to bring in outside capabilities, flexible, and works well
Pros and Cons
  • "It is very modular. It works well. We've used Data Factory and then made calls to libraries outside of Data Factory to do things that it wasn't optimized to do, and it worked really well. It is obviously proprietary in regards to Microsoft created it, but it is pretty easy and direct to bring in outside capabilities into Data Factory."
  • "There is always room to improve. There should be good examples of use that, of course, customers aren't always willing to share. It is Catch-22. It would help the user base if everybody had really good examples of deployments that worked, but when you ask people to put out their good deployments, which also includes me, you usually got, "No, I'm not going to do that." They don't have enough good examples. Microsoft probably just needs to pay one of their partners to build 20 or 30 examples of functional Data Factories and then share them as a user base."

What is our primary use case?

Our customers use it for data analytics on a large volume of data. So, they're basically bringing data in from multiple sources, and they are doing ETL extraction, transformation, and loading. Then they do initial analytics, populate a data lake, and after that, they take the data from the data lake into more on-premise complex analytics.

Its version depends on a customer's environment. Sometimes, we use the latest version, and sometimes, we use the previous versions.

What is most valuable?

It is very modular. It works well. We've used Data Factory and then made calls to libraries outside of Data Factory to do things that it wasn't optimized to do, and it worked really well. It is obviously proprietary in regards to Microsoft created it, but it is pretty easy and direct to bring in outside capabilities into Data Factory.

It is very flexible. You can build any features you want.

What needs improvement?

There is always room to improve. There should be good examples of use that, of course, customers aren't always willing to share. It is Catch-22. It would help the user base if everybody had really good examples of deployments that worked, but when you ask people to put out their good deployments, which also includes me, you usually got, "No, I'm not going to do that." They don't have enough good examples. Microsoft probably just needs to pay one of their partners to build 20 or 30 examples of functional Data Factories and then share them as a user base.

For how long have I used the solution?

I have been using this solution for the last five years, but probably, the last three years have been significant.

What do I think about the stability of the solution?

It has been stable. I have not experienced any issues.

What do I think about the scalability of the solution?

It is decent for most things. I'm not sure if it is necessarily intended for large volume and high-speed streams of data. By large, I mean really big, but for pretty much anything that most users would want to do, including ourselves, it is fine. Our clients are large government organizations.

It scales fine within its environment. You can literally throw another Data Factory in or replicate one and do things pretty quickly. So, it is not at all hard to increase your processing footprint, but you have to pay for it. It doesn't end up being quite expensive. Although I haven't really done it, I would suspect that if I did the equivalent in AWS, Azure would be more expensive than AWS because of the way they price data.

How are customer service and technical support?

They're all right. I would rate them a seven out of 10. They do fine, but there is a lot that they don't do.

I'm not sure if even Microsoft has enough SMEs from a user point of view. They are helpful for getting it set up, making it work, and helping you figure out why it doesn't work. If you want to ask them about something that you are trying to do, they'll try to direct you to a partner, which is fine, but the partners also don't necessarily have an experience. It is Catch-22. There aren't a lot of people out there with Azure experience because Azure started to be in demand only over the last two years.

Which solution did I use previously and why did I switch?

The customer used a lot of homebrew stuff. They were doing a lot of internal stuff and some Oracle stuff. They were doing things, and they made a workaround and said, "Okay, we'll bring it into Oracle Database, and then we'll do all these things to it." We're like, "Okay, that works, but then you're taking it out of that database and putting it over into the data lake. I don't understand why are you doing that?" That's what they were doing.

How was the initial setup?

It is pretty straightforward. Devil is in the details, but you can easily get up and running in a day with Data Factory. Anybody who is comfortable in Azure can set up Data Factory, but it takes experience to know what it can and can't do or should and shouldn't do.

What other advice do I have?

It is proven, and it works. Make sure you have a well-defined use case and build a quick prototype to ensure that it, in fact, does what you need. Give yourself some benchmarks. That's exactly what we did. We defined the use case, and then we set up Data Factory. We found a couple of things that it didn't do. We figured out a way to work around those things and have it do those things. After that, we confirmed it. It is operational, and it is doing its job. It has been pretty much error-free since then.

It would become easier to use as more people become Azure-capable. If I want to find an AWS SME, I can get tons. They're expensive, but I have them. If I want to find an Azure SME, I usually have to create them. Azure was later to market than AWS. So, there are fewer people who are experts in Azure, and they are in high demand.

I would rate Azure Data Factory a nine out of 10. They just don't have enough good examples out there of things.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
PeerSpot user
.NET Architect at a computer software company with 10,001+ employees
Real User
A cloud-based data integration service that's easy to understand and use
Pros and Cons
  • "I think it makes it very easy to understand what data flow is and so on. You can leverage the user interface to do the different data flows, and it's great. I like it a lot."
  • "It would be better if it had machine learning capabilities."

What is our primary use case?

I use Azure Data Factory in my company because we are implementing a lot of different projects for a big company based in the USA. We're getting certain information from different sources—for example, some files in the Azure Blob Storage. We're migrating that information to other databases. We are validating and transforming the data. After that, we put that data in some databases in Azure Synapse and SQL databases.

What is most valuable?

I think it makes it very easy to understand what data flow is and so on. You can leverage the user interface to do the different data flows, and it's great. I like it a lot.

What needs improvement?

It would be better if it had machine learning capabilities. For example, at the moment, we're working with Databricks and Azure Data Factory. But Databricks is very complex to do the different data flows. It could be great to have more functionalities to do that in Azure Data Factory.

For how long have I used the solution?

I have been using Azure Data Factory for about one year.

What do I think about the stability of the solution?

Azure Data Factory is a stable solution.

What do I think about the scalability of the solution?

It's scalable. We're doing a lot of different integrations with a lot of data, and scalability is great.

How was the initial setup?

The initial setup is straightforward. I think that it's so easy to start a project using that technology.

What about the implementation team?

We have a team that's in charge of doing the deployments in Azure in different environments.

What other advice do I have?

I would tell potential users that there are many technologies to do this. For example, if you like to manage big data and do something with it, it would be better to use Databricks.

On a scale from one to ten, I would give Azure Data Factory a nine.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
PeerSpot user
Buyer's Guide
Download our free Azure Data Factory Report and get advice and tips from experienced pros sharing their opinions.
Updated: September 2025
Buyer's Guide
Download our free Azure Data Factory Report and get advice and tips from experienced pros sharing their opinions.