The most valuable feature of Azure Data Lake Storage is the ability to partition data into various datasets using a directory hierarchy. This folder structure is key for any delivery. Currently, we're not doing much with the data in the tool, but when Databricks comes along, we'll convert it to Parquet format. It's a two-step process: raw data is moved to Parquet, which Databricks can manipulate easily.
Enterprise Architect at a non-profit with 501-1,000 employees
Able to partition data into various datasets using a directory hierarchy
Pros and Cons
- "The most valuable feature of Azure Data Lake Storage is the ability to partition data into various datasets using a directory hierarchy. This folder structure is key for any delivery. Currently, we're not doing much with the data in the tool, but when Databricks comes along, we'll convert it to Parquet format. It's a two-step process: raw data is moved to Parquet, which Databricks can manipulate easily."
- "One improvement I'd suggest is the out-of-the-box conversion of input data, like spreadsheet or table data, to various formats. We'll be using Parquet, which enables transactional integrity."
What is most valuable?
What needs improvement?
One improvement I'd suggest is the out-of-the-box conversion of input data, like spreadsheet or table data, to various formats. We'll be using Parquet, which enables transactional integrity.
For how long have I used the solution?
I have been using the product for a year.
What do I think about the stability of the solution?
Stability is good if you build your Azure Data Lake Storage well in the first place.
Buyer's Guide
Azure Data Lake Storage
October 2025

Learn what your peers think about Azure Data Lake Storage. Get advice and tips from experienced pros sharing their opinions. Updated: October 2025.
868,787 professionals have used our research since 2012.
What do I think about the scalability of the solution?
Scalability depends on process complexity—it is high for simple processes and low for complex ones. This is due to the architecture of a data lake, but once converted to a data lakehouse, scalability is high across the board. I think Azure Data Lake Storage would suit medium—to large enterprises.
How are customer service and support?
Microsoft's documentation is superb, and support is good, especially if you have a relevant intermediate supplier.
Which solution did I use previously and why did I switch?
We haven't compared Azure Data Lake Storage with products from other vendors because we're an Azure shop. We did check that the Azure product was good enough for our needs, and it was, so we didn't explore alternatives like AWS, Google, or Snowflake.
How was the initial setup?
The initial setup is fairly complex, but if you get your data architecture right from the start, it's not a problem. We're using a totally cloud-based deployment with Azure.
What other advice do I have?
Integration capabilities are fairly smooth and comparable to AWS in terms of cloud integration. Some might say it's slightly better, others slightly worse, but I think it's good. I'd rate Azure Data Lake Storage an eight out of ten. However, it's important to note that it's only eventually consistent, so don't expect immediate consistency when changes are made. It works well as a data storage bucket for future use, but it's unsuitable for transactional work. You need to use a data lakehouse like Databricks for transactional processes, which can handle transactional work once the data is in the correct format (like Parquet). The tool is great for storing data you want to put into a data lakehouse, but not for frequent transactions. It's suitable for daily archiving, but anything more frequent than that might cause issues.
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.

Data Architecture and Engineering Specialist at coprocenva
Manages large data volumes and has user-friendly automation
Pros and Cons
- "Azure Data Lake Storage is user-friendly and easy to use."
- "Azure Data Lake Storage is user-friendly and easy to use."
- "Maybe the solution could be a bit more user-friendly."
- "The scalability is limited. However, it's easy to set up."
What is our primary use case?
We use Azure Data Lake Storage for managing large data volumes in our big data projects.
How has it helped my organization?
I have configured the tool to automate the deletion of data and transfer data from one repository to another automatically.
What is most valuable?
Azure Data Lake Storage is user-friendly and easy to use. It effectively manages large data volumes and allows for automated configuration of data operations such as deletion and transfer between repositories.
What needs improvement?
Maybe the solution could be a bit more user-friendly.
What do I think about the stability of the solution?
It is very stable and reliable. It is a good solution that doesn't crash.
What do I think about the scalability of the solution?
The scalability is limited. However, it's easy to set up.
How are customer service and support?
The support from Microsoft for Azure products is good. It's timely.
How would you rate customer service and support?
Positive
How was the initial setup?
The initial setup is very easy with this tool.
What's my experience with pricing, setup cost, and licensing?
I am not familiar with the pricing.
What other advice do I have?
Overall, I would rate the Azure Data Lake Storage as nine out of ten.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Azure Data Lake Storage
October 2025

Learn what your peers think about Azure Data Lake Storage. Get advice and tips from experienced pros sharing their opinions. Updated: October 2025.
868,787 professionals have used our research since 2012.
Data Engineer at Universidad Peruana de Ciencias Aplicadas
Allows file access and reduces cost by optimizing performance
Pros and Cons
- "The solution is worth the money because it allows you to gain insights into your business and implement forecasting solutions to predict future trends."
- "When you store your files manually, you can't ensure complete data integrity, which can impact data security."
What is our primary use case?
We have parameters to create three types of data storage. The first is staging, the second is the intermediate, and the third is the target. The target storage contains the cleanest data used for reporting tools. For specific cases in Data Lake projects, we often use files stored in formats such as CSV, which are the most useful for this type of data processing.
How has it helped my organization?
Most of our clients use Excel. We prioritize making data accessible in formats compatible with Excel. We aim to meet these client requirements, for example, with Excel files. However, for big data solutions, it's often more efficient to use formats, which we store in the Data Lake.
What is most valuable?
In some projects, you can usually access files, enabling accessibility, mixing, and transformation. This is useful for both our team and for data engineers. For clients, it reduces costs by optimizing performance and calls and allows for implementing a security model. Additionally, using tools like Global Storage, you can create a hybrid cloud directory or restructure data, making it more organized and easier for clients to integrate with ETL tools.
What needs improvement?
When you store your files manually, you can't ensure complete data integrity, which can impact data security.
When you make these types of releases or improvements in this solution, you can enhance the data's stability. You can also include features like security integration with Active Directory for data access and ensure compatibility for various integrations. This approach complements both structured and unstructured data, making it more suitable for big data solutions.
For how long have I used the solution?
I have been using Azure Data Lake Storage for two years
What do I think about the stability of the solution?
What do I think about the scalability of the solution?
There are some limitations regarding data scalability. In such cases, you can complement it with Databricks. Databricks has a powerful engine that can enhance your security solutions. This combination provides great data performance and a better overall solution.
How are customer service and support?
Once you pass the initial two weeks, it becomes simpler to manage technical support. However, before this period, it is important to have clear instructions, documentation, or videos prepared to assist with technical support and ensure a smooth deployment process.
It's great, but it can be challenging if you don't have all the necessary documents or lack one-on-one discussions about the solutions.
How would you rate customer service and support?
Positive
How was the initial setup?
The initial setup depends on the project, but when storing this type of project, you need to have access activities to provide the source of the data. It's important to have a core level of security to access and transform the data. After storing the data, you need a solution for quality assurance to ensure data integration and quality. This is crucial because poor data quality can affect the future of your solution. Implementing data quality measures is essential for the success of your solutions.
Deployment depends, but it takes about two weeks. For example, one week is typically for deployment, and the other week is for checking the data flow to ensure no errors. If there is an error, you will receive an alert to check the status of the operation.
We need an Azure DevOps professional who makes many configurations to pass for a developer in a production environment.
What was our ROI?
The solution is worth the money because it allows you to gain insights into your business and implement forecasting solutions to predict future trends. Investing in such solutions is valuable for understanding and planning for future developments.
What other advice do I have?
For ETL solutions, it’s essential to stay updated with new and recurring events that might arise. Reviewing forums, articles, and media can help you identify trends and troubleshooting tips relevant to your project. This proactive approach can help you anticipate and address potential issues effectively.
Overall, I rate the solution a nine out of ten.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
BI & Data Engineering Manager at a sports company with 10,001+ employees
Allows for seamless integration and secure storage with decent pricing
What is our primary use case?
When dealing with data from various sources or servers, Azure Data Lake Storage allows for seamless integration and secure storage through features like Azure BitLocker encryption. This ensures data integrity and protection, regardless of its origin or destination.
How has it helped my organization?
We focus on the regional messaging for our data housed within it ensures better data management and quality control.
What is most valuable?
The price for Azure is decent. However, the reliability of the provider and their efficient collaboration are positive aspects. Using Azure Data services, we've successfully deployed support systems that facilitate seamless data confirmation across various resources within our company.
Azure offers a comprehensive suite of tools beneficial for both development and delivery. Performance, usability, and scalability are strong points. Recently, I acquired a new tool called AWS Storage Gateway for seamless data transfer between AWS storage and Azure Data Lake, enhancing database operations and developer workflows.
What needs improvement?
One feature could be added is the ability to create and manage files within the same storage using serverless query integration with Data Lake Analytics.
When comparing your account with AWS, the financial aspect will be the deciding factor for the customer. If it's a favorable response, we will continue working with Azure. We were awaiting the HTTPS integration.
There's a strong community for learning how to use Azure Data Lake. We've encountered an issue while testing AzureProtect.
Overall, I rate the solution a nine out of ten.
For how long have I used the solution?
I have been using Azure Data Lake Storage for a year.
What do I think about the stability of the solution?
The solution is stable.
What do I think about the scalability of the solution?
The product’s scalability is good. 45 users are using this solution.
How was the initial setup?
The initial setup is easy. Our team consisting of two individuals initially, with the option for a third support member, making it easier to utilize Infiniti. It takes four months to deploy.
We have ongoing plans for building more deposits. Additionally, they can send project details via calls, as all projects are managed within the same system.
Four members were involved during deployment.
What's my experience with pricing, setup cost, and licensing?
The pricing is reasonable.
What other advice do I have?
We're planning to utilize machine learning techniques in Azure Data Lake.
Maintenance is easy.
Overall, I rate the solution a nine out of ten.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Senior Software Engineer at a tech consulting company with 10,001+ employees
Easily integrates with a company's current workflow
Pros and Cons
- "The response time and quality offered by the support team are good."
- "The high price of the product is an area of concern where improvements are required."
What is our primary use case?
I use the solution in my company as per our project requirements. In my company, we are only putting data from on-premises RDBMS into Azure Data Lake Storage Gen2, and then the file is stored in parquet format. After the aforementioned process is followed, my company has another data engineering team, which reads those data further.
What is most valuable?
For writing data to data lake, my company uses Oracle GoldenGate for Big Data. With Oracle GoldenGate for Big Data, my company had to use Handler (Java Platform SE 8), but now we use HDFS Handler, and then using it, we have to configure some files and open some ports between a bank's private network to Azure Data Lake Storage Gen2. After opening all the aforementioned areas, my company is able to push the data to Azure Data Lake Storage Gen2.
What needs improvement?
In my company, we are not facing any slowness or other kinds of issues with the product. Each day in my company, we create new directories and put the current files into them, so there is the segregation part that is taken care of, and because of this, there are no issues with the tool.
In our company, one of the teams use Azure Databricks to read data from Azure Data Lake Storage's account and as per the business use case, they move data or take the data further. The project I am currently doing has only limited work. I haven't explored all the points associated with the tool.
The high price of the product is an area of concern where improvements are required.
For how long have I used the solution?
I have been using Azure Data Lake Storage for a year.
What do I think about the stability of the solution?
The product's stability is good.
What do I think about the scalability of the solution?
The scalability part of the product is very good, and my company has not faced any issues with it.
At present, 15 to 16 percent of the company uses the tool, but it will increase by a percent in the future.
How are customer service and support?
The response time and quality offered by the support team are good. I rate the technical support as nine out of ten.
How would you rate customer service and support?
Positive
How was the initial setup?
The product's initial setup phase is not too simple, and it can be described as a moderate process.
The solution is deployed on the cloud.
What's my experience with pricing, setup cost, and licensing?
I rate the product price as two or three, where one is high, and ten is low. The product's price is really high.
What other advice do I have?
Integrating Azure Data Lake Storage into my company's current workflow was easy.
I recommend the product to those who plan to use it.
I rate the tool an eight to nine out of ten.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Data Manager at National Committee for Quality Assurance
User-friendly and helps aggregate all information into one particular environment
Pros and Cons
- "Azure Data Lake Storage is a user-friendly and easy-to-learn solution."
- "Some terminology is not easy to understand when using some interfaces."
What is our primary use case?
We use the tool to aggregate all our information into one particular environment. We have separate environments, but we aggregate the important pieces into one environment so they can be used for reporting and analytics.
What is most valuable?
Azure Data Lake Storage is a user-friendly and easy-to-learn solution.
What needs improvement?
The solution's instructions could be improved. Some terminology is not easy to understand when using some interfaces. Microsoft must provide more examples because everybody uses the tool differently.
For how long have I used the solution?
I have been using Azure Data Lake Storage for 5 years.
What do I think about the stability of the solution?
I rate the solution’s stability ten out of ten.
What do I think about the scalability of the solution?
Around 300 users use the solution in our organization, including developers and people who use the data through report interfaces.
I rate the solution ten out of ten for scalability.
How was the initial setup?
The solution’s initial setup was straightforward.
What about the implementation team?
We implemented the solution through an in-house team. Certain features are turned on to deploy the tool. You go in there and set up what you need, like the database or data factory. Once they turn it on and give you permission, you go in there and set it up yourself.
What was our ROI?
The tool is worth the money because I've worked in other environments, and this one seems to be the most flexible and easy to use.
What other advice do I have?
I use Azure Data Lake Storage version 2. We integrate the solution with other internal and external sources through REST and SQL. We have external sites that we integrate or pull the information into. We can see all the data in one place and report off of it.
The tool can be expanded at a moment's notice. Once I'm done, I can lower the threshold back down. The solution's flexibility allows me to increase my CPU or database size at a moment's notice.
Overall, I rate the solution 10 out of 10.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
VP - Global Delivery Head at Enhops
Has security access policies and helps to store customer data
Pros and Cons
- "The tool's most valuable feature is security access policies."
- "The solution needs to improve APIs and make them more accessible."
What is our primary use case?
We use the Azure Data Lake to store raw customer files exported from their databases. Our pipelines then pick up this data and process it in various ways. For instance, we use Databricks to handle the data processing, transformation, and ETL tasks. The processed data is then stored in SQL Server or converted into other file formats.
What is most valuable?
The tool's most valuable feature is security access policies.
What needs improvement?
The solution needs to improve APIs and make them more accessible.
What do I think about the scalability of the solution?
The tool is scalable. We currently manage around 200 GB of data, and recently, we've optimized and synchronized additional data, which is approximately 500 GB. My company has 1000-2000 users who use it weekly.
How are customer service and support?
We've had to seek help several times, particularly with Power BI and Databricks integrations. However, the initial level of support hasn't always been as knowledgeable as we'd like. They usually gather information and then schedule follow-up appointments, which can take a few days. Improving the expertise and responsiveness of the first-level support team could speed up the resolution process.
How was the initial setup?
We use templates for deployment and automate the process.
What's my experience with pricing, setup cost, and licensing?
The tool's licensing model is pay-as-you-go. Regarding pricing, there's always competition between Azure Data Lake Storage and AWS. They're quite similar, but to attract more customers, Azure Data Lake Storage could consider adjusting its pricing to be more competitive with AWS. This might make Azure Data Lake Storage a more appealing option for users.
What other advice do I have?
The tool is the best platform for storing all kinds of data; I've never experienced any downtime with it. Plus, it offers secure access and security features, which I appreciate. I rate the product a nine out of ten.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Data Architect /Data Engineer at Regional Council
Efficient data integration enabling modern data platforms
Pros and Cons
- "Azure Data Lake itself doesn't have built-in features for handling data on its own. It's used as a collection of data. Tools you can use with it include Azure Data Factory and Databricks."
- "Azure Data Lake Storage should support other formats apart from the Data Lake format, such as the Iceberg format."
What is our primary use case?
The main use cases are for people who don't want their data to be siloed. They want to integrate it into one place, which is a data lake, where they can have data coming from different sources and formats. They want to be able to report on the data from one place, have different personnel work in the same place, and utilize the modern data platform.
What is most valuable?
Azure Data Lake itself doesn't have built-in features for handling data on its own. It's used as a collection of data. Tools you can use with it include Azure Data Factory and Databricks. Most recently, Microsoft Fabric has been the main tool I have recommended.
What needs improvement?
Azure Data Lake Storage should support other formats apart from the Data Lake format, such as the Iceberg format. Additionally, improvements in supporting various formats would be beneficial.
For how long have I used the solution?
I've been working with Azure Data Lake Storage for about three years now.
What do I think about the stability of the solution?
I would rate stability between nine and ten. It is very stable, however, it depends on settings like geographical redundancy. It's also dependent on expertise and the company's willingness to pay for specific features.
What do I think about the scalability of the solution?
It is very scalable, especially the Gen 2 version. Azure Data Lake Gen 2 is very flexible and uses hierarchical file structures. The newest version, Gen 3, also supports a lake house approach and integrates with other cloud storage like Amazon S3, Google Storage, and Snowflake.
How are customer service and support?
In terms of community support, they respond between one to two weeks. Organization-paid technical support almost gets an immediate response. Overall, their service is rated eight.
How would you rate customer service and support?
Positive
How was the initial setup?
Microsoft Fabric is easier to use than Databricks. Fabric is a software as a service (SaaS), while Databricks is a platform as a service (PaaS, making Fabric easier for starters. Setting up Azure Data Lake Storage can be quick if done manually, but using code ensures scalability.
What's my experience with pricing, setup cost, and licensing?
It's very cheap to store large terabytes of data. It costs just a few dollars per terabyte per month. Computational costs could vary based on usage and are generally more expensive than storage.
What other advice do I have?
For startups, I recommend using Microsoft Fabric as it integrates well with other tools and is cost-efficient. The pricing model is easy to understand. I'd rate the solution nine out of ten.
Which deployment model are you using for this solution?
Hybrid Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.

Buyer's Guide
Download our free Azure Data Lake Storage Report and get advice and tips from experienced pros
sharing their opinions.
Updated: October 2025
Product Categories
Cloud StoragePopular Comparisons
Dropbox Business - Enterprise
Buyer's Guide
Download our free Azure Data Lake Storage Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- What is the best solution for an enterprise-level storage environment?
- Which is better—Box or Dropbox?
- What is the minimum security features set required for Cloud Backup and Storage Software?
- When evaluating Cloud Storage, what aspect do you think is the most important to look for?
- How can we build a healthy digital transformation pipeline in 2022?
- How would you recommend selecting a compute and storage solution based on the company size?
- What are the benefits and drawbacks of using cloud storage?
- What are your top 3 Cloud Storage predictions for 2022?
- With the increasing risk of cyber attacks in the west, due to the war in Ukraine, how safe is your data in the cloud?
- Why is Cloud Storage important for companies?