We use the solution for data analytics of industrial data.
A scalable and cost-effective solution that has excellent translation features and can be used for data analytics
Pros and Cons
- "It is a cost-effective solution."
- "The product should provide more advanced features in future releases."
What is our primary use case?
What is most valuable?
We extensively use the product’s notebooks, jobs, and triggers. We can create activities. Wherever translation is required, we use Databricks. The product fulfills our customer requirements. It is a cost-effective solution.
What needs improvement?
The product should provide more advanced features in future releases.
For how long have I used the solution?
I have been using the solution for six months.
Buyer's Guide
Databricks
June 2026
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: June 2026.
900,644 professionals have used our research since 2012.
What do I think about the stability of the solution?
Our data was not too huge. It worked well. It is easily adaptable.
What do I think about the scalability of the solution?
The tool is scalable. We can make it available for a larger audience.
How was the initial setup?
The initial setup is not that difficult. I rate the ease of setup a seven out of ten. The solution is cloud-based. We use native services like Data Factory for orchestration. Sometimes, the customers require us to use Amazon as the cloud provider instead of Azure.
What's my experience with pricing, setup cost, and licensing?
The pricing is average.
What other advice do I have?
There are many services which are coming up. They are still in the preview stage. Overall, I rate the product an eight out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
Computer Scientist at Adobe
Pumps up performance and the processing power; comes with helpful Lakehouse and SQL environments
Pros and Cons
- "When we have a huge volume of data that we want to process with speed, velocity, and volume, we go through Databricks."
- "I believe that this product could be improved by becoming more user-friendly."
What is our primary use case?
Our primary use case is for data analytics. Essentially, we use it for the financial reporting for Adobe.
How has it helped my organization?
The way Databricks has improved my organization is definitely through giving us improved performance and the processing power. We are usually never able to achieve it using traditional data warehouses. When we have a huge volume of data that we want to process with speed, velocity, and volume, we go through Databricks.
What is most valuable?
The features I found most helpful with Databricks are the Lakehouse and SQL environments.
What needs improvement?
I believe that this product could be improved by becoming more user-friendly.
In the next release, I would like to see more flexibility in the dashboard. It has plenty of features but it can be enhanced so that it matches with other visualization tools, like Power BI and Tableau. Also, the integrations with other tools could be better.
For how long have I used the solution?
I have been using Databricks for the last three years.
What do I think about the stability of the solution?
I would rate the stability of Databricks an eight, on a scale from one to 10, with one being the worst and 10 being the best.
What do I think about the scalability of the solution?
I would rate the scalability of this solution a nine, on a scale from one to 10, with one being the worst and 10 being the best. I would say there are around 2,000 to 3,000 users of this solution in our organization.
How are customer service and support?
I've been in contact with the Databricks support team and received timely support from them. I would rate their support an eight, on a scale from one to 10, with one being the worst and 10 being the best.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
Prior to Databricks, we initially used Hadoop. Afterwards, we used HANA, SAP HANA, and the Microsoft SQL Server.
How was the initial setup?
The initial setup was relatively straightforward. I would rate it nine, on a scale from one to 10, with one being the easiest and 10 being the hardest.
There is no need to worry about the deployment as it can be done quickly. It is relatively automated. We used Terraform for auto-deployment, which happens in Azure. With Terraform, there are two options. As option one, you can deploy manually by creating services. For option two, you use Terraform and automate. Terraform is like infrastructure as a code where you can code the deployment part using it.
There were two or three persons involved in the deployment of this solution.
What other advice do I have?
The new version of the Databricks solution requires code maintenance. This is done by the platform team.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Databricks
June 2026
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: June 2026.
900,644 professionals have used our research since 2012.
Principal at a computer software company with 5,001-10,000 employees
Has advanced modeling and machine-learning features; highly scalable, with no stability issues
Pros and Cons
- "What I like about Databricks is that it's one of the most popular platforms that give access to folks who are trying not just to do exploratory work on the data but also go ahead and build advanced modeling and machine learning on top of that."
- "I have had some issues with some of the Spark clusters running on Databricks, where the Spark runtime and clusters go up and down, which is an area for improvement."
What is our primary use case?
I've worked with Databricks primarily in the pharmaceuticals and life sciences space, which means a lot of work on patient-level data and the predictive analytics around that.
Another use case for Databricks is in the manufacturing industry. I'm a consultant, so the use cases for the product vary, but my primary use case for it is in the pharma space.
What is most valuable?
From a data science and applied analytics perspective, what I like about Databricks is that it's probably one of the most popular platforms that give access to folks who are trying not just to do exploratory work on the data but also go ahead and build advanced modeling and machine learning on top of that, and then go ahead and make that available for dissemination of insights. For example, you can save all data and build out endpoints, so business analysts and users can access that data through a dashboard.
During the process, I also like that Databricks allows you to do portion control to keep track of your operations on the data and maintain that lineage to create reproducible results.
The most significant Databricks advantage is that you can do everything within the platform. You don't need to exit the platform because it's a one-stop shop that can help you do all processes.
The solution is top-notch from a data science, applied ML, or advanced analytics perspective.
What needs improvement?
I have had some issues with some of the Spark clusters running on Databricks, where the Spark runtime and clusters go up and down, which is an area for improvement. Still, I am generally unaware of any super-critical issues.
For how long have I used the solution?
My experience with Databricks is two and a half years.
What do I think about the stability of the solution?
Databricks stability is an eight out of ten because I never had issues with its stability.
What do I think about the scalability of the solution?
Databricks has high scalability. Most of my work on the solution has been in the pharma space, which has massive data sets, so it's a nine out of ten, scalability-wise.
How are customer service and support?
I've never dealt with the Databricks technical support team.
How was the initial setup?
I don't have experience setting up Databricks because that's generally taken care of by the IT, data, or software engineering team before the data science team comes in and starts leveraging the platform. I have yet to experience setting up the Databricks environment personally. However, I have had experience setting up clusters, which was pretty straightforward. Still, in the overall environment of an enterprise-wide system, I have yet to gain experience setting Databricks up.
What's my experience with pricing, setup cost, and licensing?
The cost for Databricks depends on the use case. I work on it as a consultant, so I'm using the client's Databricks, so it depends on how big the client is. If it's a global organization, that cost varies versus a smaller organization that has just adopted the platform and is trying to onboard a small team of five people. It depends.
What other advice do I have?
I'm a data scientist, so I frequently use Databricks and Domino Data Science Platform.
I'm a consultant, so every client has a different version or a different runtime in Databricks, so the versions used would vary per client.
The deployment for the solution is on the cloud, predominantly on AWS or Azure.
My clients adopted Databricks as the platform of choice, and with different use cases and more teams coming on board, the usage of Databricks will increase. I don't see that going down. It can only go up.
My advice to anyone looking into implementing Databricks is that it should be one of your top choices, especially if you're looking to focus on data processing, standard ETL operations, advanced analytics, or the ML type of work.
I'd rate the solution as nine out of ten. It checks almost all the boxes that modern applications need to have.
My organization is an active partner and implementer of Databricks, but it doesn't resell the solution.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
Vice President - Data Engineering and Analytics at a financial services firm with 10,001+ employees
A good, but expensive, web-based platform for automated cluster management with some coding limitations
Pros and Cons
- "We like that this solution can handle a wide variety and velocity of data engineering, either in batch mode or real-time."
- "We looked at both Snowflake and BigQuery as a comparison with this solution, and we chose this product as it offered more scalability and a higher level of security, which is extremely important in our banking environment."
- "This solution only supports queries in SQL and Python, which is a bit limiting."
- "This is a fairly expensive solution for any service outside of the basic package, and costs can add up quite quickly if there are large scaling requirements."
What is our primary use case?
We use this solution for advanced civilization power.
What is most valuable?
We like that this solution can handle a wide variety and velocity of data engineering, either in batch mode or real-time.
This product allows us to write the email models in a way that allows us to take the advantage of the parallel scaling computer window backend on any of the satellite services.
What needs improvement?
This solution only supports queries in SQL and Python, which is a bit limiting.
This is a fairly expensive solution for any service outside of the basic package, and costs can add up quite quickly if there are large scaling requirements.
What do I think about the stability of the solution?
This is a stable solution in our experience.
What do I think about the scalability of the solution?
We have found that part of the beauty of this platform is that it is easy to scale and expand.
How are customer service and support?
The support for this product uses Microsoft as a middle man, and due to this there have been times when we experienced communication delays, as well as misunderstandings of what our issues are.
How would you rate customer service and support?
Neutral
How was the initial setup?
The initial setup for this solution is very simple.
What's my experience with pricing, setup cost, and licensing?
The basic version of this solution is now open-source, so there are no license costs involved. However, there is a charge for any advanced functionality and this can be quite expensive.
Which other solutions did I evaluate?
We looked at both Snowflake and BigQuery as a comparison with this solution. We choose this product as it offered more scalability and a higher level of security, which is extremely important in our banking environment.
What other advice do I have?
We would rate this solution an eight out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
A stable, scalable solution that simplifies the development process but needs more debuggers and components
Pros and Cons
- "The simplicity of development is the most valuable feature."
- "Databricks has a lack of debuggers, and it would be good to see more components."
What is our primary use case?
We use the solution for data engineering.
How has it helped my organization?
The tool helps us manage large amounts of data.
What is most valuable?
The simplicity of development is the most valuable feature.
What needs improvement?
Databricks has a lack of debuggers, and it would be good to see more components.
Another issue is that the D4 data format keeps changing on our cluster. This doesn't affect me much because I use functions to define it, but it is very frustrating for some more casual users. One day the output will be in a particular format, and then it becomes an object without us changing the cluster configuration. As a small team, we don't have the capacity to dig deeply into the issue, which has been frustrating.
For how long have I used the solution?
We have been using the solution for three years.
What do I think about the stability of the solution?
The solution's stability is good.
What do I think about the scalability of the solution?
The product is scalable. We're a small organization with 12 users, and we don't currently have any plans to increase our usage.
What was our ROI?
We see an ROI from Databricks.
What other advice do I have?
I would rate the solution seven out of ten.
It's a good solution and more for handling large amounts of data. Databricks is better as a batch processing system than as an interactive system. The performance is a little disappointing because the memory processing is supposed to be excellent, but it's not as competitive as some other solutions out there in this regard. Even classical databases can respond and process faster.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Sr Data Engineer at PIMCO
Supports several coding languages, good performance, and facilitates team collaboration
Pros and Cons
- "The load distribution capabilities are good, and you can perform data processing tasks very quickly."
- "The most valuable feature is the versatility of the ecosystem."
- "In the future, I would like to see Data Lake support. That is something that I'm looking forward to."
- "The cost of this solution is high, on the expensive side."
What is our primary use case?
Our primary use case is ETL.
How has it helped my organization?
Using Databricks enables us to use the Data Mesh methodology, where every team performs their own ETL.
What is most valuable?
The most valuable feature is the versatility of the ecosystem. You can write code in SQL, Python, or Java.
The load distribution capabilities are good, and you can perform data processing tasks very quickly.
You can save and share notebooks between different teams.
The interface is easy to use.
What needs improvement?
The cost of this solution is high, on the expensive side.
In the future, I would like to see Data Lake support. That is something that I'm looking forward to.
For how long have I used the solution?
I worked with Databricks for approximately two years in my previous company.
What do I think about the scalability of the solution?
This is a very scalable solution. We have twenty-five data engineers that use it, and we may grow our usage.
How are customer service and support?
The technical support is okay. I would rate them a seven out of ten.
How would you rate customer service and support?
Neutral
Which solution did I use previously and why did I switch?
We did not use another similar solution prior to Databricks.
How was the initial setup?
The cloud-based deployment is simple.
If you use an on-premises deployment then there is more to do.
What about the implementation team?
We deployed it with our in-house team.
There is no maintenance required.
What was our ROI?
We have seen a return on our investment with Databricks.
What's my experience with pricing, setup cost, and licensing?
Price-wise, I would rate Databricks a three out of five.
Which other solutions did I evaluate?
When we looked into Databricks, we evaluated Azure Data Factory and some of the others on the market. We found that Databricks was one of the easiest ones to use.
What other advice do I have?
My advice for anybody that is looking into Databricks is not to use the on-premises deployment. Instead, use the cloud-based setup.
In summary, this is a good product.
I would rate this solution an eight out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Team Lead at a tech services company with 1,001-5,000 employees
Gives us the ability to write analytics code in multiple languages
Pros and Cons
- "Databricks provides a consistent interface for data engineers to work with data in a consistent language on a single integrated platform for ingesting, processing, and serving data to the end user."
- "Databricks is a one-stop shop for everything data related, and it can scale with you."
- "Databricks would benefit from enhanced metrics and tighter integration with Azure's diagnostics."
- "I would like to see improvement with the UI. It is functional and useful, but it's a bit clunky at times."
What is our primary use case?
We use Databricks for batch data processing and stream data processing.
How has it helped my organization?
Databricks provides a consistent interface for data engineers to work with data in a consistent language on a single integrated platform for ingesting, processing, and serving data to the end user.
What is most valuable?
The flexibility of Databricks is the most valuable feature. It gives us the ability to write analytics code in multiple languages.
There is a single workspace for different data roles like data engineers, machine learning engineers, and the end user, who can connect to the same system.
Databricks computes separate from storage, so you are not coupled with the underlying data sets, allowing for multiple processes and multiple programs to be written on the same code.
What needs improvement?
I would like to see improvement with the UI. It is functional and useful, but it's a bit clunky at times. It should be more user-friendly.
In future releases, Databricks would benefit from enhanced metrics and tighter integration with Azure's diagnostics.
For how long have I used the solution?
I have been using Databricks for eight months.
What do I think about the stability of the solution?
Databricks is very stable.
What do I think about the scalability of the solution?
The scalability of this solution is good. In our organization, users include analysts, data engineers, and data scientists.
How are customer service and support?
I would give Databrick service and support a four and a half out of five overall.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
Prior to using Databricks, we used Azure Stream Analytics. We made the switch because of the scalability and integrated platform.
How was the initial setup?
The initial setup of Databricks is more complex. I would rate it a four out of five on the complexity of the setup. It took two days to deploy the solution.
What about the implementation team?
We used a third party for some of the implementations of Databricks. The number of staff required to deploy and maintain this solution depends on the number of processes you have. Due to the cloud nature of the technology, it is easy to deploy and maintain.
What's my experience with pricing, setup cost, and licensing?
The licensing of Databricks is a tiered licensing regime, so it is flexible. I feel their pricing is a five out of five.
What other advice do I have?
Databricks is a one-stop shop for everything data related, and it can scale with you.
I would rate this solution a 9.5 out of 10 overall.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Head CEO at bizmetric
A user-friendly and customizable solution that offers excellent integration
Pros and Cons
- "The solution is built from Spark and has integration with MLflow, which is important for our use case."
- "Databricks is also user-friendly, providing customizable codes and models that allow people to experiment quickly."
- "The ability to customize our own pipelines would enhance the product, similar to what's possible using ML files in Microsoft Azure DevOps."
What is our primary use case?
Our use case is confidential, but I can say we use it for a deep learning model for machine learning.
What is most valuable?
The solution is built from Spark and has integration with MLflow, which is important for our use case.
Databricks is also user-friendly, providing customizable codes and models that allow people to experiment quickly.
Integration of Delta Lake is another useful feature.
What needs improvement?
Writing pandas-profiling reports could be easier.
The ability to customize our own pipelines would enhance the product, similar to what's possible using ML files in Microsoft Azure DevOps.
For how long have I used the solution?
I have been using this product for one and a half years.
What do I think about the stability of the solution?
For now the solution seems stable.
What do I think about the scalability of the solution?
The solution is easy to scale horizontally and it has a useful auto-scaling feature. For vertical scaling, you need to bring the system down and make some adjustments.
On my current project I have a team of 30 members under me, including data engineers and data science people. Our data science, engineering, and MLOps projects are expanding, so we are planning to do some vertical scaling to increase the team size to over 100 members. In our company, we are trying to certify more and more people in Databricks because it's cloud-agnostic.
How are customer service and support?
We have never needed to contact customer support, online resources have been sufficient to solve our problems.
How was the initial setup?
The initial setup of the solution is straightforward, once you understand the UI it is easy to implement. I would rate Databricks a four out of five for ease of setup.
One migration project took two to three months, including writing all the code and implementing end-to-end pipelines.
We are planning to deploy the solution in stages over the next 15 months to completely implement MLOps for our organization.
What's my experience with pricing, setup cost, and licensing?
I'm not involved in the financing, but I can say that the solution seemed reasonably priced compared to the competitors. Similar products are usually in the same price range. With five being affordable and one being expensive, I would rate Databricks a four out of five.
I find that deployed systems work out cheaper than having to operate manually, which appeals to our customers.
What other advice do I have?
I would rate this solution an eight out of ten.
There is an issue where clusters are automatically deleted after termination or after 100 days of non-usage. This could be more user-friendly, and they could include an enabler to pin the clusters you want to keep, instead of having to go and research why clusters got deleted after implementing the product. That documentation needs to be right in front of the user to avoid issues.
I definitely recommend this product to other users.
Which deployment model are you using for this solution?
Hybrid Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
Technical Architect at Infosys
Enables us to find anomalies and apply rules to the streaming data
Pros and Cons
- "The ability to stream data and the windowing feature are valuable."
- "Support for Microsoft technology and the compatibility with the .NET framework is somewhat missing."
- "Databricks is still having some stability issues."
What is our primary use case?
We use this solution for finding anomalies and applying the rules to the streaming data.
There are around 50 people using this solution in my organization, including data scientists.
What is most valuable?
The ability to stream data and the windowing feature are valuable. There are a number of targeted integration points, so that is a difference between Stream Analytics and Databricks. The integrations input or output are better in Databricks. It's accessible to use any of the Python or even Java. I can use the third party, deploy it, and use it.
What needs improvement?
Support for Microsoft technology and the compatibility with the .NET framework is somewhat missing. There should be reliability between these two. Databricks is based on open sources. If it's more synchronous between the Microsoft technology and the programming languages, it'll be better. Python has better languages, but compatibility would be a great help.
I would like to have better support for Microsoft technology and better language components.
With Azure or Cosmo DB, I can store other data links or time series data tables. That would be a great help for analytics in real time.
For how long have I used the solution?
I have been using Databricks for eight months.
What do I think about the scalability of the solution?
The scalability is fine. We had thousands of devices and were sending data infrequently, so that worked for us. If the amount increases, the windowing function and job schedule may not perform as expected.
How are customer service and support?
I would rate technical support 4 out of 5. We had some issues with setup, and they were finally solved but it was after following up a few times.
Which solution did I use previously and why did I switch?
Azure Stream Analytics is easy to use and easy to deploy. It's a little bit better. Databricks is still having some stability issues. Azure Stream Analytics has a few input and output sources, and it's scalable to all types of third party or interfaces.
How was the initial setup?
Setup was complex. There were some issues with setting up a database and installing the third party component on top of services. I would rate the setup 3 out of 5.
What about the implementation team?
Implementation was done in-house.
What's my experience with pricing, setup cost, and licensing?
The cost is around $600,000 for 50 users.
I would rate the price 2 out of 5.
What other advice do I have?
I would rate this solution 8 out of 10.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Helps users with data processing and analytics
Pros and Cons
- "The tool helps with data processing and analytics with large-scale data or big data since it is associated with managing data at a large scale."
- "The biggest problem associated with the product is that it is quite pricey."
What is our primary use case?
I use Databricks to manage the setting up of data lakes for SaaS.
What needs improvement?
The biggest problem associated with the product is that it is quite pricey. We cannot find a better solution than Databricks in the market currently.
For how long have I used the solution?
I have been using Databricks for a year.
What's my experience with pricing, setup cost, and licensing?
It is an expensive tool. The licensing model is a pay-as-you-go one.
What other advice do I have?
The tool helps with data processing and analytics with large-scale data or big data since it is associated with managing data at a large scale.
For my general use cases, I would say that I am not a technical person, so I cannot explain to you how the tool helps with the area of data engineering tasks.
There is another team in my company that is involved in the use of machine learning and AI features in Databricks. My team is mostly into operations. The tool is used in a multi-country project.
For example, in my company, they make some shopping decisions related to solutions based on what is the product chosen by the whole company.
I rate the tool an eight out of ten.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros
sharing their opinions.
Updated: June 2026
Product Categories
Cloud Data Warehouse Data Science Platforms Data Management Platforms (DMP) Streaming AnalyticsPopular Comparisons
Informatica Intelligent Data Management Cloud (IDMC)
Teradata
Qlik Talend Cloud
Palantir Foundry
SAP Business Data Cloud
Snowflake
Azure Data Factory
Coralogix
KNIME Business Hub
IBM SPSS Statistics
Alteryx
Dataiku
Denodo
Amazon SageMaker
Microsoft Azure Machine Learning Studio
Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Which do you prefer - Databricks or Azure Machine Learning Studio?
- How would you compare Databricks vs Amazon SageMaker?
- Which would you choose - Databricks or Azure Stream Analytics?
- Which product would you choose for a data science team: Databricks vs Dataiku?
- Which ETL or Data Integration tool goes the best with Amazon Redshift?
- What are the main differences between Data Lake and Data Warehouse?
- What are the benefits of having separate layers or a dedicated schema for each layer in ETL?
- What are the key reasons for choosing Snowflake as a data lake over other data lake solutions?
- Are there any general guidelines to allocate table space quota to different layers in ETL?
- What cloud data warehouse solution do you recommend?























