Databricks offers a scalable, versatile platform that integrates seamlessly with Spark and multiple languages, supporting data engineering, machine learning, and analytics in a unified environment.
Product | Market Share (%) |
---|---|
Databricks | 8.3% |
Snowflake | 17.7% |
Dremio | 8.9% |
Other | 65.1% |
Title | Rating | Mindshare | Recommending | |
---|---|---|---|---|
Teradata | 4.1 | 8.5% | 87% | 77 interviewsAdd to research |
KNIME Business Hub | 4.1 | N/A | 94% | 60 interviewsAdd to research |
Company Size | Count |
---|---|
Small Business | 24 |
Midsize Enterprise | 12 |
Large Enterprise | 50 |
Company Size | Count |
---|---|
Small Business | 811 |
Midsize Enterprise | 566 |
Large Enterprise | 3180 |
Databricks stands out for its scalability, ease of use, and powerful integration with Spark, multiple languages, and leading cloud services like Azure and AWS. It provides tools such as the Notebook for collaboration, Delta Lake for efficient data management, and Unity Catalog for data governance. While enhancing data engineering and machine learning workflows, it faces challenges in visualization and third-party integration, with pricing and user interface navigation being common concerns. Despite needing improvements in connectivity and documentation, it remains popular for tasks like real-time processing and data pipeline management.
What features make Databricks unique?In the tech industry, Databricks empowers teams to perform comprehensive data analytics, enabling them to conduct extensive ETL operations, run predictive modeling, and prepare data for SparkML. In retail, it supports real-time data processing and batch streaming, aiding in better decision-making. Enterprises across sectors leverage its capabilities for creating secure APIs and managing data lakes effectively.
Databricks was previously known as Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash.
Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
Author info | Rating | Review Summary |
---|---|---|
Data Engineer at a engineering company with 1,001-5,000 employees | 4.0 | As a data engineer at Fractal, I frequently use Databricks within Azure for efficient ED and Synapse integration. Databricks' extensive course offerings and visualization tool compatibility are valuable, though cluster failures remain a significant challenge despite efforts to resolve them. |
Data Platform Architect at KELLANOVA | 3.5 | I am currently working as an IT architect, using Databricks on our AWS analytics platform to enhance our AI/ML projects. It offers valuable features like a Unified Catalog and serverless computing, but we have yet to see a definitive ROI. |
Senior Data Engineer at a logistics company with 51-200 employees | 3.5 | I handle data ingestion and create warehouses with Databricks, utilizing it for analytics support. The all-in-one approach is beneficial, but maintaining infrastructure and managing costs is challenging for our startup. Using Microsoft Azure enhances our operations. |
Head CEO at bizmetric | 4.5 | I use Databricks for data engineering and machine learning, appreciating its Unity Catalog and MLflow features. However, improvements in the Databricks File System are needed. Overall, Databricks is cost-effective and offers a convenient setup with Spark clusters. |
Solution Architect at Mercedes-Benz AG | 4.5 | We use Databricks on Azure for collaboration and cost efficiency, benefiting from features like notebooks and automatic scheduling. However, API and model deployment could improve. For simple data tasks, ADF might be more cost-effective, but Databricks excels in analytics. |
Data Engineer at CRAFT Tech | 4.0 | I use Databricks primarily to build data lakehouses for clients. Its Delta Lake and Unity Catalog are valuable features, aiding in data governance and integration. Improvements are needed in dashboards, but their new IBI dashboard shows promise. |
Senior Data Engineer at Shell | 4.5 | I use Databricks for data transformations and analytics, leveraging Delta Lake and Delta Life tables for ETL processes. The platform offers valuable features like a user-friendly interface and AI functionalities, although debugging in Delta Live Table needs improvement. |
Senior Data Engineer at a computer software company with 1,001-5,000 employees | 4.5 | I build data pipelines with Azure Data Factory and Databricks, leveraging its integration with Azure and Unity Catalog for governance. Though performance needs improvement, Databricks excels in data science and machine learning, surpassing our previous use of Informatica. |