Find out in this report how the two Cloud Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
For a lot of different tasks, including machine learning, it is a nice solution.
When it comes to big data processing, I prefer Databricks over other solutions.
rating the customer support at ten points out of ten
I have been self-taught and I have been able to handle all my problems alone.
Whenever we reach out, they respond promptly.
As of now, we are raising issues and they are providing solutions without any problems.
I rate the technical support as fine because they have levels of technical support available, especially partners who get really good support from Databricks on new features.
It is a 10 out of 10 in terms of scalability.
The scalability is definitely good because we are migrating to the cloud since the computers on the premises or the big database we need are no longer enough.
The patches have sometimes caused issues leading to our jobs being paused for about six hours.
Databricks is an easily scalable platform.
I would rate the scalability of this solution as very high, about nine out of ten.
They release patches that sometimes break our code.
Although it is too early to definitively state the platform's stability, we have not encountered any issues so far.
Databricks is definitely a very stable product and reliable.
Troubleshooting requires opening each pipeline individually, which is time-consuming.
In general, if I know SQL and start playing around, it will start making sense.
BigQuery is already integrating Gemini AI into the data extraction process directly in order to reduce costs.
Adjusting features like worker nodes and node utilization during cluster creation could mitigate these failures.
We prefer using a small to mid-sized cluster for many jobs to keep costs low, but this sometimes doesn't support our operations properly.
We use MLflow for managing MLOps, however, further improvement would be beneficial, especially for large language models and related tools.
Being able to optimize the queries to data is critical. Otherwise, you could spend a fortune.
The price is perceived as expensive, rated at eight out of ten in terms of costliness.
It is not a cheap solution.
It is really fast because it can process millions of rows in just a matter of one or two seconds.
BigQuery processes a substantial amount of data, whether in gigabytes or terabytes, swiftly producing desired data within one or two minutes.
The features I find most valuable in this solution are the ability to run and handle large data sets in a very efficient way with multiple types of data, relational as SQL data.
Databricks' capability to process data in parallel enhances data processing speed.
The platform allows us to leverage cloud advantages effectively, enhancing our AI and ML projects.
The Unity Catalog is for data governance, and the Delta Lake is to build the lakehouse.
Product | Market Share (%) |
---|---|
BigQuery | 8.0% |
Databricks | 8.3% |
Other | 83.7% |
Company Size | Count |
---|---|
Small Business | 11 |
Midsize Enterprise | 9 |
Large Enterprise | 20 |
Company Size | Count |
---|---|
Small Business | 25 |
Midsize Enterprise | 12 |
Large Enterprise | 56 |
BigQuery is an enterprise data warehouse that solves this problem by enabling super-fast SQL queries using the processing power of Google's infrastructure. ... You can control access to both the project and your data based on your business needs, such as giving others the ability to view or query your data.
Databricks offers a scalable, versatile platform that integrates seamlessly with Spark and multiple languages, supporting data engineering, machine learning, and analytics in a unified environment.
Databricks stands out for its scalability, ease of use, and powerful integration with Spark, multiple languages, and leading cloud services like Azure and AWS. It provides tools such as the Notebook for collaboration, Delta Lake for efficient data management, and Unity Catalog for data governance. While enhancing data engineering and machine learning workflows, it faces challenges in visualization and third-party integration, with pricing and user interface navigation being common concerns. Despite needing improvements in connectivity and documentation, it remains popular for tasks like real-time processing and data pipeline management.
What features make Databricks unique?In the tech industry, Databricks empowers teams to perform comprehensive data analytics, enabling them to conduct extensive ETL operations, run predictive modeling, and prepare data for SparkML. In retail, it supports real-time data processing and batch streaming, aiding in better decision-making. Enterprises across sectors leverage its capabilities for creating secure APIs and managing data lakes effectively.
We monitor all Cloud Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.