Databricks and IBM Watson Studio compete in data science and analytics. Databricks seems to have an advantage in performance for large-scale data processing and Spark integration, while IBM Watson Studio excels in AI capabilities and offers a comprehensive suite of data science tools.
Features: Databricks is known for its ease in running large-scale analytics, built-in optimization recommendations, and support for collaboration in notebooks. It integrates seamlessly with Spark and offers notable machine learning capabilities. The scalability and support for multiple programming languages enhance its versatility with cloud integrations. IBM Watson Studio stands out for its AI capabilities, built-in data science tools like SPSS Modeler, and use of Jupyter notebooks, making it powerful for machine learning and data integration tasks.
Room for Improvement: Databricks could improve by expanding advanced visualization and machine learning libraries, enhancing ETL tool integration, and providing more user-friendly documentation and interfaces for popular BI tools. IBM Watson Studio could enhance its intuitive user experience, ease of deployment, and support for new AI features. Users often cite navigation challenges, desiring a more integrated interface.
Ease of Deployment and Customer Service: Databricks offers versatile deployment options with support for public, private, and hybrid clouds. It provides comprehensive documentation, though technical support experiences vary. IBM Watson Studio also supports cloud and on-premises deployments. Its documentation is praised for clarity, with users reporting reliability that reduces the need for frequent technical support.
Pricing and ROI: Databricks operates on a pay-as-you-go model, seen as costly for extensive data usage but flexible and offering positive ROI by reducing infrastructure costs. IBM Watson Studio is viewed as reasonably priced for its features, although complex workloads may incur higher costs. Its pricing is generally considered straightforward, delivering beneficial ROI for large-scale data projects.
For a lot of different tasks, including machine learning, it is a nice solution.
When it comes to big data processing, I prefer Databricks over other solutions.
The product offers a significant return on investment through its scalability and integration capabilities.
My customers have seen returns on investment through increased efficiency, automated calculations, improved accuracy in pricing, and reduced staffing needs due to the automation.
Whenever we reach out, they respond promptly.
As of now, we are raising issues and they are providing solutions without any problems.
I rate the technical support as fine because they have levels of technical support available, especially partners who get really good support from Databricks on new features.
The support quality depends on the SLA or the contract terms.
The community access is weak, which limits the ability to engage in discussions and find documentation and examples of similar cases effectively.
The patches have sometimes caused issues leading to our jobs being paused for about six hours.
Databricks is an easily scalable platform.
I would rate the scalability of this solution as very high, about nine out of ten.
Watson Studio is very scalable.
I rate IBM Watson Studio seven out of ten for scalability because while it scales, it requires significant resources to do so, making it expensive compared to some competitors.
They release patches that sometimes break our code.
Although it is too early to definitively state the platform's stability, we have not encountered any issues so far.
Cluster failure is one of the biggest weaknesses I notice in our Databricks.
Expertise in optimization is necessary to manage such issues effectively.
Adjusting features like worker nodes and node utilization during cluster creation could mitigate these failures.
We prefer using a small to mid-sized cluster for many jobs to keep costs low, but this sometimes doesn't support our operations properly.
We use MLflow for managing MLOps, however, further improvement would be beneficial, especially for large language models and related tools.
The platform is associated with a complicated setup process and demands heavy hardware, making it expensive to scale.
One area that could be improved is the backup and restoration of the database and the overall database configuration.
It is not a cheap solution.
IBM Watson Studio is considered rather expensive, with a rating of six or seven.
Databricks' capability to process data in parallel enhances data processing speed.
The platform allows us to leverage cloud advantages effectively, enhancing our AI and ML projects.
The Unity Catalog is for data governance, and the Delta Lake is to build the lakehouse.
This capability saves a significant amount of time by automating processes that typically involve manual work, such as data cleaning, feature engineering, and predictive analytics.
It integrates well with other platforms and offers good scalability.
Databricks is utilized for advanced analytics, big data processing, machine learning models, ETL operations, data engineering, streaming analytics, and integrating multiple data sources.
Organizations leverage Databricks for predictive analysis, data pipelines, data science, and unifying data architectures. It is also used for consulting projects, financial reporting, and creating APIs. Industries like insurance, retail, manufacturing, and pharmaceuticals use Databricks for data management and analytics due to its user-friendly interface, built-in machine learning libraries, support for multiple programming languages, scalability, and fast processing.
What are the key features of Databricks?
What are the benefits or ROI to look for in Databricks reviews?
Databricks is implemented in insurance for risk analysis and claims processing; in retail for customer analytics and inventory management; in manufacturing for predictive maintenance and supply chain optimization; and in pharmaceuticals for drug discovery and patient data analysis. Users value its scalability, machine learning support, collaboration tools, and Delta Lake performance but seek improvements in visualization, pricing, and integration with BI tools.
IBM Watson Studio provides tools for data scientists, application developers and subject matter experts to collaboratively and easily work with data to build and train models at scale. It gives you the flexibility to build models where your data resides and deploy anywhere in a hybrid environment so you can operationalize data science faster.
We monitor all Data Science Platforms reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.