We have different use cases. Our banking use case uses machine learning to identify customer life events and recommend the best-suited card products. These machine-learning models are deployed in our environment, where they run on a scheduled basis. We rely on the platform for every data science modeling. We have 170 models in production. It is very helpful for production and evaluation.
What is our primary use case?
What is most valuable?
I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don't interfere with each other. The deployment of machine learning is fast and easy to manage. Its API calls are also fast.
What needs improvement?
The tool's MLOps is not good. It's pricing also needs to improve.
What do I think about the stability of the solution?
The product is stable.
What do I think about the scalability of the solution?
The tool is scalable and works on a cluster and node system.
How was the initial setup?
If you don't configure CDSW well, then it might be not useful for you. Deploying the tool can vary in complexity, but most of the time, it's relatively simple and straightforward. Triggering a job from data to production is easy, as the platform automates the deployment process. However, ensuring optimal resource allocation is essential for smooth operations.
What's my experience with pricing, setup cost, and licensing?
The product is expensive.
What other advice do I have?
We can integrate the product using APIs to connect with other systems. Additionally, integration with GitLab is straightforward, as it offers features to connect with external systems seamlessly. The product has enterprise customers and focuses on stability and performance. I rate it six out of ten.
Snowflake has a lot of good things compared to the product. CDSW is on-prem, while Snowflake is on the cloud. Snowflake is better in terms of data lineage, data catalog, and pricing.
Which deployment model are you using for this solution?
On-premises

