What is our primary use case?
Our primary use case for Azure Synapse is as a data warehouse and for creating data pipelines. It lets us log in to one central workspace and manage our databases and the entire warehouse from there. We can also embed business intelligence (BI) using Power BI, which lets us show visualizations in the same central place.
What needs improvement?
I think performance could be improved, and it would be nice if they offered a service that decouples compute from storage. But I don't think that is possible, as it would be a fundamental change in the underlying architecture, and Microsoft won't make that decision easily.
For how long have I used the solution?
We have been using Microsoft Azure technology for the last 4-5 years, including Synapse.
What do I think about the stability of the solution?
As Synapse is hosted on the Azure cloud, it's very stable.
What do I think about the scalability of the solution?
The Azure Synapse service is highly scalable.
How was the initial setup?
The initial setup is very straightforward. Azure Synapse gives you one workspace where you can do everything: create your data warehouse, build ETL pipelines using Azure Data Factory, create storage and data marts, and use Power BI for visualization.
Before Synapse was available, all of these were offered as separate services, and that is how a data warehouse was constructed. Synapse is one layer on top of them: you use a single workspace to initiate and manage the entire set of services you need to create and manage your data platform. That includes data warehouses and marts using SQL Warehouse, ETL pipelines using Azure Data Factory, and a data lake using Azure Blob Storage. It also offers serverless SQL, meaning you can run queries without having to start a SQL database or SQL data warehouse instance, and Spark compute to process unstructured data.
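To give a rough idea of what a serverless SQL query looks like, here is a minimal Python sketch that builds one. It assumes Parquet files already sit in the data lake and uses the T-SQL OPENROWSET function to read them in place. The function name, the storage account, the container, and the folder path are all hypothetical and only there for illustration.

```python
def build_serverless_query(storage_url: str, limit: int = 10) -> str:
    """Return a T-SQL query that reads Parquet files in place via OPENROWSET."""
    if limit < 1:
        raise ValueError("limit must be a positive integer")
    # Double any single quotes so the URL stays a valid T-SQL string literal.
    safe_url = storage_url.replace("'", "''")
    return (
        f"SELECT TOP {limit} *\n"
        "FROM OPENROWSET(\n"
        f"    BULK '{safe_url}',\n"
        "    FORMAT = 'PARQUET'\n"
        ") AS [result];"
    )


# Hypothetical storage account, container, and folder (assumptions, not real resources).
EXAMPLE_URL = "https://examplelake.dfs.core.windows.net/sales/2023/*.parquet"

if __name__ == "__main__":
    print(build_serverless_query(EXAMPLE_URL, limit=5))
```

The resulting text would be run against the workspace's serverless SQL endpoint. No dedicated warehouse instance has to be started first, which is the point made above.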
What about the implementation team?
We are a Microsoft partner and have set up and built Azure Synapse-based solutions for our manufacturing, energy, and healthcare clients. We are very customer-centric and build and manage solutions based on our clients' needs. We recommend the best technology stack for them.
What was our ROI?
If I hosted a Microsoft setup on premises, I would need to invest in licensing for different tools and services: SQL Server, SSIS, SSRS, and Power BI or SSAS. Compared to that, the return on investment with Azure Synapse is very high. You get rid of your hardware and licensing and move to a subscription-based, pay-as-you-use model. Your operational costs go down and your optimization increases. Capital expenditure drops sharply as you move to an OpEx model.
Finally, overall management is simpler than on premises. This of course leads to a high ROI.
What's my experience with pricing, setup cost, and licensing?
Azure Synapse is best for people who are already invested in Microsoft technologies, in particular those who already use Microsoft data warehousing services, including SQL Server-based data warehouse technology. For them, migrating to Azure is very straightforward and adopting Synapse is easy.
With Azure Synapse, there is no database installation, no licensing cost, and no hardware setup. Everything is available as a cloud service, and you pay only for what you use.
With regard to pricing, as I said, you pay for what you use. The amount of data you store and the compute power you use determine your pricing. If I use Azure Blob Storage, the pricing depends on how much I store. If I use Azure Data Factory, the pricing depends on how much data I process through the ETL pipelines, and so on.
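As a back-of-the-envelope illustration of the pay-for-what-you-use idea, here is a small Python sketch that adds up a monthly bill from usage. All the rates are made-up placeholders, not actual Azure prices, and the three-part breakdown (storage, pipelines, compute) is a simplification of the description above rather than Microsoft's real billing formula.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rates:
    """Placeholder unit prices (assumptions, NOT real Azure rates)."""
    storage_per_gb_month: float
    pipeline_per_gb_processed: float
    compute_per_hour: float


def estimate_monthly_cost(stored_gb: float, processed_gb: float,
                          compute_hours: float, rates: Rates) -> dict:
    """Return a per-component and total cost estimate for one month."""
    storage = stored_gb * rates.storage_per_gb_month
    pipelines = processed_gb * rates.pipeline_per_gb_processed
    compute = compute_hours * rates.compute_per_hour
    return {
        "storage": round(storage, 2),
        "pipelines": round(pipelines, 2),
        "compute": round(compute, 2),
        "total": round(storage + pipelines + compute, 2),
    }


if __name__ == "__main__":
    # Hypothetical rates and usage figures, chosen only to show the arithmetic.
    example_rates = Rates(storage_per_gb_month=0.02,
                          pipeline_per_gb_processed=0.25,
                          compute_per_hour=1.50)
    print(estimate_monthly_cost(500, 200, 40, example_rates))
```

If usage drops to zero in a given month, every component drops to zero with it, which is the main difference from an on-premises license.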
Which other solutions did I evaluate?
For some customers we recommend Snowflake, for others Azure Synapse or Google BigQuery. In one of our cases we are building a solution with both Azure Synapse and Snowflake.
What other advice do I have?
We have approximately 20-25 team members with knowledge of Azure Synapse service capabilities.
With regard to deployment and maintenance, an Azure Synapse-based solution may need anywhere between 3 and 15 people. This depends on what type of warehouse and analytics you want to create and on the number of reports and visualizations. Typically, a small team would be 3-4 people and a large team would be around 12-14 members.
Which deployment model are you using for this solution?
Private Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner