What is our primary use case?
I use the solution for logging, defining alerts, and monitoring. It is mainly used by our company's Java and Python logging teams. We also use it for SLA monitoring of database request processing. For our dev environment, we use Google-provided dashboards to monitor service performance.
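As an illustration of the Python logging use case, here is a minimal sketch of a service writing one JSON object per log line so that a log collector can pick up the severity and message as structured fields. It uses only the standard library. The names `JsonFormatter`, `build_logger`, and `db-requests` are hypothetical, and the assumption that the collector reads the `severity` and `message` keys should be checked against the agent configuration in use; this is not our team's actual setup.

```python
import io
import json
import logging
import sys


class JsonFormatter(logging.Formatter):
    """Render each log record as a single-line JSON object."""

    def format(self, record: logging.LogRecord) -> str:
        entry = {
            # Assumption: the collector maps "severity" and "message"
            # to the structured log entry's fields.
            "severity": record.levelname,
            "message": record.getMessage(),
            "logger": record.name,
        }
        return json.dumps(entry)


def build_logger(name: str, stream) -> logging.Logger:
    """Return a logger that writes JSON lines to the given stream."""
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    logger.handlers.clear()   # avoid duplicate handlers on repeated calls
    logger.propagate = False  # keep output out of the root logger
    handler = logging.StreamHandler(stream)
    handler.setFormatter(JsonFormatter())
    logger.addHandler(handler)
    return logger


if __name__ == "__main__":
    # Hypothetical logger name and message, for illustration only.
    log = build_logger("db-requests", sys.stdout)
    log.warning("request exceeded SLA threshold")
```

Writing to standard output keeps the application independent of any particular client library; the collection side decides where the lines end up.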
How has it helped my organization?
We implemented several changes in our log analytics pattern, including the storage and retention process. Before implementing the solution, our company could retain only one year of data; we have since reached the three-year mark. We have seen a reduction of around 15-20% in our total analytics consumption.
This result was possible because the solution allowed us to create a dashboard showing factors like storage costs, the proportion of logs, and whether logs reside in a storage bucket or BigQuery. Previously, all logs were kept in raw storage; now we can move logs into a table bucket, which contributes to cost savings.
It has default integration with all GCP services. The recently added Managed Prometheus support gives organizations more flexibility to stay connected with their existing Prometheus setup. We leveraged the integrated FinOps Hub for recommendations on our workloads and server configurations, which helped us a lot in optimizing TCO.
What is most valuable?
Logging is the most valuable feature of the solution for our company.
Monitoring with default metrics is also a good part of it, as it comes with predefined monitoring dashboards.
Trace, Debugger, and Profiler are good services for troubleshooting code, finding latencies, and identifying problematic code in the cloud; however, we have had no incidents that required us to use these capabilities.
What needs improvement?
If errors were caught early in the interface, they would be easier for users to manage. The log analytics process can be improved. Its integrations with Cloud Storage, Pub/Sub, and other webhooks give more integration options while using it.
For how long have I used the solution?
I have been using the solution for 6+ years.
What do I think about the stability of the solution?
Our company's resources are distributed across multiple regions, and on a few occasions, when we need to find logging data from the US region, it is very time-consuming.
What do I think about the scalability of the solution?
The solution is scalable. Our environment on Google Cloud's operations suite covers 150+ servers. Four people from the operations team and four from the development team use the solution in our organization.
How are customer service and support?
We have had no incident that required us to interact with the Google customer support team.
How would you rate customer service and support?
Which solution did I use previously and why did I switch?
Our company was using a different tool for monitoring purposes. We moved to Google Cloud's operations suite, and we are now preparing the monitoring dashboards.
How was the initial setup?
The initial setup of the product is very simple. We selected a project and started working on it using Google Cloud's operations suite. The solution made it easy to log and monitor data, but to create the log router we used a Terraform script. We created a bucket to route the major logs, and we specifically used BigQuery for the billing logs.
The entire deployment process took just half an hour, and the solution has been very easy to maintain since then.
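The routing described above (major logs to a log bucket, billing logs to BigQuery) can be sketched as two sink definitions. This is an illustration in Python rather than our Terraform script: `build_sinks`, the sink names, all project, bucket, and dataset identifiers, and the billing filter are hypothetical placeholders. Only the two destination string formats follow the documented log sink destination conventions.

```python
def build_sinks(project_id: str, location: str, bucket_id: str,
                dataset_id: str, billing_filter: str) -> list[dict]:
    """Build two sink definitions: billing logs to BigQuery, the rest to a log bucket.

    billing_filter is supplied by the caller; what identifies "billing logs"
    depends on the environment and is an assumption here.
    """
    bucket_destination = (
        f"logging.googleapis.com/projects/{project_id}"
        f"/locations/{location}/buckets/{bucket_id}"
    )
    bigquery_destination = (
        f"bigquery.googleapis.com/projects/{project_id}/datasets/{dataset_id}"
    )
    return [
        {
            "name": "major-logs-to-bucket",  # hypothetical sink name
            "destination": bucket_destination,
            # Everything that is not a billing log goes to the log bucket.
            "filter": f"NOT ({billing_filter})",
        },
        {
            "name": "billing-logs-to-bigquery",  # hypothetical sink name
            "destination": bigquery_destination,
            "filter": billing_filter,
        },
    ]


if __name__ == "__main__":
    # All identifiers and the filter below are placeholders.
    sinks = build_sinks(
        project_id="example-project",
        location="global",
        bucket_id="major-logs",
        dataset_id="billing_logs",
        billing_filter='labels.team="billing"',
    )
    for sink in sinks:
        print(sink["name"], "->", sink["destination"])
```

Keeping the billing filter in one place and negating it for the bucket sink means the two routes cannot overlap, which is one simple way to avoid storing the same entries twice.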
What about the implementation team?
We implemented it with our in-house team, which has basic experience with integration.
What was our ROI?
Overall, the ROI with Google Cloud's operations suite is lower than with other monitoring suites on Google Cloud.
What's my experience with pricing, setup cost, and licensing?
As a Google product, the operations suite effectively comes at zero setup cost. To manage on-premises logs, there is a negligible cost for using the Ops Agent, plus the network cost of transferring the data. Data storage cost is much lower than Cloud Storage bucket pricing, and there is some free quota. Using Managed Prometheus involves an enterprise license cost.
Which other solutions did I evaluate?
Before choosing the solution, we evaluated the merits and demerits of competitor products such as Datadog, Dynatrace, Splunk, New Relic, and many more. We explored six tools in terms of integration availability, reliability, and downtime because we use multiple NoSQL databases, such as MongoDB, Bigtable, and Firestore.
What other advice do I have?
The Ops Agent and logging transport feature of the solution have had a major impact on improving application performance. The solution also allows the transport of logs into log buckets, which is highly useful for future purposes.
Google Cloud's operations suite also caters to log analytics and helps with log export at the organization, folder, or project level. Another vital feature of the solution is the drag-and-drop query feature in the interface.
The error reporting and diagnostics tools have helped address application issues through JSON. The solution allows drag-and-drop between different layers to identify the exact error. The solution already provides error color coding.
The solution can be easily integrated with third-party tools. For instance, if I have a log that I exported to BigQuery, I can obtain meaningful insights from it effortlessly. We have seen error logging and caching function properly in the solution.
I would definitely recommend Google Cloud's operations suite to others, just as our company suggests it to clients or customers. Some competitor products with on-premises versions provide their own cloud, but there is an additional log-exporting cost. This is why our company always suggests that customers use Google Cloud's operations suite.
I would rate the solution as 9 out of 10.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Google