What is our primary use case?
Our main use case was to monitor an entire infrastructure as well as the application tech stack which we were having in a cloud environment. We had application telemetry as well, like monitoring traces, metrics, and logs. On the infrastructure side, we also monitored their metrics.
Specifically with Chronosphere, we integrated OpenTelemetry as a collector on a centralized platform and used that to divert all the telemetry logs coming from our infrastructure as well as our application into Chronosphere where we have the dashboarding done. It is best that we have alerting integrated directly into Chronosphere. This means we don't need to set up a third party alerting mechanism. Chronosphere helped us in setting up a detailed dashboard with proper filtration of the data which is coming as well as custom filtering of data traces. We set up an alert for each and every metric and trace which we wanted to monitor, which is very critical at the production level. We were based on an e-commerce platform so each and every order failure or something similar would be very crucial to us to get notified. Chronosphere helped us in setting up those alerts very quickly on a single platform instead of hopping around multiple platforms.
We integrated an entire monitoring setup. We were getting the data from OpenTelemetry collector into Chronosphere, wherein we had a centralized collector picking up the metrics and traces at different endpoints. We had dashboards created and alerting mechanisms also integrated into that, which was directly integrated into some notification mechanisms like Slack or other tools.
Specifically, we had EKS monitoring as well. We were more into a data related environment and had a big data lake setup. Chronosphere helped us in having custom automatic instrumentations which helped us to instrument our application easily and have a proper detailed monitoring setup using Chronosphere which helped us in multiple scenarios. The support team was very good for us. They had a dedicated person assigned to us who would help us very thoroughly in terms of having all the issues sorted out. We had pretty good options of monitoring multiple applications using different methods. One method is using OpenTelemetry, and we also used multiple Lake view setups. This was very useful.
What is most valuable?
There are multiple features which Chronosphere offers that I liked. A couple of features, for example, are the integration of alerting and dashboards into a single platform wherein you can easily create a dashboard and set up monitoring for that particular dashboard. You can set up particular traces of metrics based on the criticality as well as the threshold values and also custom filtration of your metrics and traces and telemetry logs. The severity and automated grouping of your coming logs based on the keyword present in it is very helpful. For example, if it is an error, it is automatically grouped. If it is some particular value we are looking at, that also helps us consolidate them. Dashboarding is very easy and the user interface is very user friendly.
What needs improvement?
We can improve a bit of UI aspects. The UI could be made more user friendly. Sometimes when identifying the specific logs patterns and identifying what metrics and what logs are coming in, going to a specific log explorer and finding it there is a little difficult. It would be very useful if we could group according to projects and have that UI a little more user friendly.
The user interface part was a bit confusing in the beginning. To make it better, I believe we would need some more open sourced or freely available courses on Chronosphere which would help us understand the platform a bit more. The team provides detailed walkthroughs whenever you get into that. However, it would be better if we could have proper video sessions or documentation which would help us understand the tool a bit more.
For how long have I used the solution?
I have been using Chronosphere for around one and a half years.
What do I think about the stability of the solution?
Chronosphere is very stable.
What do I think about the scalability of the solution?
Scalability was not an issue for us. We have it on the public cloud setup, so scalability was never a concern.
How are customer service and support?
The support team was very good for us. They had a dedicated person assigned to us who would help us very thoroughly in terms of having all the issues sorted out.
Before Chronosphere, we were only handling dedicated open source alerting and monitoring mechanisms. When we moved into Chronosphere, one particular thing was that we had a beautiful support team from Chronosphere which helped us to solve a lot of our problems when we were not able to do it on the open source platforms. Integrating Chronosphere was pretty much easy coming from an open source tool and it helped us to streamline our monitoring and alerting setup across our organization, which directly impacted on the streamlining of the process as well as reducing errors and also keeping our environment uptime to a greater extent by those alerts and quick responses.
We have a dedicated team of people who are helping us from setup as well as debugging errors. Anytime you reach out to them, they help us a lot.
I would say you can accept jumping into Chronosphere if you're using an open source tool and looking to a paid solution, because it offers detailed customer support as well as a ton of features which would really help streamline your existing environment and process. It would be a great option if you're looking to make a change.
Which solution did I use previously and why did I switch?
Previously we were using open source solutions like Grafana and Prometheus. The only reason was that we would have to open between too many tools for a proper detailed monitoring setup. Chronosphere helped us better. We had to introduce separate agents to pick up the metrics from applications. Logs were coming through different applications as data sources and then added into Grafana. Creating a dashboard and alerting from there was quite another tool. Grafana was the only tool we had. Once Chronosphere came in, all got consolidated.
How was the initial setup?
I cannot be specific on those terms. The initial setup did take time, but once the setup was completed and we had all the environments up and running, the overall time really got saved because we had all the templates ready. We pulled in a template and created a dashboard as and when needed. We onboarded all the applications onto Chronosphere where we could monitor everything properly. There was a bit of time saved. In terms of employees, the initial setup took some time with some group of people. However, when things got easier or things got moving, we had very few employees involved into that. We were able to handle it with fewer employees.
What's my experience with pricing, setup cost, and licensing?
It was pretty decent and competitive compared to the market. I didn't feel it was on the expensive side. The features and functionalities it provides, I feel the pricing is right on point.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?