Senior Staff Site Reliability Engineer at a tech vendor with 501-1,000 employees
Real User
Top 10
Dec 10, 2025
Sysdig Monitor has become essential for overseeing a vast array of hosts and EC2 instances across our environment. We initially tried Grafana, but it fell short in operational capabilities. Managing multiple instances of a self-hosted Grafana setup led to operational overload and high S3 costs. We needed a managed solution with robust host monitoring, and Sysdig Monitor delivered just that. Since our infrastructure was based on Prometheus, it was crucial that the new tool support Prometheus metrics to simplify our transition. Sysdig Monitor fit the bill perfectly, making our migration and dashboard setup seamless. We operate at a substantial scale, managing between 1,000 and 1,500 nodes. We started using Sysdig Monitor for host-based monitoring and have since expanded to track application-specific metrics by having applications expose their own custom metrics. This capability has been incredibly useful, and the anomaly detection and Cost Advisor tools have helped us pinpoint excessive spending and resource overuse.
During my undergraduate studies, I investigated how the frequency or order of actions within a specific system triggered events on our website. To do this, I used Sysdig Monitor to track the health of my containerized environment, where it essentially acted as a central monitoring station for the containers.
Container Monitoring provides a consistent system for observing and managing containerized applications, giving enterprises enhanced scalability and performance. Reliability and security are critical in application environments, and Container Monitoring supports both by offering comprehensive visibility into containerized workloads. The solution integrates seamlessly with container orchestration platforms like Kubernetes, detecting unusual activities and providing real-time...