

Datadog and Grafana are key players in the monitoring and observability category. Datadog seems to have the upper hand in comprehensive monitoring capabilities, while Grafana excels in data visualization and customization.
Features: Datadog offers extensive functionalities including hosted infrastructure, intuitive tagging, real-time monitoring, and a rich ecosystem of integrations suitable for diverse environments. Key features include sharable dashboards, anomaly detection, and seamless integration with AWS. Grafana is valued for its robust data visualization and impressive customizability, along with integration capabilities. As an open-source tool, it supports capacity planning and provides a rich graphical interface.
Room for Improvement: Datadog users highlight the need for improved querying and more granular control over dashboard sharing. There are also concerns about the high cost and pricing complexity. Enhancements in front-end integration capabilities are also sought. Grafana users seek better data aggregation capabilities, enhanced documentation, and improved performance for large datasets. Expanded reporting and alerting functions are also desired.
Ease of Deployment and Customer Service: Both products support deployment in various environments. Datadog is favored in public and hybrid cloud environments for its comprehensive monitoring solutions, whereas Grafana is preferred in on-premises setups. Datadog offers good customer service, though some mention variability in support quality. Grafana relies on community-driven support, lacking real-time support compared to Datadog’s premium services.
Pricing and ROI: Datadog's pricing is considered complex and high, particularly with extensive data ingestion and custom metrics. Users benefit from tools that help manage usage and costs, yet it's seen as a significant investment. Grafana, as an open-source tool, provides cost benefits, with many users effectively leveraging its free version. The enterprise edition incurs costs but remains generally more affordable. Datadog delivers ROI through operational efficiency and quick issue resolution while Grafana offers cost-efficiency and visualization capabilities without high expenses.
Previously we had thirteen contractors doing the monitoring for us, which is now reduced to only five.
Datadog has delivered more than its value through reduced downtime, faster recovery, and infrastructure optimization.
We have also seen fewer escalations for minor issues because alerts help us catch problems earlier, which indirectly reduces downtime and improves overall efficiency.
I identified over-provisioned servers and reduced my AWS monthly bill by 15%, which is a significant saving in terms of costs.
When I have additional questions, the ticket is updated with actual recommendations or suggestions pointing me in the correct direction.
Overall, the entire Datadog comprehensive experience of support, onboarding, getting everything in there, and having a good line of feedback has been exceptional.
I've had a couple instances where I reached out to Datadog's support team, and they have been really super helpful and very kind, even reaching back out after resolving my issues to check if everything's going well.
The technical support team is very helpful with complex PromQL troubleshooting.
My advice for people who are new to Grafana or considering it is to reach out to the community mainly, as that's the primary benefit of Grafana.
I do not use Grafana's support for technical issues because I have found solutions on Stack Overflow and ChatGPT helps me as well.
Datadog's scalability has been great as it has been able to grow with our needs.
Since it is a SaaS platform, we did not have to worry about backend scaling.
We have not faced any major performance issues from the platform side; it handles increased metrics and monitoring loads smoothly.
It is highly scalable and built on a big data architecture capable of ingesting trillions of data points.
In terms of our company, the infrastructure is using two availability zones in AWS.
In assessing Grafana's scalability, we started noticing logs missing or metrics not syncing in time.
Metrics collection and alerting have been consistent in day-to-day use.
Datadog is very stable, as there hasn't been any downtime or issues since I've been here, and it's always on time.
Datadog seems stable in my experience without any downtime or reliability issues.
When something in their dashboard does not work, because it is open source, I am able to find all the relative combinations that people are having, making it much easier for me to fix.
Once you get to a higher load, you need to re-evaluate your architecture and put that into account.
Even when handling millions of data points, the visualization layer remains responsive.
It would be great to see stronger AI-driven anomaly detection and predictive analytics to help identify potential issues before they impact performance.
We want to be able to customize the cost part, and we would appreciate more granular access control.
Having more transparent and granular cost control features would make it easier to manage usage.
It would be better if they made the technology easy to use without needing to read extensive documentation.
Grafana cannot be easily embedded into certain applications and offers limited customization options for graphs.
I would want to see improvements, especially in the tracing part, where following different requests between different services could be more powerful.
The setup cost for Datadog is more than $100.
Pricing is mainly based on data ingestion, such as logs, metrics, and traces, and it can increase quickly if everything is enabled by default.
Everybody wants the agent installed, but we only have so many dollars to spread across, so it's been difficult for me to prioritize who will benefit from Datadog at this time.
In an enterprise setting, pricing is reasonable, as many customers use it.
The costs associated with using Grafana are somewhere in the ten thousands because we are able to control the logs in a more efficient way to reduce it.
I purchased my Grafana Cloud subscription through the AWS Marketplace, which simplified my procurement process and allowed me to apply the cost towards my AWS committed spend.
Our architecture is written in several languages, and one area where Datadog particularly shines is in providing first-class support for a multitude of programming languages.
Having all that associated analytics helps me in troubleshooting by not having to bounce around to other tools, which saves me a lot of time.
Datadog was able to find the alerts and trigger to notify our team in a very prompt manner before it got worse, allowing us to promptly adjust and remediate the situation in time.
Users can monitor metrics with greater ease, and the tool aids in quickly identifying issues by providing a visual representation of data.
The fact that I can join data from my SQL database with metrics from Prometheus in the same table is a feature I have not found performed as well elsewhere.
You can check those metrics in the incident management tool by filtering the alert source as Grafana, and it helps in reducing production incidents because you can acknowledge and visualize the metrics from Grafana on time.
| Product | Mindshare (%) |
|---|---|
| Datadog | 4.7% |
| Grafana | 2.7% |
| Other | 92.6% |


| Company Size | Count |
|---|---|
| Small Business | 82 |
| Midsize Enterprise | 47 |
| Large Enterprise | 100 |
| Company Size | Count |
|---|---|
| Small Business | 13 |
| Midsize Enterprise | 10 |
| Large Enterprise | 25 |
Datadog integrates extensive monitoring solutions with features like customizable dashboards and real-time alerting, supporting efficient system management. Its seamless integration capabilities with tools like AWS and Slack make it a critical part of cloud infrastructure monitoring.
Datadog offers centralized logging and monitoring, making troubleshooting fast and efficient. It facilitates performance tracking in cloud environments such as AWS and Azure, utilizing tools like EC2 and APM for service management. Custom metrics and alerts improve the ability to respond to issues swiftly, while real-time tools enhance system responsiveness. However, users express the need for improved query performance, a more intuitive UI, and increased integration capabilities. Concerns about the pricing model's complexity have led to calls for greater transparency and control, and additional advanced customization options are sought. Datadog's implementation requires attention to these aspects, with enhanced documentation and onboarding recommended to reduce the learning curve.
What are Datadog's Key Features?In industries like finance and technology, Datadog is implemented for its monitoring capabilities across cloud architectures. Its ability to aggregate logs and provide a unified view enhances reliability in environments demanding high performance. By leveraging real-time insights and integration with platforms like AWS and Azure, organizations in these sectors efficiently manage their cloud infrastructures, ensuring optimal performance and proactive issue resolution.
Grafana offers a customizable, user-friendly platform for robust data visualization and integration, enhancing real-time monitoring with extensive alerting and collaboration capabilities supported by an active open-source community.
Grafana stands out for its flexible dashboards and robust visualization options, integrating smoothly with tools like Prometheus. This open-source platform supports diverse environments, aiding in the visualization of IT infrastructure and business analytics. Its alerting system efficiently supports real-time monitoring. While it is praised for its community backing and cost-effectiveness, there is demand for better data aggregation, intuitive interfaces, and enhanced documentation compared to competitors such as Splunk. Simplification of configuration and the interface is sought, alongside improvements in machine learning and reporting features.
What are Grafana's most important features?Grafana is implemented widely across industries for monitoring IT infrastructure and visualizing business analytics. Companies utilize it to analyze server performance or monitor Kubernetes environments and payment transactions. The platform integrates with AWS services and other data sources to ensure observability and system health tracking, focusing on performance metrics through customized dashboards and alerts. Organizations employ Grafana to bolster observability and optimize infrastructure through robust data insights.
We monitor all Application Performance Monitoring (APM) and Observability reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.