We use it to store editorial content.
We started out on the on-premise version, then moved to the AWS version.
We use it to store editorial content.
We started out on the on-premise version, then moved to the AWS version.
I don't have to worry about upgrades with the AWS version.
The on-premise version is very difficult to upgrade.
We run the agent in AWS.
It has empowered all our platform engineers with a very powerful and easy to use monitoring system. Most of our platform organization is now involved in monitoring. Previously, only a handful of platform engineers were involved, because Graphite and Sensu were so cumbersome to use.
It is incredibly easy to do common monitoring actions:
Very rarely. Maybe only once or twice that we noticed. It is very reliable.
No.
It is excellent. The web app has a real-time support chat window in which a support engineer is chatting with you within a minute. That is the "right" way to do support.
We previously ran Graphite and Sensu ourselves. By moving to Datadog, we did not need to manage our own monitoring infrastructure anymore. Graphite was somewhat complex to run.
Initial setup is easy. Install the agent and send it metrics. There are StatsD/Datadog libraries available for most languages.
Pricing seems reasonable. It depends on the size of your organization, the size of your infrastructure, and what portion of your overall business costs go toward infrastructure. It is hard to say without looking at all of this.
We looked at several competitors at the time (Summer 2016). There did not seem to be any compelling alternatives. Once we did the PoC with Datadog, we loved it and decided to move forward.
Try it out and see if you like it.
We can build dashboards as fast we roll out new systems, which can be fast.
We use standard and custom metrics for every new system we roll out for 360 degree visibility into our systems.
The most valuable features have been: Sharable dashboards, TimeBoards, dogstatsd API, Slack Integration, Event logging API. CloudTrail Events, Tags, alerts, and anomaly detection. EBS Volume Snapshot Age, which they added upon request. We used PagerDuty integration for a while as well.
More granular control over dashboard sharing. Timeboard sharing.
There are infrequent hiccups, which have been decreasing over the time we have used it.
No.
Customer Service:
Never seen better. Questions answered usually almost immediately, even on weekends. An in-stream with your event stream.
Technical Support:
High.
Overall they have always had an amazing team, and quality has been maintained as the company has grown.
Complementary to other tools we used.
Setup is generally easy. They provide an large number of integrations, some are more complex than others, which is to be expected.
In house implementation.
We didn’t calculate explicitly, but as we used the product to track down underutilized instances, it more than paid for itself in the first month.
Pricing overall in this segment has standardized in the last several years.
A few, including Zabbix and Icinga.
One of the fastest and most flexible tools we have used in this area..
We were building a real-time bidding exchange for digital out-of-home ads by providing the analytics and the infrastructure for the ecosystem. We not only facilitate the buys, but we also act as an ad server for the network and advertisers who put in their server requirements. It is very similar to the large online ad servers. In order to provide this real-time service, we had to be able to monitor and analyze a wide range of data points for the many media companies in their ad exchange.
We wanted to find an “out-of-the-box” metrics solution that would integrate easily with current systems. We found that Datadog included the integrations that we needed to get our monitoring solution up quickly. Additionally, we liked Datadog’s ability to perform customized metric monitoring, log critical events, and scale easily. I had scaled up open-source monitoring solutions once before, and it wasn’t fun. So when I had the opportunity to do it again, I said ‘No.’
On a daily basis, Datadog eliminates a lot of the back and forth with the media owners to find out what is going on, it is just a good visual tool for seeing the activity from each of the content management systems that we work with. It is an easy way to go in and get a feel of what is going on. Without Datadog, we would have to repeatedly spend time to reach out to each media owner directly to see if they are sending any requests that day.
As we moved toward a real-time system, it was important to understand whether our partner networks had reported all of their ads in a short period of time. We now use Datadog as a daily monitoring tool to get a feel for what networks are actively sending requests to their servers, what live campaigns are running smoothly and whether the right networks are requesting ads. I always have Datadog up on a daily basis to debug issues with integrations or just to make sure that live campaigns are running smoothly.
Each network that we work with has different requirements. In terms of connectivity of their networks, we are always dealing with different frequencies and rates of requests each day. The wide diversity in partners has made customization key since the monitoring needs for each partner varies drastically. There are days where we see weird trends of requests coming in. We are able to use Datadog’s custom graphs to catch the anomalies or concern areas in their system on a real-time basis.
Going forward, we are looking to create separate dashboards for each ad network that we work with. This will enable us to gain more detailed metrics for each individual media owner. We will be able to take these detailed metrics and use them to quickly identify potential problem areas, improving our ability to solve problems as soon as they are detected.
The solution is primarily used for better understanding the health of applications, modern environments, and many other solutions, which are the main focus of Datadog and many other monitoring tools.
With Datadog specifically, I can look at the health of the technology stack and services, and also integrate multiple metric sources, security, business data, and much more. This makes it a real software solution for centralizing data and unifying monitoring silos in one place. Datadog is like a hub - not just a monitoring software.
The solution primarily has helped the organization by helping us better understand the health of applications, modern environments, et cetera.
We can see the health of the technology stack and services. We can also integrate multiple metric sources, security, business data, and much more. It centralizes data and unifies monitoring in one place. It's Helping reduce costs with other solutions, and also reduces costs with teams that might waste time with manual troubleshooting.
Understanding better the health of applications, modern environments, and many other solutions, is the main focus of Datadog and many other monitoring tools.
With Datadog I can look at the health of the technology stack and services. I can also integrate multiple metric sources, security, business data, and much more. It's great for centralizing data and unifying monitoring silos.
Datadog is a hub, not just a monitoring software. The biggest value of Datadog is looking at the big picture, not only one part of it.
Datadog could have a better business analysis module.
Other vendors have specific business collections and analyses. With Datadog, I don't see much of it. It's possible to do this with custom metrics, dashboards, etc. However, none of those are business-focused - and that is what is lacking in Datadog.
I've used the solution for two years.
We use the SaaS version of the product.
We are providing managed services to our customers across multiple industries. Datadog is key to delivering these services by bringing the observability, monitoring, and alerting capabilities we need to operate at scale.
We operate custom cloud native workloads as well as ISV products such as Atlassian Jira or Confluence.
Integrating Synthetics, infrastructure, and application performance monitoring, as well as piping all logs through Datadog allows us to operate more with less with good alerting right in time.
We are providing managed services to our customers across multiple industries.
Datadog is key to delivering these services. It brings in observability, monitoring, and alerting capabilities - all of which we need to operate at scale.
We operate custom cloud native workloads as well as ISV products such as Atlassian Jira or Confluence.
Integrating Synthetics, infrastructure, and application performance monitoring, as well as piping all logs through Datadog, help with getting alerts in real-time.
We are providing managed services to our customers across multiple industries.
Datadog delivers observability, monitoring, and alerting capabilities we need to operate at scale.
Operating custom cloud native workloads as well as ISV products such as Atlassian Jira or Confluence is also something we do. Integrating Synthetics, infrastructure, and application performance monitoring, as well as piping all logs through Datadog allows us to operate while grabbing alerts in real-time.
The current way accounts are billed could be vastly improved - especially when involving multiple organizations across multiple accounts in combination with reserved commitments.
Being able to have an automatic materialized report on certain dashboards that could be exported as PDF to be shared with non-Datadog users could help a lot.
Other than that, we are more than happy with the features we use regularly.
We have been using Datadog since 2015.
We primarily use the solution for monitoring applications and informing customers via Pagerduty and Statuspage. The monitoring and alerts can be personalized internally, and we are able to find problems and issues. The response time monitor has been great, and it has been validating upgrades. We can check in to see which step fails,
Previously, we had monitors scattered with different places and products, making troubleshooting harder and slower. Also, logs and monitors were on different platforms, making it harder to put the infrastructure puzzle together.
Datadog documentation on web pages has improved a lot and is pretty easy to follow and find.
Additionally, integrations with, for example, GCP, Network, component, and Software providers are much easier as everything is now centralized.
API and notification integrations are also a great benefit for our organization.
Datadog is listening actively for customer feedback and develops improvements for us effectively.
The most valuable features of the solution include the APM, log monitor, SQL monitors, network monitors, and integrations.
Alerting timing should be improved to be more fine-tuned and exact. The current problem is that monitoring is integrated with the Statuspage and the SLA.
Also, browser support for browsers other than Chrome should be added. Browser test recording is another problem, as it does not always work in normal mode. One needs to use incognito mode or a pop-up.
I've been using the solution for three years.
