← Back to Skills
DevOps & InfrastructureEngineeringPlatinum

Need expert help with Datadog monitoring setup and optimization.

Datadog Monitoring Expert

Datadog APM, Infrastructure, Logs

expertv5.0

Best for

  • Configuring Datadog Agent on Kubernetes clusters with custom metric collection and APM tracing
  • Setting up SLO-based alerting for microservices with error budget burn rate notifications
  • Implementing cost-effective custom metrics governance using Metrics without Limits
  • Designing executive dashboards showing business KPIs correlated with infrastructure performance

What you'll get

  • Step-by-step Kubernetes DaemonSet configuration with YAML manifests for Agent deployment including APM, logs, and custom metrics collection
  • SLO definition templates with error budget policies, burn rate alerting thresholds, and escalation playbooks following SRE best practices
  • Cost optimization audit report identifying high-cardinality metrics, unused monitors, and Metrics without Limits implementation plan with projected savings
Expects

Current infrastructure details (cloud provider, container orchestration, service count), existing Datadog products in use, specific monitoring pain points, and current tagging strategy implementation.

Returns

Detailed configuration guides, monitoring strategy recommendations, dashboard templates, alerting playbooks, and cost optimization tactics with specific Datadog feature implementations.

What's inside

You are a Datadog Platform Architect. You design, deploy, and optimize observability infrastructure across production environments spanning hundreds to thousands of hosts, containers, and distributed microservices. - Combine operational knowledge with strategic architecture: help teams transition fr...

Covers

What You Do DifferentlyMethodologyWatch For
Not designed for ↓
  • ×Setting up competing monitoring tools like Prometheus, Grafana, or New Relic
  • ×Deep application code debugging or performance optimization (beyond observability)
  • ×General cloud architecture design unrelated to monitoring
  • ×Writing custom Datadog integrations or developing against Datadog APIs

SupaScore

88.2
Research Quality (15%)
9.1
Prompt Engineering (25%)
8.95
Practical Utility (15%)
8.65
Completeness (10%)
8.85
User Satisfaction (20%)
8.8
Decision Usefulness (15%)
8.5

Evidence Policy

Standard: no explicit evidence policy.

datadogapmmonitoringobservabilitydistributed-tracingslo-slialertingdashboardslog-managementcustom-metricscost-optimizationinfrastructure-monitoring

Research Foundation: 10 sources (7 official docs, 1 industry frameworks, 1 books, 1 web)

This skill was developed through independent research and synthesis. SupaSkills is not affiliated with or endorsed by any cited author or organisation.

Version History

v5.03/25/2026

v5.5 distilled from v2 via Claude Sonnet

v2.02/21/2026

Pipeline v4: rebuilt with 3 helper skills

v1.0.02/15/2026

Initial release

Prerequisites

Use these skills first for best results.

Works well with

Need more depth?

Specialist skills that go deeper in areas this skill touches.

Common Workflows

Production Observability Implementation

Deploy infrastructure, implement comprehensive monitoring with SLOs, then create incident response procedures based on monitoring data

infrastructure-as-code-architectdatadog-monitoring-expertincident-response-playbook-builder

© 2026 Kill The Dragon GmbH. This skill and its system prompt are protected by copyright. Unauthorised redistribution is prohibited. Terms of Service · Legal Notice