DevOps & Infrastructure | Engineering | Platinum

Need expert help with Datadog monitoring setup and optimization.

Datadog Monitoring Expert

Datadog APM, Infrastructure, Logs

intermediate | v6.0

Best for

▸ Configuring Datadog Agent on Kubernetes clusters with custom metric collection and APM tracing
▸ Setting up SLO-based alerting for microservices with error budget burn rate notifications
▸ Implementing cost-effective custom metrics governance using Metrics without Limits
▸ Designing executive dashboards showing business KPIs correlated with infrastructure performance
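
The error budget burn rate mentioned in the list above comes down to simple arithmetic. The sketch below is not Datadog configuration. It assumes the common SRE definition, where burn rate is the observed error ratio divided by the error budget (1 minus the SLO target). The function names and the 30-day SLO period are illustrative assumptions.

```python
def burn_rate(error_ratio: float, slo_target: float) -> float:
    """Observed error ratio divided by the allowed error ratio (the error budget)."""
    budget = 1.0 - slo_target
    if budget <= 0:
        raise ValueError("slo_target must be below 1.0")
    return error_ratio / budget


def budget_consumed(rate: float, window_hours: float,
                    slo_period_hours: float = 30 * 24) -> float:
    """Fraction of the period's error budget spent if `rate` holds for `window_hours`."""
    return rate * window_hours / slo_period_hours


if __name__ == "__main__":
    # Assumed example: a 99.9% SLO with 1.44% of requests failing.
    rate = burn_rate(error_ratio=0.0144, slo_target=0.999)
    print(f"burn rate: {rate:.1f}")
    print(f"budget consumed in 1h: {budget_consumed(rate, 1):.1%}")
```

With these assumed inputs the burn rate is about 14.4, which spends roughly 2% of a 30-day budget in one hour. That pairing is a frequently cited paging threshold in SRE literature. The thresholds that fit a given service may differ.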

What you'll get

▸ Step-by-step Kubernetes DaemonSet configuration with YAML manifests for Agent deployment, including APM, logs, and custom metrics collection
▸ SLO definition templates with error budget policies, burn rate alerting thresholds, and escalation playbooks following SRE best practices
▸ Cost optimization audit report identifying high-cardinality metrics and unused monitors, plus a Metrics without Limits implementation plan with projected savings
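
To make "high-cardinality metrics" concrete: a custom metric is counted per unique combination of metric name and tag values, so one unbounded tag can multiply the series count. The sketch below is a simplified model of that idea, not Datadog's billing logic or API. The metric names, tag keys, and sample data are hypothetical.

```python
from collections import defaultdict


def series_per_metric(points):
    """Count distinct tag-value combinations seen for each metric name."""
    combos = defaultdict(set)
    for name, tags in points:
        combos[name].add(frozenset(tags.items()))
    return {name: len(seen) for name, seen in combos.items()}


def distinct_values_per_tag(points, metric):
    """For one metric, count distinct values per tag key to spot the cardinality driver."""
    values = defaultdict(set)
    for name, tags in points:
        if name == metric:
            for key, value in tags.items():
                values[key].add(value)
    return {key: len(seen) for key, seen in values.items()}


if __name__ == "__main__":
    # Hypothetical sample data; metric and tag names are made up for illustration.
    sample = [
        ("checkout.latency", {"env": "prod", "service": "checkout", "user_id": "u1"}),
        ("checkout.latency", {"env": "prod", "service": "checkout", "user_id": "u2"}),
        ("checkout.latency", {"env": "prod", "service": "checkout", "user_id": "u3"}),
        ("checkout.errors", {"env": "prod", "service": "checkout"}),
    ]
    print(series_per_metric(sample))
    print(distinct_values_per_tag(sample, "checkout.latency"))
```

In this sample the `user_id` tag alone turns one latency metric into three series. Excluding such a tag from the queryable set is the kind of reduction a tag allowlist targets.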

Expects

Current infrastructure details (cloud provider, container orchestration, service count), existing Datadog products in use, specific monitoring pain points, and current tagging strategy implementation.

Returns

Detailed configuration guides, monitoring strategy recommendations, dashboard templates, alerting playbooks, and cost optimization tactics with specific Datadog feature implementations.

What's inside

“You are a Datadog Monitoring Expert. You design, deploy, and optimize Datadog observability platforms across production environments spanning hosts, containers, serverless functions, and distributed microservices. - **Unified Service Tagging as a hard prerequisite.** Before any configuration, you ve...”

Covers

What You Do Differently | Methodology | Watch For

Not designed for

× Setting up competing monitoring tools like Prometheus, Grafana, or New Relic
× Deep application code debugging or performance optimization (beyond observability)
× General cloud architecture design unrelated to monitoring
× Writing custom Datadog integrations or developing against Datadog APIs

SupaScore

88.2

Research Quality (15%)

9.1

Prompt Engineering (25%)

8.95

Practical Utility (15%)

8.65

Completeness (10%)

8.85

User Satisfaction (20%)

8.8

Decision Usefulness (15%)

8.5
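
The headline score is consistent with a weighted mean of the six category scores, scaled to 100. The page does not state the formula, so the sketch below treats that as an assumption. The function name is illustrative, and the scores and weights are copied from the listing above.

```python
# Category scores (0-10) and weights as listed on this page.
CATEGORIES = {
    "Research Quality": (9.1, 0.15),
    "Prompt Engineering": (8.95, 0.25),
    "Practical Utility": (8.65, 0.15),
    "Completeness": (8.85, 0.10),
    "User Satisfaction": (8.8, 0.20),
    "Decision Usefulness": (8.5, 0.15),
}


def weighted_score(categories):
    """Weighted mean of the category scores, scaled from a 0-10 to a 0-100 range."""
    total_weight = sum(weight for _, weight in categories.values())
    mean = sum(score * weight for score, weight in categories.values()) / total_weight
    return round(mean * 10, 1)


if __name__ == "__main__":
    print(weighted_score(CATEGORIES))
```

Under this assumption the six categories reproduce the displayed 88.2, and Prompt Engineering carries the largest share at 25%.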

Evidence Policy

Standard: no explicit evidence policy.

datadog, apm, monitoring, observability, distributed-tracing, slo-sli, alerting, dashboards, log-management, custom-metrics, cost-optimization, infrastructure-monitoring

Research Foundation: 10 sources (7 official docs, 1 industry framework, 1 book, 1 web source)

This skill was developed through independent research and synthesis. SupaSkills is not affiliated with or endorsed by any cited author or organisation.

Version History

v6.0 (6/12/2026)

v6.0 wave-1 repair: re-distilled from masterfile/v2 (truncation incident 2026-06, delta-first rules)

v5.0 (3/25/2026)

v5.5 distilled from v2 via Claude Sonnet

v2.0 (2/21/2026)

Pipeline v4: rebuilt with 3 helper skills

v1.0.0 (2/15/2026)

Initial release

Prerequisites

Use these skills first for best results.

Container Orchestration Expert (Platinum)

Works well with

AWS Solutions Architect (Platinum), Distributed Tracing Engineer (Platinum), Kubernetes Operations Advisor (Platinum), Observability Pipeline Designer (Platinum), Site Reliability Engineer (Platinum)

Need more depth?

Specialist skills that go deeper in areas this skill touches.

OpenTelemetry Instrumentation Engineer (Platinum), Log Management Architect (Platinum), Kubernetes Security Hardening (Platinum)

Common Workflows

Production Observability Implementation

Deploy infrastructure, implement comprehensive monitoring with SLOs, then create incident response procedures based on the monitoring data.

infrastructure-as-code-architect → datadog-monitoring-expert → incident-response-playbook-builder