DevOps & Infrastructure · Engineering · Platinum

Designing observability systems for complex microservices.

Observability Pipeline Designer

OpenTelemetry, Prometheus, Grafana, ELK

expert · v5.0

Best for

  • Design telemetry collection architecture for deployments of 100+ microservices
  • Implement OpenTelemetry instrumentation strategy with cost-optimized sampling
  • Build SLO-driven alerting pipelines with Prometheus and Grafana dashboards
  • Design multi-destination data routing for logs, metrics, and traces
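
As a rough illustration of what "cost-optimized sampling" can mean in practice, the sketch below keeps every error trace and a fixed fraction of the rest, deciding by a hash of the trace ID so that every service reaches the same decision for the same trace. The function name `keep_trace` and its parameters are hypothetical and are not part of OpenTelemetry or of this skill; a real deployment would more likely configure sampling in the SDK or the Collector.

```python
import hashlib


def keep_trace(trace_id: str, sample_rate: float, is_error: bool = False) -> bool:
    """Hypothetical helper: decide whether a trace should be kept.

    Error traces are always kept. All other traces are kept with
    probability `sample_rate`, decided deterministically from the trace ID
    so the decision is consistent across services.
    """
    if not 0.0 <= sample_rate <= 1.0:
        raise ValueError("sample_rate must be between 0.0 and 1.0")
    if is_error:
        return True
    # Map the trace ID to a stable 64-bit integer bucket.
    digest = hashlib.sha256(trace_id.encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:8], "big")
    # Compare as integers to avoid floating-point edge cases near 1.0.
    threshold = int(sample_rate * 2**64)
    return bucket < threshold
```

Hashing the trace ID rather than drawing a random number is one possible design choice: it avoids partial traces when several services sample independently, at the cost of not adapting to traffic volume.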

What you'll get

  • Multi-tier OpenTelemetry Collector deployment diagram with processor chains, sampling strategies, and exporter routing configurations
  • Comprehensive instrumentation guide with semantic conventions, custom spans, and context propagation patterns for specific tech stacks
  • SLO definition matrix with error budgets, alerting thresholds, and Grafana dashboard specifications linked to business outcomes
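
To make the "error budgets" and "alerting thresholds" in the last item concrete, here is a minimal sketch of the usual arithmetic for a request-based availability SLO. The function names and the example numbers are assumptions chosen for illustration; they do not come from the skill itself.

```python
def error_budget(slo_target: float, total_requests: int, failed_requests: int):
    """Return (allowed_failures, remaining_failures) for a request-based SLO.

    slo_target is a fraction, e.g. 0.999 for "99.9% of requests succeed".
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    allowed = (1.0 - slo_target) * total_requests
    remaining = allowed - failed_requests
    return allowed, remaining


def burn_rate(slo_target: float, total_requests: int, failed_requests: int) -> float:
    """Return how fast the budget is being consumed in the observed window.

    1.0 means failures arrive exactly at the rate the SLO allows; values
    above 1.0 mean the budget would run out before the SLO period ends.
    """
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    observed_error_ratio = failed_requests / total_requests
    return observed_error_ratio / (1.0 - slo_target)


# Illustrative numbers only: 99.9% target, 1,000,000 requests, 250 failures.
allowed, remaining = error_budget(0.999, 1_000_000, 250)
rate = burn_rate(0.999, 1_000_000, 250)
```

With these illustrative inputs the budget is roughly 1,000 failed requests, about 750 remain, and the burn rate is about 0.25. An alerting threshold would typically be expressed as a burn rate sustained over a chosen window.
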
Expects

System architecture details, service topology, current observability gaps, compliance requirements, team size, and cost constraints for comprehensive telemetry pipeline design.

Returns

Detailed observability architecture with OpenTelemetry instrumentation strategy, collector topology, data routing configuration, SLO definitions, and cost optimization recommendations.

What's inside

You are an Observability Pipeline Architect. You design production-grade monitoring, logging, and tracing systems for cloud-native applications that balance actionability, cost, and operational excellence. - **Engineer for actionability, not comprehensiveness.** Every metric, log, and trace must ser...

Covers

What You Do Differently · Methodology

Not designed for

  • Basic server monitoring setup or single-application logging
  • Application performance optimization or debugging specific code issues
  • Infrastructure provisioning or cloud resource management
  • Business metrics reporting or user analytics dashboards

SupaScore

89.4
Research Quality (15%)
9.1
Prompt Engineering (25%)
8.95
Practical Utility (15%)
8.8
Completeness (10%)
8.9
User Satisfaction (20%)
9
Decision Usefulness (15%)
8.85

Evidence Policy

Standard: no explicit evidence policy.

observability · monitoring · opentelemetry · prometheus · grafana · logging · distributed-tracing · slo · alerting · metrics · elk-stack · site-reliability

Research Foundation: 8 sources (4 official docs, 3 books, 1 industry framework)

This skill was developed through independent research and synthesis. SupaSkills is not affiliated with or endorsed by any cited author or organisation.

Version History

v5.0 · 3/25/2026

v5.5 distilled from v2 via Claude Sonnet

v2.0 · 2/25/2026

Pipeline v4: rebuilt with 3 helper skills

v1.0.0 · 2/16/2026

Initial release

Common Workflows

Complete Observability Platform Implementation

End-to-end observability platform deployment from infrastructure provisioning through instrumentation to incident response procedures

© 2026 Kill The Dragon GmbH. This skill and its system prompt are protected by copyright. Unauthorised redistribution is prohibited.