Browse Skills
1078+ expert skills across 5 domains. Scored on 6 quality dimensions.
Terraform Infrastructure Architect
Designs production-grade Terraform configurations with expert module architecture, state management strategies (remote backends, workspaces), provider patterns, drift detection, CI/CD integration, multi-environment promotion, and HCP Terraform workflows.
Log Management Architect
Designs centralized logging architectures covering structured logging schemas, log pipeline design, storage tiering, cost optimization, compliance-ready audit logging, and log-based alerting for distributed systems at any scale.
Monitoring & Observability Designer
Designs comprehensive observability strategies covering metrics, logs, and traces with alerting frameworks, SLI/SLO/SLA definitions, and dashboard patterns using Prometheus, Grafana, and OpenTelemetry.
Cloud Cost Optimizer
Expert cloud cost optimization advisor that helps organizations reduce and manage cloud spending using FinOps principles, right-sizing strategies, reserved instance planning, and cost allocation frameworks across AWS, Azure, and GCP environments.
Container Security Hardener
Harden container images and runtime environments with image scanning, admission controllers, seccomp/AppArmor profiles, distroless images, and supply chain security practices.
Database Migration Strategist
Design zero-downtime database migration strategies with backward-compatible schema evolution, blue-green database patterns, safe rollback procedures, and data migration orchestration.
Kubernetes Operations Advisor
Designs and troubleshoots Kubernetes architectures including Deployments, Services, Ingress, Helm charts, resource management, and production-grade cluster operations with security best practices.
Zero Downtime Deployment Engineer
Expert guide for planning and executing zero-downtime deployments using blue-green, rolling update, and canary release strategies with database migration support.
Blue-Green Deployment Orchestrator
Implements zero-downtime blue-green deployment patterns with automated environment switching, health checks, rollback triggers, and traffic routing for web services and databases.
Distributed Tracing Engineer
Design and implement distributed tracing systems using OpenTelemetry, with sampling strategies, trace pipeline architecture, and production debugging workflows for microservices.
Observability Pipeline Designer
Designs comprehensive monitoring, logging, and tracing pipelines using OpenTelemetry, Prometheus, Grafana, and ELK with SLO-driven alerting and cost-optimized data routing.
Azure Cloud Architect
Expert guidance on designing and implementing scalable, secure Azure cloud solutions and architectures.
Incident Response Playbook Builder
Creates structured incident response playbooks with severity classifications, escalation procedures, communication templates, runbooks, postmortem frameworks, and blameless culture practices.
Helm Charts Architect
Designs production-grade Kubernetes Helm v3 charts with expert-level Go templating, values.yaml architecture, dependency management, chart testing strategies, security scanning, and GitOps integration patterns.
Network Architecture Designer
Designs cloud and hybrid network architectures including VPC topologies, subnetting strategies, DNS management, L4/L7 load balancing, CDN integration, network security groups, and hybrid connectivity patterns.
Datadog Monitoring Expert
Comprehensive Datadog monitoring expertise covering APM, infrastructure monitoring, log management, custom metrics, dashboard design, alerting strategies, SLO/SLI definition, distributed tracing, Agent configuration, cost optimization, and integration patterns for production environments.
Automotive Supplier SPC Specialist
Implement and manage Statistical Process Control systems for automotive supply chains. Guide suppliers through MSA validation, control chart deployment, capability studies (Cpk/Ppk), and PPAP-ready SPC documentation that meets OEM requirements.
SRE Incident Response Expert
Designs and executes structured incident response processes for production outages, combining SRE discipline with Incident Command System principles to minimize downtime and maximize organizational learning.
Kafka Streaming Architect
Expert guidance for designing, deploying, and operating Apache Kafka streaming platforms, covering topic architecture, producer/consumer patterns, stream processing, schema governance, cluster sizing, and disaster recovery.
Webhook Reliability Architect
Design production-grade webhook delivery systems with guaranteed delivery, intelligent retry policies, idempotency patterns, dead-letter queue strategies, and comprehensive observability — ensuring no event is ever silently lost.
Infrastructure as Code Architect
Designs Infrastructure as Code architectures using Terraform, Pulumi, or CDK with module patterns, state management strategies, drift detection, GitOps workflows, and multi-environment promotion.
Serverless Architecture Advisor
Designs event-driven serverless architectures optimizing for cost, performance, and operational simplicity. Covers Lambda/Cloud Functions patterns, cold start mitigation, Step Functions orchestration, and serverless database selection.
Cloudflare Workers Developer
Build, optimize, and deploy production-grade Cloudflare Workers applications using V8 isolates, Wrangler CLI, KV storage, Durable Objects, R2, D1, Workers AI, cron triggers, WebSocket handling, and edge-side request routing with expert guidance on performance tuning and cost management.
Plant Operations VSM Expert
Analyzes and optimizes manufacturing flow efficiency through systematic value stream mapping, identifying waste and designing future-state production flows using lean principles, takt time analysis, and digital VSM integration.