Browse Skills
1306+ expert skills across 6 domains. Scored on 6 quality dimensions.
On-Call Runbook Expert
Design, author, and maintain operational runbooks that enable on-call engineers to diagnose and resolve incidents faster with structured response procedures, escalation frameworks, and toil reduction strategies.
Zero Downtime Deployment Engineer
Expert guide for planning and executing zero-downtime deployments using blue-green, rolling update, and canary release strategies with database migration support.
GitLab CI Pipeline Designer
Design, optimize, and scale GitLab CI/CD pipelines for any project size. Produces production-ready .gitlab-ci.yml configurations with caching, parallelization, security scanning, and compliance patterns.
API Failure Injection Specialist
Design and implement controlled failure injection tests for API dependencies to validate system resilience, graceful degradation, and recovery mechanisms before production incidents occur.
Database Backup Strategy Architect
Designs comprehensive database backup and disaster recovery strategies including RPO/RTO targets, backup rotation schemes, cross-region replication, point-in-time recovery, and automated restore testing.
Distributed Tracing Engineer
Design and implement distributed tracing systems using OpenTelemetry, with sampling strategies, trace pipeline architecture, and production debugging workflows for microservices.
AWS Serverless Architect
Design production-grade serverless applications on AWS using Lambda, API Gateway, DynamoDB, Step Functions, and EventBridge with cost-optimized, secure architectures.
GCP Solutions Specialist
Strategic guidance for designing and optimizing Google Cloud Platform architectures and data-intensive workloads.
Monorepo Pipeline Strategy
Design and optimize CI/CD pipelines for monorepo architectures. Provides task orchestration, caching strategies, affected-command patterns, and release coordination for multi-package repositories using Nx, Turborepo, or Bazel.
AWS Step Functions Architect
Design production-grade AWS Step Functions workflows with proper state machine patterns, error handling, saga orchestration, and cost-optimized execution strategies.
Database Performance Tuner
Diagnose and optimize database performance across PostgreSQL, MySQL, MongoDB, and cloud-managed databases.
Canary Release Governance
Design and implement canary release strategies with automated analysis, traffic shifting policies, rollback triggers, and compliance-ready governance for progressive delivery pipelines.
Docker & Container Architect
Designs production-ready containerization strategies with Docker and orchestration. Covers Dockerfile best practices, multi-stage builds, security hardening, Compose patterns, and container orchestration for reliable, secure, and efficient deployments.
n8n Error Handling Expert
Design resilient n8n workflows with intelligent retry strategies, error routing, dead letter queues, and alerting — transforming fragile automations into production-ready systems.
Kubernetes Security Baselines
Designs and enforces security baselines for Kubernetes clusters using Pod Security Standards, CIS Benchmarks, admission controllers, and network policies to harden workloads against common attack vectors.
Infrastructure Cost Attribution
Implements cloud cost tagging, FinOps dashboards, and per-team/per-product cost allocation using AWS Cost Explorer, GCP BigQuery billing exports, and Kubernetes resource quotas for accountability.
Kubernetes Cost Optimizer
Analyzes Kubernetes cluster resource utilization and recommends cost optimization strategies including right-sizing, autoscaler tuning, spot instance adoption, and commitment discount planning to reduce cloud spend by 30-60%.
Observability Pipeline Designer
Designs comprehensive monitoring, logging, and tracing pipelines using OpenTelemetry, Prometheus, Grafana, and ELK with SLO-driven alerting and cost-optimized data routing.
AWS Cost Optimizer
Analyze and optimize AWS cloud spending through commitment-based discounts, right-sizing, spot strategies, and automated cost governance using FinOps best practices.
DevOps & Cloud
Comprehensive DevOps guidance covering CI/CD pipelines, Docker, Kubernetes, cloud platforms (AWS/GCP/Azure), infrastructure as code, monitoring, incident response, and production reliability engineering.
Terraform State Management Expert
Design and operate Terraform state management strategies including remote backends, state locking, workspace topology, state splitting, migration, and disaster recovery for team-scale infrastructure-as-code workflows.
Release Engineering Specialist
Design robust release engineering processes covering build systems, artifact management, versioning, and deployment orchestration.
GCP Cloud Run Developer
Expert guidance for deploying and managing containerized applications on Google Cloud Run — from container optimization and autoscaling to IAM security, VPC networking, and cost management.
Plant Operations VSM Expert
Analyzes and optimizes manufacturing flow efficiency through systematic value stream mapping, identifying waste and designing future-state production flows using lean principles, takt time analysis, and digital VSM integration.