← Back to Skills
Data & AnalyticsTechnologyPlatinum

Ensuring data quality in pipelines.

Data Quality Engineer

Great Expectations, Soda Core, DMBOK2

advancedv5.0

Best for

  • Implementing Great Expectations validation suites for production data pipelines
  • Setting up Soda Core SQL-based data quality checks with custom metrics
  • Designing data contracts with schema enforcement and SLA monitoring
  • Building data observability dashboards with anomaly detection alerts

What you'll get

  • Step-by-step Great Expectations expectation suite configuration with YAML files and Python code for specific data quality dimensions
  • Soda Core check definitions with SQL-based validation rules, thresholds, and alert configurations for production deployment
  • Data contract specification templates with schema definitions, quality SLAs, and enforcement patterns using JSON Schema or Protobuf
Expects

Details about your data sources, existing quality issues, downstream dependencies, and business impact of data failures.

Returns

Specific implementation guidance for validation rules, tool configurations, monitoring setups, and remediation workflows with code examples.

What's inside

You are a Data Quality Engineer. You systematize quality controls across data pipelines, preventing 80%+ of incidents before they reach production. * Build defense-in-depth validation (schema → statistical → business rules → freshness) instead of point checks; place quality gates at ingestion, trans...

Covers

What You Do DifferentlyMethodologyWatch For
Not designed for ↓
  • ×Database performance tuning or query optimization
  • ×ETL pipeline development without quality considerations
  • ×Data governance policy creation at the organizational level
  • ×Machine learning model validation or feature quality assessment

SupaScore

88.43
Research Quality (15%)
8.85
Prompt Engineering (25%)
9.2
Practical Utility (15%)
8.55
Completeness (10%)
8.85
User Satisfaction (20%)
8.75
Decision Usefulness (15%)
8.65

Evidence Policy

Standard: no explicit evidence policy.

data-qualitydata-validationgreat-expectationssodadata-contractsdata-observabilityschema-enforcementanomaly-detectiondata-governancedata-engineeringetldbt

Research Foundation: 8 sources (3 official docs, 4 books, 1 industry frameworks)

This skill was developed through independent research and synthesis. SupaSkills is not affiliated with or endorsed by any cited author or organisation.

Version History

v5.03/25/2026

v5.5 final distill

v2.02/21/2026

Pipeline v4: rebuilt with 3 helper skills

v1.0.12/15/2026

Auto-versioned: masterfile quality gate passed (score: 85.0)

v1.0.02/14/2026

Initial release

Prerequisites

Use these skills first for best results.

Works well with

Need more depth?

Specialist skills that go deeper in areas this skill touches.

Common Workflows

Modern Data Stack Quality Implementation

End-to-end implementation of data quality controls across ingestion, transformation, and serving layers with comprehensive monitoring

© 2026 Kill The Dragon GmbH. This skill and its system prompt are protected by copyright. Unauthorised redistribution is prohibited. Terms of Service · Legal Notice