Optimizing large datasets in Google BigQuery for cost and performance.
BigQuery Analytics Expert
Google BigQuery, SQL Analytics, Dataform
Best for
- ▸Optimizing BigQuery table partitioning and clustering for 10TB+ datasets to reduce query costs by 60-90%
- ▸Implementing BigQuery ML models with CREATE MODEL for customer segmentation and demand forecasting
- ▸Building Dataform data pipelines with dependency management and automated testing
- ▸Designing materialized views for real-time dashboards while managing BigQuery slot consumption
What you'll get
- ▸Detailed partitioning strategy with DATE partitioning on transaction_date and clustering on [customer_id, product_category] with cost impact analysis showing 75% scan reduction
- ▸Complete BigQuery ML implementation with CREATE MODEL statement for ARIMA_PLUS time series forecasting including feature engineering and evaluation metrics
- ▸Comprehensive Dataform pipeline architecture with SQLX transformations, assertion tests, and incremental table configurations for multi-TB daily processing
Data architecture details including table sizes, query patterns, current partitioning strategy, slot usage, and specific BigQuery performance or cost optimization challenges.
Detailed BigQuery optimization recommendations with specific SQL examples, partitioning strategies, cost projections, and implementation steps for scalable analytics architecture.
What's inside
“You are a BigQuery Analytics Expert. You design and optimize petabyte-scale data warehouses using columnar storage, distributed query execution, and advanced BigQuery features for cost-efficient, secure analytics. - Diagnose root causes of high costs and poor performance by analyzing storage archite...”
Covers
Not designed for ↓
- ×Non-Google Cloud data warehouse platforms like Snowflake or Redshift architecture
- ×Traditional on-premises ETL tools or database administration outside BigQuery
- ×Basic SQL query writing without BigQuery-specific optimization considerations
- ×Cloud infrastructure provisioning beyond BigQuery and related GCP services
SupaScore
88.65▼
Evidence Policy
Standard: no explicit evidence policy.
Research Foundation: 8 sources (4 official docs, 2 paper, 2 web)
This skill was developed through independent research and synthesis. SupaSkills is not affiliated with or endorsed by any cited author or organisation.
Version History
v5.5 final distill
Pipeline v4: rebuilt with 3 helper skills
Initial release
Prerequisites
Use these skills first for best results.
Works well with
Need more depth?
Specialist skills that go deeper in areas this skill touches.
Common Workflows
Enterprise Data Warehouse Migration
Complete migration from legacy data warehouse to BigQuery with optimized architecture, ETL pipelines, and governance framework
ML-Powered Analytics Platform
Build end-to-end analytics platform with BigQuery ML models, feature engineering, and business intelligence dashboards
© 2026 Kill The Dragon GmbH. This skill and its system prompt are protected by copyright. Unauthorised redistribution is prohibited. Terms of Service · Legal Notice