← Back to Skills
Data & AnalyticsTechnologyPlatinum

Analyze data and create visual insights using Python.

Python Data Analyst

Pandas, NumPy, Matplotlib, EDA

intermediatev5.0

Best for

  • Exploratory data analysis of customer transaction datasets to identify spending patterns
  • Statistical profiling of messy CSV files to assess data quality before model training
  • Creating business dashboards with Pandas aggregations and Matplotlib visualizations
  • Data cleaning pipelines for survey responses with missing values and inconsistent formats

What you'll get

  • Jupyter notebook with step-by-step EDA including data profiling, outlier detection, correlation heatmaps, and business insights
  • Python script with method-chained Pandas operations for data cleaning, feature engineering, and grouped aggregations
  • Statistical summary report with distribution plots, missing value analysis, and recommendations for next steps
Expects

Raw datasets in common formats (CSV, Excel, JSON) with business context about what insights are needed.

Returns

Clean, documented Python code with data quality assessments, statistical summaries, and publication-ready visualizations.

What's inside

You are a Python Data Analyst. You transform raw data into reliable analytical assets using Pandas, NumPy, statistical rigor, and compelling visualizations that drive business decisions. - Combine statistical rigor with business pragmatism: report effect sizes alongside p-values, check assumptions b...

Covers

What You Do DifferentlyMethodologyWatch For
Not designed for ↓
  • ×Deep learning model development or neural network architecture
  • ×Real-time streaming data processing or production ETL pipelines
  • ×Advanced statistical modeling like causal inference or Bayesian analysis
  • ×Big data processing beyond single-machine memory limits

SupaScore

87.75
Research Quality (15%)
8.85
Prompt Engineering (25%)
8.75
Practical Utility (15%)
8.9
Completeness (10%)
9.4
User Satisfaction (20%)
8.7
Decision Usefulness (15%)
8.3

Evidence Policy

Standard: no explicit evidence policy.

pythonpandasnumpyedadata-analysisvisualizationstatistics

Research Foundation: 7 sources (4 official docs, 1 paper, 2 books)

This skill was developed through independent research and synthesis. SupaSkills is not affiliated with or endorsed by any cited author or organisation.

Version History

v5.03/25/2026

v5.5 final distill

v2.02/26/2026

Pipeline v4: rebuilt with 3 helper skills

v1.0.02/15/2026

Initial version

Works well with

Need more depth?

Specialist skills that go deeper in areas this skill touches.

Common Workflows

Data Discovery to Dashboard

Comprehensive workflow from raw data exploration through quality assessment to final business dashboard creation

© 2026 Kill The Dragon GmbH. This skill and its system prompt are protected by copyright. Unauthorised redistribution is prohibited. Terms of Service · Legal Notice