The Claude Code skill ecosystem is growing fast. Over 71,000 skills listed on SkillsMP alone. Vercel, Microsoft, HashiCorp, Cloudflare, Stripe, Sentry, Hugging Face, and dozens of smaller contributors are all shipping skills. The supply is real. The question is whether anyone is checking the quality.
We scored 167 community skills from 40+ organisations using the same six-dimension SupaScore rubric we apply to our own 1,278 published skills. Same weights, same criteria, same threshold.
Not a single community skill reached our 80.0 publishing threshold.
The sample
167 skills from a broad cross-section of the ecosystem. Corporate contributors like Vercel, Microsoft, HashiCorp, Cloudflare, Stripe, Sentry, and Hugging Face. Platform-focused builders like Expo and Supabase. Prolific independent contributors like softaworks (42 skills), TerminalSkills (24), obra, ComposioHQ, and machina-sports. Curated collections from awesome-skills and similar aggregators.
We excluded trivially broken skills (empty files, syntax errors) and focused on skills that were clearly intended for real use. If someone published it, we scored it.
Tier distribution
| Tier | Count | Share | Score range |
|---|---|---|---|
| Platinum (85+) | 0 | 0% | n/a |
| Gold (70-84) | 14 | 8.4% | 70.0-79.5 |
| Silver (60-69) | 37 | 22.2% | 60.0-69.9 |
| Bronze (under 60) | 116 | 69.5% | 25.0-59.9 |
Community average: 54.6. SupaSkills average: 86.7. A 32-point gap.
For context: our quality gate requires a minimum score of 80.0, at least 6 research sources across 2+ source types, and a passed masterfile from our 8-phase pipeline. 98% of our 1,278 skills are Platinum tier. Zero community skills reached Platinum. 14 reached Gold.
The best of the community
The highest score in the entire sample: Move Code Quality by 1NickPappas, at 79.5. A focused skill for Move blockchain development with real domain expertise. It falls just short of our publishing threshold, but it is genuinely well-built.
The pattern is consistent. The best community skills come from domain experts who know their subject deeply:
- Move Code Quality (79.5, Gold). Blockchain-specific, technically precise, clear scope.
- Robotics skills from specialised contributors. Narrow focus, real engineering knowledge.
- Email marketing bible. Practical, opinionated, grounded in actual campaign experience.
What these have in common: a person who knows the domain wrote them by hand, with real constraints and real opinions. They are not generated from documentation.
Where the scores break down
Research Quality is universally weak
The weakest dimension across all 167 skills. Most cite zero sources. No academic references, no industry frameworks, no books, no standards documents. The skill is the author's knowledge, unverified and unattributed.
This is the single largest factor in the score gap. Our pipeline requires minimum 6 sources across 2+ types (books, academic papers, industry frameworks, standards, etc.). That requirement alone accounts for roughly a third of the difference.
To be fair: some authors may have done extensive research without citing it in the skill file. We cannot know. So we re-ran the entire audit without the Research Quality dimension, redistributing its 15% weight proportionally across the remaining five dimensions. The result: the community average rises from 55.5 to 58.1. The gap narrows from 31 points to 28. Two skills cross our 80.0 threshold (agent-browser by Vercel at 81.1 and Obsidian Agent Memory at 80.5). Zero reach Platinum. The structural gap persists across all dimensions, not just research.
Many skills are documentation snippets
A common pattern: take an API reference or framework documentation, paste it into a skill file, add a brief instruction header. Claude already has access to this information through its training data. The skill adds little that a well-written prompt would not.
softaworks (42 skills) and TerminalSkills (24 skills) are the most prolific contributors in our sample. Both score consistently in the Bronze range. Volume without a quality gate produces volume.
Corporate skills are better structured, still shallow
Skills from Vercel, Microsoft, and similar organisations tend to have cleaner prompt engineering, better formatting, and more thoughtful instruction design. They score higher on Prompt Engineering and Completeness.
But they still lack research depth. A Vercel skill for Next.js deployment is well-written, but it does not cite the Next.js documentation, performance benchmarks, or deployment best practices from external sources. It is the author's knowledge in prompt form, same as the independents.
Practical Utility is the strongest dimension
This is encouraging. Many community skills solve real workflow problems. Deployment automation, code review checklists, framework-specific patterns. The intent is practical. The skills want to be useful.
The issue is not ambition. It is infrastructure.
The structural gap
The 32-point gap between community average (54.6) and our average (86.7) is not about individual talent. Some community skill authors clearly know their domains well. The gap is structural.
| What's missing | Effect on score |
|---|---|
| No research sources | Research Quality collapses (avg ~2/10) |
| No quality gate | Bronze skills ship alongside Gold |
| No pipeline | Inconsistent structure, missing governance |
| No versioning | No way to improve iteratively |
| No IP audit | Unknown copyright/trademark exposure |
| No peer review | Single-author blind spots persist |
Our 8-phase pipeline exists because individual expertise is not enough. Expert Council, Deep Research, Quality Gate, IP Audit, Cross-Validation. Each phase catches problems the others miss. The result is not that our authors are smarter. The result is that the pipeline catches more.
The security dimension
This is not just about quality scores. A recent arXiv study found that 26.1% of community skills contain security vulnerabilities. Prompt injection vectors, unsafe tool access patterns, missing input validation.
We run automated IP audits on every skill (1,278/1,278 audited, zero high-risk findings). We run a daily security scan with 25 checks. We have a canary skill that tests our delivery guard every 24 hours.
Most community skills have none of this. The ecosystem is growing faster than its safety infrastructure.
What we learned
Quality infrastructure makes a measurable difference. Not a marginal one. A 32-point, two-tier-category difference. The best community skill (79.5) would not pass our quality gate (80.0). That is not a coincidence. The gate exists precisely at the boundary where research depth, governance, and structured evaluation start to matter.
The best skills come from domain experts, not aggregators. Move blockchain, robotics, email marketing. Narrow focus, real knowledge, clear opinions. Aggregators who publish 20+ skills at once produce Bronze consistently.
The ecosystem needs quality infrastructure. 71,000+ skills on SkillsMP. 26.1% with security vulnerabilities. No scoring, no quality gates, no IP audits. The supply side is solved. The quality side is wide open.
The numbers
| Metric | Community (167) | SupaSkills (1,278) |
|---|---|---|
| Average score | 54.6 | 86.7 |
| Platinum tier | 0 (0%) | 1,252 (98%) |
| Gold tier | 14 (8.4%) | 26 (2%) |
| Bronze tier | 116 (69.5%) | 0 (0%) |
| Highest score | 79.5 | 92.05 |
| Min sources required | 0 | 6 |
| IP audit coverage | 0% | 100% |
| Quality gate | None | 80.0 minimum |
| Security scanning | None | 25 daily checks |
We built SupaSkills because we believe skills are the most important layer on top of Claude's already strong foundation. Claude is excellent at reasoning, coding, and analysis. Skills add domain structure, research depth, and specialised frameworks that make that foundation even more effective.
But skills only add value if they are good. The ecosystem audit suggests most are not there yet. The good news: the path from Bronze to Platinum is well understood. It is research, structure, and a quality gate that does not let anything through until it is ready.
We built that infrastructure. It is open for anyone who wants to use it.
Methodology: 167 skills scored using SupaScore (6 dimensions, weighted formula). Sources: GitHub repositories and skill directories from 40+ organisations. Scoring performed March 2026. All scores use the same rubric applied to our own skills. The full dataset is available on request.
Full Results: All 167 Skills Scored
Transparency matters. Here is every skill we tested, with author and score.
| # | Skill | Author | Score | Tier |
|---|---|---|---|---|
| 1 | Move Code Quality | 1NickPappas | 79.5 | Gold |
| 2 | Robot Perception | arpitg1304 | 79.5 | Gold |
| 3 | ROS2 Development | arpitg1304 | 78.5 | Gold |
| 4 | humanizer | blader | 77.0 | Gold |
| 5 | agent-browser | vercel-labs | 76.5 | Gold |
| 6 | Obsidian Agent Memory | AdamTylerLynch | 76.0 | Gold |
| 7 | Robotics Security | arpitg1304 | 75.5 | Gold |
| 8 | email-marketing-bible | CosmoBlk | 74.5 | Gold |
| 9 | mermaid-syntax-skill | awesome-skills | 74.0 | Gold |
| 10 | Robotics Design Patterns | arpitg1304 | 73.0 | Gold |
| 11 | Family History Research | emaynard | 71.5 | Gold |
| 12 | skill-judge | softaworks | 71.5 | Gold |
| 13 | Terraform IaC | TerminalSkills | 71.0 | Gold |
| 14 | Security Audit | TerminalSkills | 70.0 | Gold |
| 15 | manim-skill | awesome-skills | 69.0 | Silver |
| 16 | iOS Simulator | conorluddy | 68.5 | Silver |
| 17 | database-schema-designer | softaworks | 68.0 | Silver |
| 18 | 5-whys-skill | awesome-skills | 67.0 | Silver |
| 19 | Sports Betting | machina-sports | 67.0 | Silver |
| 20 | Defold Scripts Editing | indiesoftby | 66.5 | Silver |
| 21 | Kaizen | NeoLabHQ | 66.5 | Silver |
| 22 | c4-architecture | softaworks | 66.5 | Silver |
| 23 | qa-test-planner | softaworks | 66.5 | Silver |
| 24 | Stream Coding Methodology | frmoretto | 66.0 | Silver |
| 25 | Creative Director | smixs | 65.5 | Silver |
| 26 | ipsw iOS/macOS Security Research | blacktop | 65.0 | Silver |
| 27 | writing-skills | obra | 65.0 | Silver |
| 28 | RealityKit visionOS Developer | tomkrikorian | 65.0 | Silver |
| 29 | tailored-resume-generator | ComposioHQ | 64.5 | Silver |
| 30 | Polymarket | machina-sports | 64.0 | Silver |
| 31 | receiving-code-review | obra | 64.0 | Silver |
| 32 | subagent-driven-development | obra | 64.0 | Silver |
| 33 | first-principles-skill | awesome-skills | 63.5 | Silver |
| 34 | Stripe Billing | TerminalSkills | 63.5 | Silver |
| 35 | react-dev | softaworks | 63.0 | Silver |
| 36 | Playwright Browser Automation | lackeyjb | 62.5 | Silver |
| 37 | FastF1 Formula 1 Data | machina-sports | 62.5 | Silver |
| 38 | NFL Data | machina-sports | 62.5 | Silver |
| 39 | requirements-clarity | softaworks | 62.5 | Silver |
| 40 | supabase-best-practices | supabase | 62.5 | Silver |
| 41 | ARKit visionOS Developer | tomkrikorian | 62.5 | Silver |
| 42 | react-best-practices | vercel-labs | 62.5 | Silver |
| 43 | Unblock Action (Tapestry) | michalparkola | 62.0 | Silver |
| 44 | daily-meeting-update | softaworks | 62.0 | Silver |
| 45 | twitter-algorithm-optimizer | ComposioHQ | 61.5 | Silver |
| 46 | verification-before-completion | obra | 61.5 | Silver |
| 47 | beautiful_prose | SHADOWPR0 | 61.5 | Silver |
| 48 | FFmpeg | TerminalSkills | 61.5 | Silver |
| 49 | Prediction Markets | TerminalSkills | 61.5 | Silver |
| 50 | Prompt Engineering (NeoLab) | NeoLabHQ | 61.0 | Silver |
| 51 | lesson-learned | softaworks | 61.0 | Silver |
| 52 | brainstorming | obra | 60.5 | Silver |
| 53 | feedback-mastery | softaworks | 60.5 | Silver |
| 54 | mobile-app-design | awesome-skills | 60.0 | Silver |
| 55 | PyPICT Testing | omkamal | 60.0 | Silver |
| 56 | difficult-workplace-conversations | softaworks | 60.0 | Silver |
| 57 | skill-creator | ComposioHQ | 59.5 | Bronze |
| 58 | marp-slide | softaworks | 59.5 | Bronze |
| 59 | openapi-to-typescript | softaworks | 59.5 | Bronze |
| 60 | Unity Game Engine | TerminalSkills | 59.5 | Bronze |
| 61 | code-review-skill | awesome-skills | 59.0 | Bronze |
| 62 | Web Accessibility | KreerC | 59.0 | Bronze |
| 63 | Subagent Driven Development (NeoLab) | NeoLabHQ | 59.0 | Bronze |
| 64 | Spatial SwiftUI Developer | tomkrikorian | 59.0 | Bronze |
| 65 | langsmith-fetch | ComposioHQ | 58.5 | Bronze |
| 66 | Scrum Sage (Tapestry) | michalparkola | 58.5 | Bronze |
| 67 | mui | softaworks | 58.5 | Bronze |
| 68 | professional-communication | softaworks | 58.5 | Bronze |
| 69 | Blender Scripting | TerminalSkills | 58.5 | Bronze |
| 70 | TensorFlow | TerminalSkills | 58.5 | Bronze |
| 71 | design-system-starter | softaworks | 58.0 | Bronze |
| 72 | Elasticsearch | TerminalSkills | 58.0 | Bronze |
| 73 | mermaid-diagrams | softaworks | 57.5 | Bronze |
| 74 | ship-learn-next | softaworks | 57.5 | Bronze |
| 75 | OpenCV Computer Vision | TerminalSkills | 57.5 | Bronze |
| 76 | content-research-writer | ComposioHQ | 57.0 | Bronze |
| 77 | slack-gif-creator | ComposioHQ | 57.0 | Bronze |
| 78 | backend-to-frontend-handoff-docs | softaworks | 57.0 | Bronze |
| 79 | Godot Game Engine | TerminalSkills | 57.0 | Bronze |
| 80 | meeting-insights-analyzer | ComposioHQ | 56.5 | Bronze |
| 81 | dispatching-parallel-agents | obra | 56.5 | Bronze |
| 82 | agent-md-refactor | softaworks | 56.5 | Bronze |
| 83 | command-creator | softaworks | 56.5 | Bronze |
| 84 | dependency-updater | softaworks | 56.5 | Bronze |
| 85 | gepetto | softaworks | 56.5 | Bronze |
| 86 | naming-analyzer | softaworks | 56.5 | Bronze |
| 87 | Contract Review | TerminalSkills | 56.5 | Bronze |
| 88 | GDPR Compliance | TerminalSkills | 56.5 | Bronze |
| 89 | Kubernetes Helm | TerminalSkills | 56.5 | Bronze |
| 90 | ClawSec Security Scanner | prompt-security | 56.0 | Bronze |
| 91 | writing-clearly-and-concisely | softaworks | 56.0 | Bronze |
| 92 | Unreal Engine | TerminalSkills | 56.0 | Bronze |
| 93 | invoice-organizer | ComposioHQ | 55.5 | Bronze |
| 94 | react-useeffect | softaworks | 55.5 | Bronze |
| 95 | session-handoff | softaworks | 55.5 | Bronze |
| 96 | webapp-testing | ComposioHQ | 55.0 | Bronze |
| 97 | Ship Learn Next (Tapestry) | michalparkola | 55.0 | Bronze |
| 98 | AI Guardrails | TerminalSkills | 55.0 | Bronze |
| 99 | PyTorch | TerminalSkills | 55.0 | Bronze |
| 100 | ShaderGraph Editor | tomkrikorian | 55.0 | Bronze |
| 101 | CSV Data Summarizer | coffeefuelbump | 54.5 | Bronze |
| 102 | YouTube Transcript | michalparkola | 54.5 | Bronze |
| 103 | Session Log (Tapestry) | michalparkola | 54.0 | Bronze |
| 104 | using-git-worktrees | obra | 54.0 | Bronze |
| 105 | writing-plans | obra | 54.0 | Bronze |
| 106 | file-organizer | ComposioHQ | 53.5 | Bronze |
| 107 | Article Extractor | michalparkola | 53.5 | Bronze |
| 108 | Learn This (Tapestry) | michalparkola | 53.5 | Bronze |
| 109 | canvas-design | ComposioHQ | 53.0 | Bronze |
| 110 | Pomodoro System Skill | jakedahn | 53.0 | Bronze |
| 111 | react-native-skills | vercel-labs | 52.5 | Bronze |
| 112 | competitive-ads-extractor | ComposioHQ | 51.5 | Bronze |
| 113 | jira | softaworks | 51.5 | Bronze |
| 114 | Sensei Skill Improver | spboyer | 51.5 | Bronze |
| 115 | GraphQL | TerminalSkills | 51.5 | Bronze |
| 116 | gRPC | TerminalSkills | 51.5 | Bronze |
| 117 | Markdown Exporter | bowenliang123 | 51.0 | Bronze |
| 118 | domain-name-brainstormer | softaworks | 50.5 | Bronze |
| 119 | composition-patterns | vercel-labs | 50.5 | Bronze |
| 120 | gemini | softaworks | 50.0 | Bronze |
| 121 | Algorithmic Trading | TerminalSkills | 49.5 | Bronze |
| 122 | AWS Agentic AI | zxkane | 49.0 | Bronze |
| 123 | changelog-generator | ComposioHQ | 48.0 | Bronze |
| 124 | reducing-entropy | softaworks | 48.0 | Bronze |
| 125 | frontend-to-backend-requirements | softaworks | 47.5 | Bronze |
| 126 | crafting-effective-readmes | softaworks | 47.0 | Bronze |
| 127 | recall | arjunkmrm | 46.5 | Bronze |
| 128 | lead-research-assistant | ComposioHQ | 46.5 | Bronze |
| 129 | remotion-best-practices | remotion-dev | 46.5 | Bronze |
| 130 | Solidity Smart Contracts | TerminalSkills | 46.5 | Bronze |
| 131 | ClawSec Security Suite | prompt-security | 46.0 | Bronze |
| 132 | game-changing-features | softaworks | 46.0 | Bronze |
| 133 | Soul Guardian | prompt-security | 45.5 | Bronze |
| 134 | draw-io | softaworks | 45.5 | Bronze |
| 135 | excalidraw | softaworks | 45.5 | Bronze |
| 136 | Data Analysis | TerminalSkills | 45.5 | Bronze |
| 137 | Sixtyfour People Intelligence API | rxhxm | 45.0 | Bronze |
| 138 | commit-work | softaworks | 44.5 | Bronze |
| 139 | Apache Kafka | TerminalSkills | 44.5 | Bronze |
| 140 | Jules (Gemini Agent Delegation) | sanjay3290 | 44.0 | Bronze |
| 141 | Postgres | sanjay3290 | 44.0 | Bronze |
| 142 | Markdown to EPUB Converter | smerchek | 44.0 | Bronze |
| 143 | connect | ComposioHQ | 43.5 | Bronze |
| 144 | Apache Spark | TerminalSkills | 43.5 | Bronze |
| 145 | Review Implementing | mhattingpete | 43.0 | Bronze |
| 146 | executing-plans | obra | 43.0 | Bronze |
| 147 | requesting-code-review | obra | 43.0 | Bronze |
| 148 | datadog-cli | softaworks | 43.0 | Bronze |
| 149 | plugin-forge | softaworks | 43.0 | Bronze |
| 150 | Software Architecture (NeoLab) | NeoLabHQ | 42.5 | Bronze |
| 151 | perplexity | softaworks | 41.5 | Bronze |
| 152 | brand-guidelines | ComposioHQ | 40.5 | Bronze |
| 153 | theme-factory | ComposioHQ | 39.5 | Bronze |
| 154 | Outline Wiki | sanjay3290 | 39.5 | Bronze |
| 155 | codex | softaworks | 39.5 | Bronze |
| 156 | NotebookLM Integration | PleasePrompto | 39.0 | Bronze |
| 157 | meme-factory | softaworks | 39.0 | Bronze |
| 158 | web-to-markdown | softaworks | 38.5 | Bronze |
| 159 | raffle-winner-picker | ComposioHQ | 38.0 | Bronze |
| 160 | using-superpowers | obra | 38.0 | Bronze |
| 161 | Prompt Agent | prompt-security | 38.0 | Bronze |
| 162 | internal-comms | ComposioHQ | 37.5 | Bronze |
| 163 | Reddit Fetch | ykdojo | 37.5 | Bronze |
| 164 | Test Fixing | mhattingpete | 36.0 | Bronze |
| 165 | Deep Research (Gemini) | sanjay3290 | 34.5 | Bronze |
| 166 | Imagen (Gemini) | sanjay3290 | 25.5 | Bronze |
| 167 | image-enhancer | ComposioHQ | 25.0 | Bronze |
All skills scored using SupaScore 6D rubric (Research Quality 15%, Prompt Engineering 25%, Practical Utility 15%, Completeness 10%, User Satisfaction 20%, Decision Usefulness 15%). Same rubric applied to all 1,278 SupaSkills.
Disclaimer: This benchmark was conducted in good faith using a documented, reproducible methodology. The same rubric and scoring process is applied to our own skills. Errors are possible. If you are a skill author and believe your skill was scored unfairly, miscategorised, or evaluated based on an outdated version, please contact us. We are happy to re-score with an updated version, correct any factual errors, or discuss methodology. The goal of this audit is to improve the ecosystem, not to diminish anyone's work. The security vulnerability statistic (26.1%) is sourced from Liu et al., 2026 (arXiv:2601.10338), an independent academic study.