The Ecosystem Audit: Scoring 167 Community Agent Skills

The Claude Code skill ecosystem is growing fast. Over 71,000 skills listed on SkillsMP alone. Vercel, Microsoft, HashiCorp, Cloudflare, Stripe, Sentry, Hugging Face, and dozens of smaller contributors are all shipping skills. The supply is real. The question is whether anyone is checking the quality.

We scored 167 community skills from 40+ organisations using the same six-dimension SupaScore rubric we apply to our own 1,278 published skills. Same weights, same criteria, same threshold.

Not a single community skill reached our 80.0 publishing threshold.

The sample

167 skills from a broad cross-section of the ecosystem. Corporate contributors like Vercel, Microsoft, HashiCorp, Cloudflare, Stripe, Sentry, and Hugging Face. Platform-focused builders like Expo and Supabase. Prolific independent contributors like softaworks (42 skills), TerminalSkills (24), obra, ComposioHQ, and machina-sports. Curated collections from awesome-skills and similar aggregators.

We excluded trivially broken skills (empty files, syntax errors) and focused on skills that were clearly intended for real use. If someone published it, we scored it.

Tier distribution

Tier	Count	Share	Score range
Platinum (85+)	0	0%	n/a
Gold (70-84)	14	8.4%	70.0-79.5
Silver (60-69)	37	22.2%	60.0-69.9
Bronze (under 60)	116	69.5%	25.0-59.9

Community average: 54.6. SupaSkills average: 86.7. A 32-point gap.

For context: our quality gate requires a minimum score of 80.0, at least 6 research sources across 2+ source types, and a passed masterfile from our 8-phase pipeline. 98% of our 1,278 skills are Platinum tier. Zero community skills reached Platinum. 14 reached Gold.

The best of the community

The highest score in the entire sample: Move Code Quality by 1NickPappas, at 79.5. A focused skill for Move blockchain development with real domain expertise. It falls just short of our publishing threshold, but it is genuinely well-built.

The pattern is consistent. The best community skills come from domain experts who know their subject deeply:

Move Code Quality (79.5, Gold). Blockchain-specific, technically precise, clear scope.
Robotics skills from specialised contributors. Narrow focus, real engineering knowledge.
Email marketing bible. Practical, opinionated, grounded in actual campaign experience.

What these have in common: a person who knows the domain wrote them by hand, with real constraints and real opinions. They are not generated from documentation.

Where the scores break down

Research Quality is universally weak

The weakest dimension across all 167 skills. Most cite zero sources. No academic references, no industry frameworks, no books, no standards documents. The skill is the author's knowledge, unverified and unattributed.

This is the single largest factor in the score gap. Our pipeline requires minimum 6 sources across 2+ types (books, academic papers, industry frameworks, standards, etc.). That requirement alone accounts for roughly a third of the difference.

To be fair: some authors may have done extensive research without citing it in the skill file. We cannot know. So we re-ran the entire audit without the Research Quality dimension, redistributing its 15% weight proportionally across the remaining five dimensions. The result: the community average rises from 55.5 to 58.1. The gap narrows from 31 points to 28. Two skills cross our 80.0 threshold (agent-browser by Vercel at 81.1 and Obsidian Agent Memory at 80.5). Zero reach Platinum. The structural gap persists across all dimensions, not just research.

Many skills are documentation snippets

A common pattern: take an API reference or framework documentation, paste it into a skill file, add a brief instruction header. Claude already has access to this information through its training data. The skill adds little that a well-written prompt would not.

softaworks (42 skills) and TerminalSkills (24 skills) are the most prolific contributors in our sample. Both score consistently in the Bronze range. Volume without a quality gate produces volume.

Corporate skills are better structured, still shallow

Skills from Vercel, Microsoft, and similar organisations tend to have cleaner prompt engineering, better formatting, and more thoughtful instruction design. They score higher on Prompt Engineering and Completeness.

But they still lack research depth. A Vercel skill for Next.js deployment is well-written, but it does not cite the Next.js documentation, performance benchmarks, or deployment best practices from external sources. It is the author's knowledge in prompt form, same as the independents.

Practical Utility is the strongest dimension

This is encouraging. Many community skills solve real workflow problems. Deployment automation, code review checklists, framework-specific patterns. The intent is practical. The skills want to be useful.

The issue is not ambition. It is infrastructure.

The structural gap

The 32-point gap between community average (54.6) and our average (86.7) is not about individual talent. Some community skill authors clearly know their domains well. The gap is structural.

What's missing	Effect on score
No research sources	Research Quality collapses (avg ~2/10)
No quality gate	Bronze skills ship alongside Gold
No pipeline	Inconsistent structure, missing governance
No versioning	No way to improve iteratively
No IP audit	Unknown copyright/trademark exposure
No peer review	Single-author blind spots persist

Our 8-phase pipeline exists because individual expertise is not enough. Expert Council, Deep Research, Quality Gate, IP Audit, Cross-Validation. Each phase catches problems the others miss. The result is not that our authors are smarter. The result is that the pipeline catches more.

The security dimension

This is not just about quality scores. A recent arXiv study found that 26.1% of community skills contain security vulnerabilities. Prompt injection vectors, unsafe tool access patterns, missing input validation.

We run automated IP audits on every skill (1,278/1,278 audited, zero high-risk findings). We run a daily security scan with 25 checks. We have a canary skill that tests our delivery guard every 24 hours.

Most community skills have none of this. The ecosystem is growing faster than its safety infrastructure.

What we learned

Quality infrastructure makes a measurable difference. Not a marginal one. A 32-point, two-tier-category difference. The best community skill (79.5) would not pass our quality gate (80.0). That is not a coincidence. The gate exists precisely at the boundary where research depth, governance, and structured evaluation start to matter.

The best skills come from domain experts, not aggregators. Move blockchain, robotics, email marketing. Narrow focus, real knowledge, clear opinions. Aggregators who publish 20+ skills at once produce Bronze consistently.

The ecosystem needs quality infrastructure. 71,000+ skills on SkillsMP. 26.1% with security vulnerabilities. No scoring, no quality gates, no IP audits. The supply side is solved. The quality side is wide open.

The numbers

Metric	Community (167)	SupaSkills (1,278)
Average score	54.6	86.7
Platinum tier	0 (0%)	1,252 (98%)
Gold tier	14 (8.4%)	26 (2%)
Bronze tier	116 (69.5%)	0 (0%)
Highest score	79.5	92.05
Min sources required	0	6
IP audit coverage	0%	100%
Quality gate	None	80.0 minimum
Security scanning	None	25 daily checks

We built SupaSkills because we believe skills are the most important layer on top of Claude's already strong foundation. Claude is excellent at reasoning, coding, and analysis. Skills add domain structure, research depth, and specialised frameworks that make that foundation even more effective.

But skills only add value if they are good. The ecosystem audit suggests most are not there yet. The good news: the path from Bronze to Platinum is well understood. It is research, structure, and a quality gate that does not let anything through until it is ready.

We built that infrastructure. It is open for anyone who wants to use it.

Methodology: 167 skills scored using SupaScore (6 dimensions, weighted formula). Sources: GitHub repositories and skill directories from 40+ organisations. Scoring performed March 2026. All scores use the same rubric applied to our own skills. The full dataset is available on request.

Full Results: All 167 Skills Scored

Transparency matters. Here is every skill we tested, with author and score.

#	Skill	Author	Score	Tier
1	Move Code Quality	1NickPappas	79.5	Gold
2	Robot Perception	arpitg1304	79.5	Gold
3	ROS2 Development	arpitg1304	78.5	Gold
4	humanizer	blader	77.0	Gold
5	agent-browser	vercel-labs	76.5	Gold
6	Obsidian Agent Memory	AdamTylerLynch	76.0	Gold
7	Robotics Security	arpitg1304	75.5	Gold
8	email-marketing-bible	CosmoBlk	74.5	Gold
9	mermaid-syntax-skill	awesome-skills	74.0	Gold
10	Robotics Design Patterns	arpitg1304	73.0	Gold
11	Family History Research	emaynard	71.5	Gold
12	skill-judge	softaworks	71.5	Gold
13	Terraform IaC	TerminalSkills	71.0	Gold
14	Security Audit	TerminalSkills	70.0	Gold
15	manim-skill	awesome-skills	69.0	Silver
16	iOS Simulator	conorluddy	68.5	Silver
17	database-schema-designer	softaworks	68.0	Silver
18	5-whys-skill	awesome-skills	67.0	Silver
19	Sports Betting	machina-sports	67.0	Silver
20	Defold Scripts Editing	indiesoftby	66.5	Silver
21	Kaizen	NeoLabHQ	66.5	Silver
22	c4-architecture	softaworks	66.5	Silver
23	qa-test-planner	softaworks	66.5	Silver
24	Stream Coding Methodology	frmoretto	66.0	Silver
25	Creative Director	smixs	65.5	Silver
26	ipsw iOS/macOS Security Research	blacktop	65.0	Silver
27	writing-skills	obra	65.0	Silver
28	RealityKit visionOS Developer	tomkrikorian	65.0	Silver
29	tailored-resume-generator	ComposioHQ	64.5	Silver
30	Polymarket	machina-sports	64.0	Silver
31	receiving-code-review	obra	64.0	Silver
32	subagent-driven-development	obra	64.0	Silver
33	first-principles-skill	awesome-skills	63.5	Silver
34	Stripe Billing	TerminalSkills	63.5	Silver
35	react-dev	softaworks	63.0	Silver
36	Playwright Browser Automation	lackeyjb	62.5	Silver
37	FastF1 Formula 1 Data	machina-sports	62.5	Silver
38	NFL Data	machina-sports	62.5	Silver
39	requirements-clarity	softaworks	62.5	Silver
40	supabase-best-practices	supabase	62.5	Silver
41	ARKit visionOS Developer	tomkrikorian	62.5	Silver
42	react-best-practices	vercel-labs	62.5	Silver
43	Unblock Action (Tapestry)	michalparkola	62.0	Silver
44	daily-meeting-update	softaworks	62.0	Silver
45	twitter-algorithm-optimizer	ComposioHQ	61.5	Silver
46	verification-before-completion	obra	61.5	Silver
47	beautiful_prose	SHADOWPR0	61.5	Silver
48	FFmpeg	TerminalSkills	61.5	Silver
49	Prediction Markets	TerminalSkills	61.5	Silver
50	Prompt Engineering (NeoLab)	NeoLabHQ	61.0	Silver
51	lesson-learned	softaworks	61.0	Silver
52	brainstorming	obra	60.5	Silver
53	feedback-mastery	softaworks	60.5	Silver
54	mobile-app-design	awesome-skills	60.0	Silver
55	PyPICT Testing	omkamal	60.0	Silver
56	difficult-workplace-conversations	softaworks	60.0	Silver
57	skill-creator	ComposioHQ	59.5	Bronze
58	marp-slide	softaworks	59.5	Bronze
59	openapi-to-typescript	softaworks	59.5	Bronze
60	Unity Game Engine	TerminalSkills	59.5	Bronze
61	code-review-skill	awesome-skills	59.0	Bronze
62	Web Accessibility	KreerC	59.0	Bronze
63	Subagent Driven Development (NeoLab)	NeoLabHQ	59.0	Bronze
64	Spatial SwiftUI Developer	tomkrikorian	59.0	Bronze
65	langsmith-fetch	ComposioHQ	58.5	Bronze
66	Scrum Sage (Tapestry)	michalparkola	58.5	Bronze
67	mui	softaworks	58.5	Bronze
68	professional-communication	softaworks	58.5	Bronze
69	Blender Scripting	TerminalSkills	58.5	Bronze
70	TensorFlow	TerminalSkills	58.5	Bronze
71	design-system-starter	softaworks	58.0	Bronze
72	Elasticsearch	TerminalSkills	58.0	Bronze
73	mermaid-diagrams	softaworks	57.5	Bronze
74	ship-learn-next	softaworks	57.5	Bronze
75	OpenCV Computer Vision	TerminalSkills	57.5	Bronze
76	content-research-writer	ComposioHQ	57.0	Bronze
77	slack-gif-creator	ComposioHQ	57.0	Bronze
78	backend-to-frontend-handoff-docs	softaworks	57.0	Bronze
79	Godot Game Engine	TerminalSkills	57.0	Bronze
80	meeting-insights-analyzer	ComposioHQ	56.5	Bronze
81	dispatching-parallel-agents	obra	56.5	Bronze
82	agent-md-refactor	softaworks	56.5	Bronze
83	command-creator	softaworks	56.5	Bronze
84	dependency-updater	softaworks	56.5	Bronze
85	gepetto	softaworks	56.5	Bronze
86	naming-analyzer	softaworks	56.5	Bronze
87	Contract Review	TerminalSkills	56.5	Bronze
88	GDPR Compliance	TerminalSkills	56.5	Bronze
89	Kubernetes Helm	TerminalSkills	56.5	Bronze
90	ClawSec Security Scanner	prompt-security	56.0	Bronze
91	writing-clearly-and-concisely	softaworks	56.0	Bronze
92	Unreal Engine	TerminalSkills	56.0	Bronze
93	invoice-organizer	ComposioHQ	55.5	Bronze
94	react-useeffect	softaworks	55.5	Bronze
95	session-handoff	softaworks	55.5	Bronze
96	webapp-testing	ComposioHQ	55.0	Bronze
97	Ship Learn Next (Tapestry)	michalparkola	55.0	Bronze
98	AI Guardrails	TerminalSkills	55.0	Bronze
99	PyTorch	TerminalSkills	55.0	Bronze
100	ShaderGraph Editor	tomkrikorian	55.0	Bronze
101	CSV Data Summarizer	coffeefuelbump	54.5	Bronze
102	YouTube Transcript	michalparkola	54.5	Bronze
103	Session Log (Tapestry)	michalparkola	54.0	Bronze
104	using-git-worktrees	obra	54.0	Bronze
105	writing-plans	obra	54.0	Bronze
106	file-organizer	ComposioHQ	53.5	Bronze
107	Article Extractor	michalparkola	53.5	Bronze
108	Learn This (Tapestry)	michalparkola	53.5	Bronze
109	canvas-design	ComposioHQ	53.0	Bronze
110	Pomodoro System Skill	jakedahn	53.0	Bronze
111	react-native-skills	vercel-labs	52.5	Bronze
112	competitive-ads-extractor	ComposioHQ	51.5	Bronze
113	jira	softaworks	51.5	Bronze
114	Sensei Skill Improver	spboyer	51.5	Bronze
115	GraphQL	TerminalSkills	51.5	Bronze
116	gRPC	TerminalSkills	51.5	Bronze
117	Markdown Exporter	bowenliang123	51.0	Bronze
118	domain-name-brainstormer	softaworks	50.5	Bronze
119	composition-patterns	vercel-labs	50.5	Bronze
120	gemini	softaworks	50.0	Bronze
121	Algorithmic Trading	TerminalSkills	49.5	Bronze
122	AWS Agentic AI	zxkane	49.0	Bronze
123	changelog-generator	ComposioHQ	48.0	Bronze
124	reducing-entropy	softaworks	48.0	Bronze
125	frontend-to-backend-requirements	softaworks	47.5	Bronze
126	crafting-effective-readmes	softaworks	47.0	Bronze
127	recall	arjunkmrm	46.5	Bronze
128	lead-research-assistant	ComposioHQ	46.5	Bronze
129	remotion-best-practices	remotion-dev	46.5	Bronze
130	Solidity Smart Contracts	TerminalSkills	46.5	Bronze
131	ClawSec Security Suite	prompt-security	46.0	Bronze
132	game-changing-features	softaworks	46.0	Bronze
133	Soul Guardian	prompt-security	45.5	Bronze
134	draw-io	softaworks	45.5	Bronze
135	excalidraw	softaworks	45.5	Bronze
136	Data Analysis	TerminalSkills	45.5	Bronze
137	Sixtyfour People Intelligence API	rxhxm	45.0	Bronze
138	commit-work	softaworks	44.5	Bronze
139	Apache Kafka	TerminalSkills	44.5	Bronze
140	Jules (Gemini Agent Delegation)	sanjay3290	44.0	Bronze
141	Postgres	sanjay3290	44.0	Bronze
142	Markdown to EPUB Converter	smerchek	44.0	Bronze
143	connect	ComposioHQ	43.5	Bronze
144	Apache Spark	TerminalSkills	43.5	Bronze
145	Review Implementing	mhattingpete	43.0	Bronze
146	executing-plans	obra	43.0	Bronze
147	requesting-code-review	obra	43.0	Bronze
148	datadog-cli	softaworks	43.0	Bronze
149	plugin-forge	softaworks	43.0	Bronze
150	Software Architecture (NeoLab)	NeoLabHQ	42.5	Bronze
151	perplexity	softaworks	41.5	Bronze
152	brand-guidelines	ComposioHQ	40.5	Bronze
153	theme-factory	ComposioHQ	39.5	Bronze
154	Outline Wiki	sanjay3290	39.5	Bronze
155	codex	softaworks	39.5	Bronze
156	NotebookLM Integration	PleasePrompto	39.0	Bronze
157	meme-factory	softaworks	39.0	Bronze
158	web-to-markdown	softaworks	38.5	Bronze
159	raffle-winner-picker	ComposioHQ	38.0	Bronze
160	using-superpowers	obra	38.0	Bronze
161	Prompt Agent	prompt-security	38.0	Bronze
162	internal-comms	ComposioHQ	37.5	Bronze
163	Reddit Fetch	ykdojo	37.5	Bronze
164	Test Fixing	mhattingpete	36.0	Bronze
165	Deep Research (Gemini)	sanjay3290	34.5	Bronze
166	Imagen (Gemini)	sanjay3290	25.5	Bronze
167	image-enhancer	ComposioHQ	25.0	Bronze

All skills scored using SupaScore 6D rubric (Research Quality 15%, Prompt Engineering 25%, Practical Utility 15%, Completeness 10%, User Satisfaction 20%, Decision Usefulness 15%). Same rubric applied to all 1,278 SupaSkills.

Disclaimer: This benchmark was conducted in good faith using a documented, reproducible methodology. The same rubric and scoring process is applied to our own skills. Errors are possible. If you are a skill author and believe your skill was scored unfairly, miscategorised, or evaluated based on an outdated version, please contact us. We are happy to re-score with an updated version, correct any factual errors, or discuss methodology. The goal of this audit is to improve the ecosystem, not to diminish anyone's work. The security vulnerability statistic (26.1%) is sourced from Liu et al., 2026 (arXiv:2601.10338), an independent academic study.