<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Agent-Skills on ToolGenix — AI Tools Discovery &amp; Reviews</title>
    <link>https://toolgenix.nxtniche.com/tags/agent-skills/</link>
    <description>Recent content in Agent-Skills on ToolGenix — AI Tools Discovery &amp; Reviews</description>
    <generator>Hugo</generator>
    <language>en-us</language>
    <lastBuildDate>Thu, 11 Jun 2026 00:00:00 +0000</lastBuildDate>
    <atom:link href="https://toolgenix.nxtniche.com/tags/agent-skills/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Agent Skills: 7 Commands That Make AI Agents Write Prod Code</title>
      <link>https://toolgenix.nxtniche.com/posts/agent-skills-quick-review-2026-06-11/</link>
      <pubDate>Thu, 11 Jun 2026 00:00:00 +0000</pubDate>
      <guid>https://toolgenix.nxtniche.com/posts/agent-skills-quick-review-2026-06-11/</guid>
      <description>Addy Osmani&amp;#39;s agent-skills gives AI coding agents 7 production-grade commands — spec-first, test-driven, review-before-merge. I tested it on Claude Code.</description>
      <content:encoded><![CDATA[<p>AI coding agents are incredible at generating code fast. But ask them to write a spec before touching the keyboard? Review their own output before shipping? Crickets.</p>
<p>The problem isn&rsquo;t the models — it&rsquo;s that agents lack structured engineering workflows. They jump straight to code. No plan. No tests. No review. It works for small scripts, but for anything production-grade, that shortcut burns you sooner or later.</p>
<p>That&rsquo;s exactly what agent-skills (52.5k★ on GitHub, trending #1 as I type this) is built to fix. Addy Osmani — yes, the Google Chrome engineering manager who wrote the book on JavaScript patterns — put together 7 slash commands that encode senior engineer workflows for Claude Code, Cursor, Gemini CLI, and OpenCode. Install once, and your agent suddenly knows how to spec-first, test-drive, and review-before-merge. Like having a staff engineer sitting next to your terminal.</p>
<blockquote>
<p>This pairs nicely with today&rsquo;s <a href="/posts/claude-mem-review-2026-06-11/">claude-mem review</a>. If claude-mem gives your agent a memory, agent-skills gives it a work ethic. Together they form a pretty complete coding agent setup.</p>
</blockquote>
<h2 id="the-7-commands">The 7 Commands</h2>
<p>Here&rsquo;s the full pipeline. So each command maps to a real engineering phase:</p>
<table>
	<thead>
			<tr>
					<th style="text-align: left">Command</th>
					<th style="text-align: left">Phase</th>
					<th style="text-align: left">What It Does</th>
			</tr>
	</thead>
	<tbody>
			<tr>
					<td style="text-align: left"><code>/spec</code></td>
					<td style="text-align: left">Define</td>
					<td style="text-align: left">Writes a detailed specification before any code. Defines scope, inputs, outputs, edge cases</td>
			</tr>
			<tr>
					<td style="text-align: left"><code>/plan</code></td>
					<td style="text-align: left">Plan</td>
					<td style="text-align: left">Breaks the spec into small, atomic implementation tasks</td>
			</tr>
			<tr>
					<td style="text-align: left"><code>/build</code></td>
					<td style="text-align: left">Build</td>
					<td style="text-align: left">Implements one slice at a time — no more 500-line monster PRs</td>
			</tr>
			<tr>
					<td style="text-align: left"><code>/test</code></td>
					<td style="text-align: left">Verify</td>
					<td style="text-align: left">Generates tests alongside the code. Tests are proof the code works</td>
			</tr>
			<tr>
					<td style="text-align: left"><code>/review</code></td>
					<td style="text-align: left">Review</td>
					<td style="text-align: left">Reviews the code you just wrote — catches issues before you merge</td>
			</tr>
			<tr>
					<td style="text-align: left"><code>/code-simplify</code></td>
					<td style="text-align: left">Simplify</td>
					<td style="text-align: left">Refactors for clarity. &ldquo;Clarity over cleverness&rdquo; is the guiding principle</td>
			</tr>
			<tr>
					<td style="text-align: left"><code>/ship</code></td>
					<td style="text-align: left">Deploy</td>
					<td style="text-align: left">Prepares the final commit. Change log, summary, ready to push</td>
			</tr>
	</tbody>
</table>
<p>I installed this on a personal React + FastAPI side project I&rsquo;ve been building with Claude Code. My workflow before agent-skills was basically: type a prompt → Claude generates code → I manually review → ship. But about 40% of the time I&rsquo;d catch something the second pass — missing edge cases, no error handling, tests that only exercise the happy path.</p>
<p>After setting up agent-skills, I ran through the full pipeline on a new API endpoint. Here&rsquo;s what that looked like.</p>
<h2 id="quick-start--one-command">Quick Start — One Command</h2>
<p>For Claude Code users, the install is dead simple:</p>
<pre tabindex="0"><code>/plugin marketplace add addyosmani/agent-skills
/plugin install agent-skills@addy-agent-skills
</code></pre><p>That&rsquo;s it. Took me about 15 seconds. Still, in Cursor you drop the SKILL.md into <code>.cursor/rules/</code>. Gemini CLI and OpenCode have native support via <code>.gemini/commands</code> and <code>.opencode/</code> directories — so no extra config needed.</p>
<h2 id="walking-through-a-real-workflow">Walking Through a Real Workflow</h2>
<p>I picked a concrete task: &ldquo;Add a WebSocket endpoint for real-time logs to the monitoring dashboard.&rdquo; Something that touches frontend, backend, and state management — classic multi-layer work that usually trips agents up.</p>
<p><strong>Step 1 — <code>/spec</code></strong>: The agent asked clarifying questions about reconnection strategy, auth requirements, and log retention. Then it wrote a 3-page spec covering the full WebSocket lifecycle. That took about 2 minutes. But here&rsquo;s the kicker — before agent-skills, I would&rsquo;ve started coding immediately and discovered these edge cases three refactors later.</p>
<p><strong>Step 2 — <code>/plan</code></strong>: The spec got broken into 5 atomic tasks: (1) backend connection manager, (2) WebSocket handler, (3) Frontend hook, (4) UI component, (5) integration test. And each was small enough to complete in one session — the kind of granularity that makes a real difference when you&rsquo;re context-switching.</p>
<p><strong>Step 3 — <code>/build</code></strong> through <strong><code>/ship</code></strong>: I ran each command in sequence. Now, the <code>/test</code> step generated integration tests that caught a race condition I would&rsquo;ve missed — the WebSocket reconnection loop was firing twice on a dropped connection. Even the <code>/review</code> step flagged an unused import and an unhandled error path. Though what surprised me most was <code>/code-simplify</code> — it collapsed a 40-line state machine into 25 lines with clearer logic.</p>
<p>Total time for the full pipeline: about 45 minutes. Still, the same task without this structure would&rsquo;ve taken me 20 minutes of coding plus 30 minutes of debugging the stuff I missed. So roughly the same total time — but the output was <strong>significantly higher quality</strong>, with test coverage baked in from the start.</p>
<h2 id="what-to-watch-out-for">What to Watch Out For</h2>
<p>Agent-skills is young. The commands work well for the happy path, but complex edge cases — like nested sub-tasks or cross-repository refactors — can produce specs that are overly verbose or plans that miss downstream dependencies. I noticed the <code>/plan</code> output sometimes creates tasks that overlap, especially when the change touches multiple files.</p>
<p>Also, this only makes sense if you already use AI coding agents regularly. If you&rsquo;re not on Claude Code, Cursor, or Gemini CLI, agent-skills has nothing to plug into. And if you&rsquo;re a solo dev who prefers to code directly without agent assistance, this whole layer of indirection might feel like overhead.</p>
<p>One more thing — the project moves fast. At 821 stars per day and a last commit 4 hours ago, expect breaking changes as the skill packs evolve. Still, the fundamentals (7 commands) are stable, and the exact behavior will only sharpen over time.</p>
<h2 id="bottom-line">Bottom Line</h2>
<p>Agent-skills solves a real problem: AI coding agents that skip engineering discipline. The 7-command pipeline is opinionated, lightweight, and drops into existing workflows with zero friction. So if you&rsquo;ve been frustrated by your agent producing messy code that works but isn&rsquo;t maintainable — this is worth the 15-second install. And if you&rsquo;re already running it and watching token burn during those spec and review steps, our <a href="/posts/headroom-quick-review-2026/">Headroom review</a> shows how to cut API costs by up to 92%.</p>
]]></content:encoded>
    </item>
  </channel>
</rss>
