spec-driven-dev

Your plan, fresh agents, zero drift.

A structured workflow for AI-assisted development — from discussion to reviewed, tested, standards-compliant code, through a version-controlled plan. 2 skills, 7 agents, ~800 lines of markdown. No code, no config, no state directories — just prompts.

Prerequisites

Claude Code (requires a Max/Team subscription or API key)

Install

/plugin marketplace add mkrtchian/spec-driven-dev
/plugin install spec-driven-dev@mkrtchian

Usage

# 1. Discuss the feature, draft and review the plan
/write-plan

# 2. Review the plan yourself, adjust if needed

# 3. Execute the plan step by step
/clear
/implement-plan plans/YYYY-MM-DD_my-feature.md

The problem

AI coding assistants hit two walls on non-trivial changes:

They don't know what to build. The more autonomy you give them, the more they drift from your intent. Without a precise spec, you spend more time correcting than you save.
Context degrades. A single conversation that discusses requirements, writes code, runs tests, and reviews standards will do all of these poorly. The agent loses focus as context fills up, and large changes exceed what fits in one pass.

The approach

Two skills, each orchestrating fresh agents with isolated context. Each agent starts with a fresh context window, focused on a single concern — no attention pollution between phases.

flowchart TD
    subgraph "/write-plan"
        A["Discussion with you"] --> B["Draft plan"]
        B --> C["Review plan for additional gaps"]
        C --> D["Check plan for coding standards"]
        D --> E["Break into steps that fit in context"]
    end

    E --> F["You review the plan"]

    subgraph "/implement-plan"
        G["For each step"]
        G --> H["Implement · red-green for business logic"]
        H --> I["Harden · catch drift, fix issues, commit"]
        I -- next step --> G

        G -. all steps done .-> J["Enforce coding standards on full diff"]
        J --> K["Final review · fix issues, flag trade-offs"]
    end

    F --> G

    style A fill:#f3f0ff,stroke:#7c3aed
    style F fill:#fef3c7,stroke:#d97706
    style K fill:#ecfdf5,stroke:#059669

Design decisions

Isolated passes. A single agent asked to "implement this plan, follow TDD, and check coding standards" will do all three poorly. An agent that just spent 20 minutes implementing code is not in the right mindset to review standards — it's biased toward defending what it just wrote. Fresh context per concern — same principle as code review. For the detailed rationale, see workflow.md.

Plans in git. Your plan is a plain markdown file — it goes through your normal PR review process. No special directories, no hidden state. Two developers can plan and implement different features on different branches without interfering. A plan typically covers: context and approach, files to modify with code details, what stays unchanged, edge cases, test scenarios, and verification commands.

Sequential execution. Each pass builds on verified state — simpler to reason about, debug, and review. Parallel execution saves time but adds coordination complexity that isn't worth it for single-feature work.

Dynamic discovery over configuration. Skills detect your project's test runner, linter, and standards by finding and reading CLAUDE.md and other relevant files. Nothing is hardcoded to a stack.

Conditional TDD. Business logic gets test-first. Glue code, wiring, and config changes don't. This matches how experienced developers actually work.

Step hardening. After each implementation step, a fresh agent verifies alignment with the plan and fixes emergent issues. Problems are caught early, not discovered at the end.

Who is this for

Developers working on non-trivial features where AI "just do it" approaches produce drift and rework
Teams that do code review and want AI-generated code to go through the same rigor
Anyone who wants a predictable, inspectable AI workflow — plan in git, fresh agents, no hidden state

What's in this repo

skills/          2 orchestrator skills (/write-plan, /implement-plan)
agents/          7 custom agent definitions (reviewer, implementer, hardener, etc.)
docs/            Workflow guide and framework comparison

Each agent is a custom agent definition distributed via the plugin. The orchestrator skills reference them by subagent_type — their prompt content is never loaded into the orchestrator's own context. Manual installation is not supported; the plugin system handles resolution.

Reliability

In practice, well-structured prompts with Opus are followed 95%+ of the time — tests run, TDD is applied, standards are checked. The step hardener catches most of the remaining 5% by verifying each step with fresh context before committing.

For hard guarantees on test/lint/typecheck, pair with git pre-commit hooks — agents trigger them on every commit.

Comparison

Tested on the same feature and repo as GSD and Superpowers. All three produced working implementations. The key difference is in the review layer: spec-driven-dev runs dedicated review passes with fresh agents that never saw the code being written — same principle as human code review, where the reviewer shouldn't be the author. The trade-off is speed (~22 min vs ~15 min for the others).

For the full benchmark and detailed analysis, see the framework comparison.

Contributing

Contributions welcome — open an issue to discuss before submitting a PR.

License

MIT — Roman Mkrtchian

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
.claude-plugin		.claude-plugin
.claude/skills/commit		.claude/skills/commit
agents		agents
docs		docs
skills		skills
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

spec-driven-dev

Prerequisites

Install

Usage

The problem

The approach

Design decisions

Who is this for

What's in this repo

Reliability

Comparison

Contributing

License

About

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

spec-driven-dev

Prerequisites

Install

Usage

The problem

The approach

Design decisions

Who is this for

What's in this repo

Reliability

Comparison

Contributing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!