npm install -g tasuki

Real multi-agent orchestration.
Not just prompt-switching.

In Claude Code, each agent is a real teammate — its own context window, its own model — via Agent Teams. In other AI tools, Tasuki provides structured role-switching that still enforces discipline. 9 hooks, 2-layer memory, a sequential pipeline. Tasuki (襟) — the sash used in Japan to tie back sleeves before working. Say "tasuki" and the pipeline activates.

Get started — it's free See the pipeline

npm

npm install -g tasuki

homebrew

brew tap ForeroAlexander/tasuki && brew install tasuki

9 mechanical hooks Agent Teams 2-layer memory MIT License $0 to onboard

Claude Code — your-project

$ tasuki onboard . ✓ FastAPI + PostgreSQL + React detected ✓ 9 agents configured · 9 hooks installed · Agent Teams enabled Ready. Talk to Claude Code normally. You: tasuki Add user authentication with JWT [Planner] → PRD designed · "Continue?" ✓ [QA] → 14 failing tests written (TDD) [DB Arch] → Migration: users, sessions, tokens [Backend] → JWT auth — 14/14 pass ✓ [Frontend] → Login/signup UI · all states [Security] → OWASP audit — PASS ✓ [Reviewer] → Code review — APPROVE ✓ ✓ Feature complete · $1.24 in API credits

Process — 9 specialist agents

Real teammates,
not prompt personas.

In Claude Code, each agent runs as an Agent Team member — a real subprocess with its own context window and model. In other tools, Tasuki provides guided role-switching with structured handoffs. Either way: planner first, tests before code, security audit, code review. No shortcuts. See the full pipeline →

Planner

Stage 1

Designs architecture, writes PRDs, decomposes work into per-agent tasks. Waits for your confirmation before any code is written.

thinking · opus

Stage 2

Writes failing tests before any code exists. The test IS the specification. The tdd-guard hook mechanically blocks implementation without tests.

execution · sonnet

DB Architect

Stage 3

Zero-downtime migrations in your exact format. Verifies the plan matches QA's tests before touching the schema.

execution · sonnet

Backend Dev

Stage 4

Runs failing tests first, then implements until all pass. APIs, services, background jobs. Structured handoff to Frontend.

execution · sonnet

Frontend Dev

Stage 5

Shows a design preview for your approval before building. Accessible, responsive UI with all states covered. Figma or generated.

execution · sonnet

Debugger

Stage 5.5

Reactive — only activates when tests fail. Max 5 diagnostic steps, then escalates. Never restarts from zero after a failed fix.

execution · sonnet

Security

Stage 6

Full OWASP Top 10:2025 audit. Variant analysis — finds one vuln, searches the entire codebase. Always runs. No excuses.

thinking · opus

Reviewer

Stage 7

Reads every changed file end-to-end. Cross-references models ↔ routes ↔ schemas ↔ tests. 3-round fix loop. Always runs.

thinking · opus

DevOps

Stage 8

Updates Docker, CI/CD, deploys, runs health checks via Sentry. Only activates when there are infrastructure changes to make.

execution · sonnet

How it works

Sequential by design,
not by accident.

Say "tasuki" in your prompt and the pipeline-trigger hook activates the full process. 9 hooks enforce discipline mechanically — no advisory rules, no hoping the AI behaves. In Claude Code, this is real orchestration via Agent Teams. Every stage has a reason to go exactly where it does. Deep dive →

Planner

Confirms with you

Reads your project context, designs architecture, saves the PRD to tasuki-plans/. Decomposes into per-agent tasks via shared task list. Shows you the summary and waits before any code is written.

QA — tests first

TDD enforced

Writes failing tests before any implementation exists. The test is the specification. The tdd-guard hook mechanically blocks implementation code if no test file exists — exit code 2, no bypass.

DB Architect

Verifies the plan matches QA's tests before creating anything. Migration + model in production-safe DDL patterns: expand-contract for renames, batched defaults for large tables.

Backend Dev

Runs the failing tests first to confirm they fail for the right reason. Implements until all tests pass. Produces a structured handoff with every new endpoint, auth behavior, and env var needed by Frontend.

⚡ test checkpoint

all pass → continue · any fail → debugger

Frontend Dev

Design preview

Asks if you have a Figma file or want a generated preview before building. Builds responsive, accessible UI consuming verified backend endpoints. All states covered.

⚡ test checkpoint

all pass → security · any fail → debugger

Security

Always runs

OWASP Top 10:2025 full checklist, automated Semgrep scan, and variant analysis. Finds one vulnerability, searches the entire codebase for the same root cause pattern.

Reviewer

Always runs

Reads every changed file end-to-end, not just the diff. Cross-references models ↔ routes ↔ schemas ↔ tests. 3-round fix loop with regression detection after every change.

8–9

DevOps + Completion

Deploys if needed, health checks via Sentry, then writes a clean summary. Lessons learned save to the knowledge graph — the next task starts smarter.

Works with 8 AI tools

Built for Claude Code.
Compatible with the rest.

One onboard generates config for every tool. Claude Code gets full mechanical automation. Every other tool gets structured role-switching — still dramatically better than nothing.

Claude Code

Full pipeline

Pipeline execution

Agent Teams — real teammates with separate context windows

TDD hook

Mechanical block (exit 2)

Security hook

Mechanical block

Agent isolation

Real — each agent is a subprocess with its own context

Model per agent

Yes — Opus for thinking, Sonnet for execution

Inter-agent messaging

Native (TeammateIdle, TaskCompleted hooks)

Memory vault

Full knowledge graph

Cursor · Codex · Copilot

+ Windsurf, Continue, Roo Code, Gemini

Compatible

Pipeline execution

Role-switching (guided)

TDD hook

Advisory rules

Security hook

Advisory rules

Agent isolation

Same context

Memory vault

Full knowledge graph

Memory — two layers

Memory that persists
across sessions.

Layer 1: wikilinks — $0, offline, human-readable. Layer 2: RAG deep memory — on-demand for deep context. Two layers that get smarter with every task. No server, no embeddings pipeline, no infra. Full architecture →

Traditional RAG

Code → Chunking → Embeddings
→ Vector DB → Semantic search
→ LLM query → $$ per query

Needs: server, API keys, infra
Visibility: black box

Tasuki Memory

Code → Agent learns → writes .md
→ [[wikilinks]] connect knowledge
→ next run → reads only its memories

Layer 1: $0, offline, human-readable
Layer 2: on-demand, local SQLite

Layer 1 — Wikilinks

Each agent reads only its own memories via [[links]]. Backend Dev reads 18 memories → ~72 tokens. Zero extra cost.

Max 20 per agent · 4-dimension format · opens in Obsidian

Layer 2 — Deep Memory

Vector search over schema, APIs, plans, git history. Agent queries: "how do we handle auth?" → full models, endpoints, past PRDs.

Local SQLite · auto-syncs · 50+ entries · <50ms queries

Memory comparison

Claude remembers.
Tasuki's agents learn.

Claude Memory

claude.ai

Scope

General — one pool for everything

Learning

Manual — you tell it or it decides

All memories loaded every time

Confidence

None — no decay, no scoring

Connections

Isolated entries, no relations

Visibility

Hidden — you can't easily browse or edit

Tasuki Memory Vault

Per-agent memory

Scope

Per-agent — each specialist has its own memories

Learning

Automatic — agents write what they learned after every task

Filtered — [[wikilinks]] load only relevant entries (~72 tokens)

Confidence

Scored — high/experimental/deprecated + auto-decay + applied-count

Connections

Knowledge graph — BFS traversal across related nodes

Visibility

Open — tasuki dashboard or any editor (plain markdown, git-versioned)

Claude Memory is great for personal preferences.
Tasuki Memory Vault is for agents that get smarter about your project with every task.

Cost

Tasuki is free.
You pay for API credits.

The CLI is MIT open source. Onboarding is bash — $0, zero API calls. The only cost is your normal AI usage, made more efficient.

Fast fix

Small bugs, quick tweaks. Complexity 1–3.

~$0.26

~$0.15 – $0.50 per task

3 agents (Planner, Dev, Reviewer)
~18K tokens
Tests + security still run
Skips unused stages

tasuki mode fast

Standard feature

New endpoints, components, integrations. Complexity 5–6.

~$0.68

~$0.50 – $1.50 per task

6 agents, full pipeline
~46K tokens
TDD + OWASP audit
3-round review loop

tasuki mode standard

Full feature

Complex features, auth systems. Complexity 8–10.

~$1–3

varies with retries & fix rounds

All 9 agents
~71K tokens baseline
Debugger rounds if needed
Full variant analysis

tasuki mode serious

$50 in API credits ≈ 50–80 tasks ≈ roughly one month of active development.
~60% savings from model tiering · ~86% from filtered memory loading · ~40% from agent routing

FAQ

Common questions

Just say "tasuki" in your prompt and the pipeline activates. The pipeline-trigger hook detects the keyword and starts the full process. Everything else — memory loading, hooks, stage tracking — is automatic.

Runs 5 bash detectors against your actual files — no API calls, zero tokens, $0. Detects stack from real imports, versions from package files, paths from directory structure. Generates .tasuki/ with 9 configured agents, 9 hooks, verified project facts, and an initialized knowledge graph. Takes about 10 seconds.

Yes, for Claude Code. Stages 1–9 run automatically via Agent Teams — each agent runs as a real teammate with its own context window. Tasuki activates Agent Teams automatically during onboard. It stops twice for your input: after the Planner shows you the plan and before Frontend starts building. Everything else is automatic. On Cursor and other tools, the pipeline is structured role-switching — the AI follows instructions in order, but you guide transitions between stages.

That's fine — the QA agent writes the tests first for new features. The tdd-guard hook only blocks edits to implementation files when no corresponding test file exists — it won't break your existing codebase.

No. Tasuki is a CLI tool that runs entirely locally — it generates config files. Your code goes wherever your existing AI tool sends it, same as before. Tasuki adds zero new data flows.

13 profiles auto-detected: FastAPI, Django, Flask, Next.js, SvelteKit, Nuxt, Express, NestJS, Rails, Gin, Spring Boot, Laravel, and Generic fallback. If your stack isn't listed, Generic still configures all 9 agents — it just won't have stack-specific migration patterns.

Agent Teams is Claude Code's experimental feature for real multi-agent orchestration. Each agent runs as a subprocess with its own context window and model, not just a prompt persona. Tasuki activates it automatically during onboard. To opt out, set CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=0.

Yes. Run brew tap ForeroAlexander/tasuki && brew install tasuki. Works on macOS and Linux.

Real multi-agent orchestration.
Not just prompt-switching.

From onboard to feature — in one prompt.

Real teammates,
not prompt personas.

Sequential by design,
not by accident.

Built for Claude Code.
Compatible with the rest.

Memory that persists
across sessions.

Claude remembers.
Tasuki's agents learn.

Tasuki is free.
You pay for API credits.

Common questions

Real orchestration. Persistent memory.
A process your AI
can't skip.

Real multi-agent orchestration.Not just prompt-switching.

From onboard to feature — in one prompt.

Real teammates,not prompt personas.

Sequential by design,not by accident.

Built for Claude Code.Compatible with the rest.

Memory that persistsacross sessions.

Claude remembers.Tasuki's agents learn.

Tasuki is free.You pay for API credits.

Common questions

Real orchestration. Persistent memory.A process your AIcan't skip.

Real multi-agent orchestration.
Not just prompt-switching.

Real teammates,
not prompt personas.

Sequential by design,
not by accident.

Built for Claude Code.
Compatible with the rest.

Memory that persists
across sessions.

Claude remembers.
Tasuki's agents learn.

Tasuki is free.
You pay for API credits.

Real orchestration. Persistent memory.
A process your AI
can't skip.