Agentic Dev System - Part 3: The Architecture

🏛️ System Overview: The 5-Layer Cake

Forget single-agent prompting. This is a full system architecture with separation of concerns. Each layer handles one job well.

👤 Layer 1: User Interface

Where you interact — Telegram, CLI, or Web Dashboard. You describe what you want in natural language.

🧠 Layer 2: Orchestrator

The brain. Decomposes tasks, assigns agents, manages state, handles failures. This is where the magic happens.

🤖 Layer 3: Specialized Agents

Focused workers — Coder, Reviewer, Tester, Researcher. Each is an instance of Claude Code with a specific role.

💾 Layer 4: Memory and Context

Persistent knowledge — project memory, skills, session history, codebase index. Feeds context to agents.

🔧 Layer 5: Tools and Infrastructure

Terminal, Git, Docker, CI/CD, Cloudflare deployment. The execution environment.

🤖 The Agent Roster

Six specialized roles. Each spawned as a sub-agent via Claude Code's ACP protocol.

🎯

Orchestrator Agent

Receives user requests, decomposes into tasks, assigns to worker agents, monitors progress, integrates results. The PM of the operation.

🔍

Researcher Agent

Reads codebases, searches documentation, indexes files, builds context. Provides the knowledge that other agents need.

💻

Coder Agent(s)

Writes code. Can spawn multiple instances for parallel work. Each gets a focused task with full context from Researcher.

🧪

Tester Agent

Runs tests, generates test cases, validates output, reports failures. The quality gate.

👀

Reviewer Agent

Code review — checks for bugs, security issues, style violations, architectural consistency. The second pair of eyes.

🚀

Deployer Agent

Handles builds, Docker, deployment, monitoring. Takes tested code and ships it.

🔄 The Agentic Workflow

Here's how a task flows through the system:

User Request (Telegram/CLI) │ ▼ ┌─────────────────────────────────────────┐ │ ORCHESTRATOR │ │ 1. Parse request │ │ 2. Load project memory + skills │ │ 3. Decompose into subtasks │ │ 4. Create execution plan │ └─────────────┬───────────────────────────┘ │ ┌─────────┼─────────┐ ▼ ▼ ▼ ┌────────┐ ┌────────┐ ┌────────┐ │RESEARCH│ │ CODER │ │ CODER │ ← Parallel execution │ Agent │ │ Agent │ │ Agent │ └───┬────┘ └───┬────┘ └───┬────┘ │ │ │ └──────────┼──────────┘ ▼ ┌──────────────┐ │ TESTER │ ← Automated verification │ Agent │ └──────┬───────┘ │ ┌──────┴───────┐ ▼ ▼ ┌─────────┐ ┌──────────┐ │ PASS ✅ │ │ FAIL ❌ │──→ Back to Coder └────┬────┘ └──────────┘ ▼ ┌──────────┐ │ REVIEWER │ ← Human-like review │ Agent │ └────┬─────┘ ▼ ┌──────────┐ │ DEPLOYER │ ← Ship it │ Agent │ └──────────┘

💾 Memory Architecture

The memory system is what separates this from "just using Claude Code." Three tiers:

🧠 Tier 1: Project Memory

Architecture decisions
Tech stack choices
Coding conventions
Known gotchas

Stored: MEMORY.md in project root

📚 Tier 2: Skills Library

Reusable patterns
API documentation
Tool-specific guides
Proven workflows

Stored: ~/.hermes/skills/

🕐 Tier 3: Session History

Recent decisions
Code changes made
Test results
User feedback

Stored: session transcripts

Key design decision: Memory is FILE-BASED, not database-based. This means any Claude Code session can read it, you can edit it manually, and Git tracks changes. Simple beats complex on a Mac Mini.

📁 Project Structure

The complete directory layout for the agentic system:

agentic-dev/ ├── orchestrator.yaml # Agent config + task routing rules ├── agents.yaml # Agent definitions and capabilities ├── memory/ │ ├── MEMORY.md # Project-level persistent memory │ ├── DECISIONS.md # Architecture Decision Records │ └── skills/ # Reusable workflow patterns ├── agents/ │ ├── orchestrator.py # Task decomposition + routing │ ├── researcher.py # Codebase analysis + indexing │ ├── coder.py # Code generation + editing │ ├── tester.py # Test execution + generation │ ├── reviewer.py # Code review pipeline │ └── deployer.py # Build + deploy automation ├── tools/ │ ├── context_manager.py # RAG-based context injection │ ├── git_ops.py # Git workflow automation │ ├── test_runner.py # Multi-framework test runner │ └── notifier.py # Telegram/desk notifications ├── pipelines/ │ ├── feature.py # Full feature development pipeline │ ├── bugfix.py # Bug investigation + fix pipeline │ └── refactor.py # Safe refactoring pipeline ├── dashboard/ │ └── index.html # Real-time task monitoring └── config/ ├── models.yaml # Model routing (cheap vs powerful) └── quality.yaml # Quality gate thresholds

⚡ Model Routing Strategy

Not every task needs the most expensive model. Route intelligently:

🟢 Fast/Cheap (Haiku/GPT-4o-mini)

Code formatting
Simple file edits
Test execution
Documentation updates
Search and indexing

🔴 Powerful (Sonnet/Opus)

Architecture decisions
Complex debugging
Multi-file refactoring
Code review
Security analysis

Cost optimization: Route 70% of tasks to cheap models, 30% to powerful ones. Estimated savings: 60% vs using Sonnet for everything. Your Mac Mini handles the routing — no cloud dependency.

🔐 Quality Gates

Every piece of code passes through these gates before reaching you:

Code Generated by Coder Agent │ ▼ ┌──────────────────────────┐ │ Gate 1: Syntax + Types │ → ruff, mypy, typescript ├──────────────────────────┤ │ Gate 2: Unit Tests │ → pytest, jest, go test ├──────────────────────────┤ │ Gate 3: Integration │ → docker-compose test env ├──────────────────────────┤ │ Gate 4: Security Scan │ → bandit, semgrep ├──────────────────────────┤ │ Gate 5: Code Review │ → Reviewer Agent ├──────────────────────────┤ │ Gate 6: Human Approval │ → You (optional for minor) └──────────────────────────┘ │ ▼ MERGED ✅

🏗️ Agentic Dev System: The Architecture

🏛️ System Overview: The 5-Layer Cake

🤖 The Agent Roster

🔄 The Agentic Workflow

💾 Memory Architecture

🧠 Tier 1: Project Memory

📚 Tier 2: Skills Library

🕐 Tier 3: Session History

📁 Project Structure

⚡ Model Routing Strategy

🟢 Fast/Cheap (Haiku/GPT-4o-mini)

🔴 Powerful (Sonnet/Opus)

🔐 Quality Gates