Add wiki documentation for xml-pipeline.org

Comprehensive documentation set for XWiki:
- Home, Installation, Quick Start guides
- Writing Handlers and LLM Router guides
- Architecture docs (Overview, Message Pump, Thread Registry, Shared Backend)
- Reference docs (Configuration, Handler Contract, CLI)
- Hello World tutorial
- Why XML rationale
- Pandoc conversion scripts (bash + PowerShell)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-20 20:40:47 -08:00 · 2026-01-20 20:40:47 -08:00 · 515c738abb
commit 515c738abb
parent c01428260c
17 changed files with 3632 additions and 0 deletions
--- a/docs/wiki/Home.md
+++ b/docs/wiki/Home.md
@@ -0,0 +1,67 @@
 # xml-pipeline
 **A tamper-proof nervous system for multi-agent AI systems.**
 xml-pipeline (also called AgentServer) provides a schema-driven, Turing-complete message bus where AI agents communicate through validated XML payloads. It features automatic XSD generation, handler isolation, and built-in security guarantees against agent misbehavior.
 ## Why XML?
 While JSON dominates web APIs, XML provides critical features for secure multi-agent systems:
 - **Schema validation** — XSD enforces exact contracts on the wire
 - **Namespaces** — Safely mix vocabularies without collision
 - **Canonicalization** — C14N enables deterministic signing
 - **Repair tolerance** — Malformed XML can be recovered; malformed JSON cannot
 See [[Why XML]] for the full rationale.
 ## Key Features
 | Feature | Description |
 |---------|-------------|
 | **Schema-Driven** | Define payloads as Python dataclasses; XSD generated automatically |
 | **Handler Isolation** | Handlers are sandboxed—cannot forge identity or escape threads |
 | **Thread Tracking** | Opaque UUIDs hide topology; call chains tracked privately |
 | **LLM Router** | Multi-backend routing with failover, rate limiting, retries |
 | **Multiprocess Ready** | CPU-bound handlers run in ProcessPoolExecutor |
 | **Shared State** | Redis/Manager backends for distributed deployments |
 ## Quick Links
 ### Getting Started
 - [[Installation]] — Install the package
 - [[Quick Start]] — Run your first organism in 5 minutes
 - [[Configuration]] — Configure organisms via YAML
 ### Guides
 - [[Writing Handlers]] — Create message handlers
 - [[Using the LLM Router]] — Call language models from handlers
 - [[Multiprocess Handlers]] — Run CPU-bound work in separate processes
 ### Architecture
 - [[Architecture Overview]] — How xml-pipeline works
 - [[Message Pump]] — The streaming message processor
 - [[Thread Registry]] — Call chain tracking with opaque UUIDs
 - [[Shared Backend]] — Cross-process state with Redis
 ### Tutorials
 - [[Hello World Tutorial]] — Build a greeting agent step by step
 - [[Calculator Tool Tutorial]] — Create a tool that agents can call
 ### Reference
 - [[Handler Contract]] — Handler function signature and return types
 - [[Configuration Reference]] — Complete organism.yaml specification
 - [[CLI Reference]] — Command-line interface
 ## Version
 Current version: **0.4.0**
 ## License
 MIT License
 ## Links
 - [GitHub Repository](https://github.com/xml-pipeline/xml-pipeline)
 - [PyPI Package](https://pypi.org/project/xml-pipeline/)
--- a/docs/wiki/Installation.md
+++ b/docs/wiki/Installation.md
@@ -0,0 +1,97 @@
 # Installation
 ## Requirements
 - Python 3.11 or higher
 - pip, uv, or pipx
 ## Install from PyPI
 ```bash
 pip install xml-pipeline
 ```
 ## Install with Optional Features
 xml-pipeline has optional dependencies for different use cases:
 ```bash
 # Core only (minimal)
 pip install xml-pipeline
 # With Anthropic Claude support
 pip install xml-pipeline[anthropic]
 # With OpenAI support
 pip install xml-pipeline[openai]
 # With Redis for distributed deployments
 pip install xml-pipeline[redis]
 # With web search capability
 pip install xml-pipeline[search]
 # With interactive console (for examples)
 pip install xml-pipeline[console]
 # Everything
 pip install xml-pipeline[all]
 # Development (includes testing tools)
 pip install xml-pipeline[dev]
 ```
 ## Install from Source
 ```bash
 # Clone the repository
 git clone https://github.com/xml-pipeline/xml-pipeline.git
 cd xml-pipeline
 # Create virtual environment
 python -m venv .venv
 # Activate (Windows)
 .venv\Scripts\activate
 # Activate (Linux/macOS)
 source .venv/bin/activate
 # Install in development mode
 pip install -e ".[all]"
 ```
 ## Verify Installation
 ```bash
 # Check version
 xml-pipeline version
 # Or use the short alias
 xp version
 ```
 Expected output:
 ```
 xml-pipeline 0.4.0
 Python 3.11.x
 Features: anthropic, console, redis, search
 ```
 ## Environment Variables
 Create a `.env` file in your project root for API keys:
 ```env
 # LLM Provider Keys (add the ones you need)
 XAI_API_KEY=xai-...
 ANTHROPIC_API_KEY=sk-ant-...
 OPENAI_API_KEY=sk-...
 ```
 The library automatically loads `.env` files via python-dotenv.
 ## Next Steps
 - [[Quick Start]] — Run your first organism
 - [[Configuration]] — Learn about organism.yaml
--- a/docs/wiki/LLM-Router.md
+++ b/docs/wiki/LLM-Router.md
@@ -0,0 +1,303 @@
 # LLM Router
 The LLM Router provides a unified interface for language model calls. Agents request a model by name; the router handles backend selection, failover, rate limiting, and retries.
 ## Overview
 ```
 ┌─────────────────────────────────────────────────────────────────┐
 │                        Agent Handler                             │
 │   response = await complete("grok-4.1", messages)               │
 └─────────────────────────────────┬───────────────────────────────┘
                                  │
                                  ▼
 ┌─────────────────────────────────────────────────────────────────┐
 │                         LLM Router                               │
 │  • Find backends serving model                                   │
 │  • Select backend (strategy)                                     │
 │  • Retry on failure                                              │
 │  • Track usage per agent                                         │
 └────────────┬────────────────┬────────────────┬──────────────────┘
             │                │                │
             ▼                ▼                ▼
      ┌──────────┐     ┌──────────┐     ┌──────────┐
      │   XAI    │     │Anthropic │     │  Ollama  │
      │ Backend  │     │ Backend  │     │ Backend  │
      └──────────┘     └──────────┘     └──────────┘
 ```
 ## Quick Start
 ### Basic Usage
 ```python
 from xml_pipeline.platform.llm_api import complete
 response = await complete(
    model="grok-4.1",
    messages=[
        {"role": "system", "content": "You are helpful."},
        {"role": "user", "content": "Hello!"},
    ],
 )
 print(response.content)
 ```
 ### In a Handler
 ```python
 async def my_agent(payload: Query, metadata: HandlerMetadata) -> HandlerResponse:
    from xml_pipeline.platform.llm_api import complete
    response = await complete(
        model="grok-4.1",
        messages=[
            {"role": "system", "content": metadata.usage_instructions},
            {"role": "user", "content": payload.question},
        ],
        temperature=0.7,
        max_tokens=2048,
    )
    return HandlerResponse(
        payload=Answer(text=response.content),
        to="output",
    )
 ```
 ## Configuration
 ### organism.yaml
 ```yaml
 llm:
  strategy: failover           # Backend selection strategy
  retries: 3                   # Max retry attempts
  retry_base_delay: 1.0        # Base delay for backoff
  retry_max_delay: 60.0        # Max delay between retries
  backends:
    - provider: xai
      api_key_env: XAI_API_KEY
      priority: 1              # Lower = preferred
      rate_limit_tpm: 100000   # Tokens per minute
      max_concurrent: 20       # Concurrent request limit
    - provider: anthropic
      api_key_env: ANTHROPIC_API_KEY
      priority: 2
    - provider: openai
      api_key_env: OPENAI_API_KEY
      priority: 3
    - provider: ollama
      base_url: http://localhost:11434
      supported_models: [llama3, mistral]
 ```
 ### Environment Variables
 ```env
 # .env file
 XAI_API_KEY=xai-abc123...
 ANTHROPIC_API_KEY=sk-ant-...
 OPENAI_API_KEY=sk-...
 ```
 ## Supported Providers
 | Provider | Models | Auth |
 |----------|--------|------|
 | `xai` | grok-* | Bearer token |
 | `anthropic` | claude-* | x-api-key header |
 | `openai` | gpt-*, o1-*, o3-* | Bearer token |
 | `ollama` | Any local model | None (local) |
 ### Model Routing
 The router automatically selects backends based on model name:
 - `grok-4.1` → XAI backend
 - `claude-sonnet-4` → Anthropic backend
 - `gpt-4o` → OpenAI backend
 - `llama3` → Ollama (if in `supported_models`)
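The name-based routing above can be pictured as a pattern lookup. The following is a minimal sketch, assuming glob-style patterns taken from the Supported Providers table; `PROVIDER_PATTERNS` and `backends_for` are hypothetical names used for illustration and are not the router's actual internals.

```python
from fnmatch import fnmatchcase

# Hypothetical pattern table mirroring the "Supported Providers" table.
PROVIDER_PATTERNS = {
    "xai": ["grok-*"],
    "anthropic": ["claude-*"],
    "openai": ["gpt-*", "o1-*", "o3-*"],
}


def backends_for(model: str, ollama_models: tuple = ()) -> list:
    """Return the providers whose patterns match the requested model name."""
    matches = [
        provider
        for provider, patterns in PROVIDER_PATTERNS.items()
        if any(fnmatchcase(model, pattern) for pattern in patterns)
    ]
    # Ollama serves a model only when it is listed in `supported_models`.
    if model in ollama_models:
        matches.append("ollama")
    return matches


print(backends_for("grok-4.1"))
print(backends_for("llama3", ollama_models=("llama3", "mistral")))
```

A model that matches no pattern yields an empty list, which a real router would presumably surface as an error.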
 ## Strategies
 ### Failover (Default)
 Tries backends in priority order. Falls back on error.
 ```yaml
 llm:
  strategy: failover
  backends:
    - provider: xai
      priority: 1        # Try first
    - provider: anthropic
      priority: 2        # Fallback
 ```
 ### Round-Robin
 Distributes requests evenly across backends.
 ```yaml
 llm:
  strategy: round-robin
 ```
 ### Least-Loaded
 Routes to the backend with lowest current load.
 ```yaml
 llm:
  strategy: least-loaded
 ```
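The three strategies differ only in the order in which candidate backends are tried. A minimal sketch of that ordering follows, assuming each backend exposes a priority and a count of in-flight requests; the `Backend` and `BackendSelector` classes are hypothetical and only illustrate the idea.

```python
from dataclasses import dataclass


@dataclass
class Backend:
    name: str
    priority: int = 0   # lower = preferred (used by failover)
    in_flight: int = 0  # current concurrent requests (used by least-loaded)


class BackendSelector:
    """Hypothetical helper that orders backends according to a strategy."""

    def __init__(self, backends, strategy="failover"):
        if not backends:
            raise ValueError("at least one backend is required")
        self.backends = list(backends)
        self.strategy = strategy
        self._next = 0  # rotation position for round-robin

    def order(self) -> list:
        """Return backends in the order they should be tried."""
        if self.strategy == "failover":
            return sorted(self.backends, key=lambda b: b.priority)
        if self.strategy == "round-robin":
            start = self._next % len(self.backends)
            self._next += 1
            return self.backends[start:] + self.backends[:start]
        if self.strategy == "least-loaded":
            return sorted(self.backends, key=lambda b: b.in_flight)
        raise ValueError(f"unknown strategy: {self.strategy}")
```

Returning the full ordering rather than a single backend lets the caller fall through to the next candidate when a request fails.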
 ## Response Format
 ```python
 from dataclasses import dataclass
 from typing import Any, Dict
 @dataclass
 class LLMResponse:
    content: str                    # Generated text
    model: str                      # Model used
    usage: Dict[str, int]           # Token counts
    finish_reason: str              # stop, length, tool_calls
    raw: Any                        # Provider-specific response
 ```
 ### Usage Dict
 ```python
 response.usage = {
    "prompt_tokens": 150,
    "completion_tokens": 50,
    "total_tokens": 200,
 }
 ```
 ## Parameters
 ```python
 response = await complete(
    model="grok-4.1",              # Required: model name
    messages=[...],                # Required: conversation
    temperature=0.7,               # Optional: randomness (0-2)
    max_tokens=2048,               # Optional: response limit
    top_p=0.9,                     # Optional: nucleus sampling
    stop=["END"],                  # Optional: stop sequences
 )
 ```
 ## Error Handling
 ### Rate Limits
 On 429 responses:
 1. Reads `Retry-After` header
 2. Falls back to exponential backoff with jitter if the header is absent
 3. Tries next backend (if failover)
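The delay calculation behind these steps might look like the sketch below. It assumes the "full jitter" variant of exponential backoff and reuses the `retry_base_delay` and `retry_max_delay` settings from `organism.yaml` as defaults; the function name `retry_delay`, the cap applied to `Retry-After`, and the handling of only the seconds form of that header are all assumptions for illustration.

```python
import random


def retry_delay(attempt, retry_after=None, base_delay=1.0, max_delay=60.0,
                rng=random.random):
    """Seconds to wait before retry number `attempt` (0-based)."""
    if retry_after is not None:
        # A server-provided hint wins, capped at the configured maximum.
        return min(float(retry_after), max_delay)
    # Exponential growth: base, 2*base, 4*base, ... capped at max_delay.
    ceiling = min(base_delay * (2 ** attempt), max_delay)
    # Full jitter: pick a random point between 0 and the ceiling.
    return ceiling * rng()


for attempt in range(4):
    print(attempt, round(retry_delay(attempt), 2))
```

Jitter spreads retries from many concurrent handlers so they do not all hit a recovering backend at the same instant.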
 ### Provider Errors
 On 5xx responses:
 1. Logs error
 2. Retries with backoff
 3. Tries next backend (if failover)
 ### All Backends Failed
 ```python
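 import logging
 from xml_pipeline.llm.router import BackendError
 from xml_pipeline.platform.llm_api import complete
 logger = logging.getLogger(__name__)
 try:
    response = await complete(model, messages)
 except BackendError as e:
    # All backends failed
    logger.error(f"LLM call failed: {e}")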
 ```
 ## Rate Limiting
 Each backend has independent limits:
 - **Token bucket**: Limits tokens per minute (`rate_limit_tpm`)
 - **Semaphore**: Limits concurrent requests (`max_concurrent`)
 Requests wait if limits are reached.
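The two limits compose naturally: a request first waits for tokens, then for a free concurrency slot. The sketch below shows one way this could work, assuming a continuously refilling bucket; `TokenBucket` and `LimitedBackend` are hypothetical classes, not the library's own.

```python
import asyncio
import time


class TokenBucket:
    """Refills continuously at `tokens_per_minute`; callers wait when empty."""

    def __init__(self, tokens_per_minute: int, clock=time.monotonic):
        self.capacity = float(tokens_per_minute)
        self.rate = tokens_per_minute / 60.0  # tokens per second
        self.tokens = self.capacity
        self._clock = clock
        self._last = clock()

    def wait_time(self, needed: int) -> float:
        """Seconds until `needed` tokens are available (0.0 if available now)."""
        if needed > self.capacity:
            raise ValueError("request exceeds the bucket capacity")
        now = self._clock()
        self.tokens = min(self.capacity, self.tokens + (now - self._last) * self.rate)
        self._last = now
        if self.tokens >= needed:
            return 0.0
        return (needed - self.tokens) / self.rate

    async def acquire(self, needed: int) -> None:
        # Sleep until enough tokens have accumulated, then spend them.
        while (delay := self.wait_time(needed)) > 0:
            await asyncio.sleep(delay)
        self.tokens -= needed


class LimitedBackend:
    """Applies both limits before invoking the provider call."""

    def __init__(self, rate_limit_tpm: int, max_concurrent: int):
        self.bucket = TokenBucket(rate_limit_tpm)
        self.slots = asyncio.Semaphore(max_concurrent)

    async def call(self, estimated_tokens: int, send):
        await self.bucket.acquire(estimated_tokens)
        async with self.slots:  # at most `max_concurrent` calls in flight
            return await send()
```

The token count has to be estimated before the call, since the real usage is only known from the response.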
 ## Token Tracking
 Track usage per agent:
 ```python
 from xml_pipeline.llm.router import get_router
 router = get_router()
 # Get usage for agent
 usage = router.get_agent_usage("greeter")
 print(f"Total tokens: {usage.total_tokens}")
 print(f"Requests: {usage.request_count}")
 # Reset tracking
 router.reset_agent_usage("greeter")
 ```
 ## Best Practices
 ### 1. Use System Prompts
 ```python
 response = await complete(
    model="grok-4.1",
    messages=[
        {"role": "system", "content": metadata.usage_instructions},
        {"role": "user", "content": payload.query},
    ],
 )
 ```
 ### 2. Handle Errors Gracefully
 ```python
 try:
    response = await complete(model, messages)
 except BackendError:
    return HandlerResponse(
        payload=ErrorResponse(message="LLM unavailable"),
        to=metadata.from_id,
    )
 ```
 ### 3. Set Appropriate Limits
 ```yaml
 llm:
  backends:
    - provider: xai
      rate_limit_tpm: 50000   # Conservative limit
      max_concurrent: 10      # Prevent overload
 ```
 ### 4. Use Failover for Reliability
 ```yaml
 llm:
  strategy: failover
  backends:
    - provider: xai
      priority: 1
    - provider: anthropic
      priority: 2  # Backup
 ```
 ## See Also
 - [[Writing Handlers]] — Using LLM in handlers
 - [[Configuration]] — Full LLM configuration
 - [[Architecture Overview]] — System architecture
--- a/docs/wiki/Quick-Start.md
+++ b/docs/wiki/Quick-Start.md
@@ -0,0 +1,159 @@
 # Quick Start
 Get an organism running in 5 minutes.
 ## 1. Install the Package
 ```bash
 pip install xml-pipeline[console]
 ```
 ## 2. Create a Project Directory
 ```bash
 mkdir my-organism
 cd my-organism
 ```
 ## 3. Initialize Configuration
 ```bash
 xml-pipeline init my-organism
 ```
 This creates:
 ```
 my-organism/
 ├── config/
 │   └── organism.yaml
 ├── handlers/
 │   └── hello.py
 └── .env.example
 ```
 ## 4. Examine the Generated Files
 ### config/organism.yaml
 ```yaml
 organism:
  name: my-organism
  port: 8765
 listeners:
  - name: greeter
    payload_class: handlers.hello.Greeting
    handler: handlers.hello.handle_greeting
    description: A friendly greeting handler
    peers: []
 ```
 ### handlers/hello.py
 ```python
 from dataclasses import dataclass
 from third_party.xmlable import xmlify
 from xml_pipeline.message_bus.message_state import HandlerMetadata, HandlerResponse
@xmlify
@dataclass
 class Greeting:
    """A greeting request."""
    name: str
@xmlify
@dataclass
 class GreetingResponse:
    """A greeting response."""
    message: str
 async def handle_greeting(payload: Greeting, metadata: HandlerMetadata) -> HandlerResponse:
    """Handle a greeting and respond."""
    return HandlerResponse(
        payload=GreetingResponse(message=f"Hello, {payload.name}!"),
        to=metadata.from_id,  # Reply to sender
    )
 ```
 ## 5. Run the Organism
 ```bash
 xml-pipeline run config/organism.yaml
 ```
 You should see:
 ```
 Organism: my-organism
 Listeners: 1
 Root thread: abc123-...
 Routing: ['greeter.greeting']
 ```
 ## 6. Try the Interactive Console
 If you installed with `[console]`:
 ```bash
 python -m examples.console
 ```
 Type `@greeter Alice` to send a greeting message.
 ## What Just Happened?
 1. **Payload defined** — `Greeting` dataclass with `@xmlify` decorator
 2. **XSD generated** — Schema auto-created at `schemas/greeter/v1.xsd`
 3. **Handler registered** — `handle_greeting` mapped to `greeter.greeting` root tag
 4. **Message pump started** — Waiting for messages
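The registration step can be pictured as building a table from root tag to listeners. This is a minimal sketch, assuming the routing key is the listener name joined with the lowercased payload class name (inferred from the single `greeter.greeting` example in the startup output); `Listener` and `build_routing_table` are hypothetical names.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Callable


@dataclass
class Listener:
    name: str
    payload_class: type
    handler: Callable


def build_routing_table(listeners) -> dict:
    """Map each routing key to the listeners registered for it."""
    table = defaultdict(list)
    for listener in listeners:
        # Assumed key format: "<listener name>.<payload class, lowercased>"
        key = f"{listener.name}.{listener.payload_class.__name__.lower()}"
        table[key].append(listener)
    return dict(table)


class Greeting:
    """Stand-in for the payload class defined in handlers/hello.py."""


table = build_routing_table([Listener("greeter", Greeting, print)])
print(list(table))
```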
 ## Understanding the Message Flow
 ```
 Input: @greeter Alice
   │
   ▼
 ┌─────────────────────────────────────┐
 │  XML Envelope Created               │
 │  <message>                          │
 │    <meta>                           │
 │      <from>console</from>           │
 │      <to>greeter</to>               │
 │      <thread>uuid-123</thread>      │
 │    </meta>                          │
 │    <greeting>                       │
 │      <name>Alice</name>             │
 │    </greeting>                      │
 │  </message>                         │
 └─────────────────────────────────────┘
   │
   ▼
 ┌─────────────────────────────────────┐
 │  Pipeline Processing                │
 │  1. Repair (fix malformed XML)      │
 │  2. C14N (canonicalize)             │
 │  3. Envelope validation             │
 │  4. Payload extraction              │
 │  5. XSD validation                  │
 │  6. Deserialization → Greeting      │
 │  7. Route to greeter handler        │
 └─────────────────────────────────────┘
   │
   ▼
 ┌─────────────────────────────────────┐
 │  Handler Execution                  │
 │  handle_greeting(                   │
 │    payload=Greeting(name="Alice"),  │
 │    metadata=HandlerMetadata(...)    │
 │  )                                  │
 │  → HandlerResponse(...)             │
 └─────────────────────────────────────┘
   │
   ▼
 Output: Hello, Alice!
 ```
 ## Next Steps
 - [[Writing Handlers]] — Create your own handlers
 - [[Configuration]] — Customize organism.yaml
 - [[Hello World Tutorial]] — Step-by-step tutorial
--- a/docs/wiki/README.md
+++ b/docs/wiki/README.md
@@ -0,0 +1,97 @@
 # Wiki Documentation
 This directory contains documentation for the xml-pipeline.org XWiki.
 ## Structure
 ```
 wiki/
 ├── Home.md                      # Wiki home page
 ├── Installation.md              # Installation guide
 ├── Quick-Start.md               # Quick start guide
 ├── Writing-Handlers.md          # Handler guide
 ├── LLM-Router.md                # LLM router guide
 ├── Why-XML.md                   # Rationale for XML
 ├── architecture/
 │   ├── Overview.md              # Architecture overview
 │   ├── Message-Pump.md          # Message pump details
 │   ├── Thread-Registry.md       # Thread registry details
 │   └── Shared-Backend.md        # Shared backend details
 ├── reference/
 │   ├── Configuration.md         # Full configuration reference
 │   ├── Handler-Contract.md      # Handler specification
 │   └── CLI.md                   # CLI reference
 ├── tutorials/
 │   └── Hello-World.md           # Step-by-step tutorial
 ├── convert-to-xwiki.sh          # Bash conversion script
 ├── convert-to-xwiki.ps1         # PowerShell conversion script
 └── README.md                    # This file
 ```
 ## Converting to XWiki Format
 ### Prerequisites
 Install Pandoc: https://pandoc.org/installing.html
 ### Convert All Files
 **Windows (PowerShell):**
 ```powershell
 cd docs/wiki
 .\convert-to-xwiki.ps1
 ```
 **Linux/macOS (Bash):**
 ```bash
 cd docs/wiki
 chmod +x convert-to-xwiki.sh
 ./convert-to-xwiki.sh
 ```
 ### Output
 Converted files are placed in the `xwiki/` directory with a `.xwiki` extension.
 ## Uploading to XWiki
 ### Option 1: XWiki REST API
 ```bash
 # Upload a single page
 curl -u admin:password -X PUT \
  'https://xml-pipeline.org/rest/wikis/xwiki/spaces/Docs/pages/Home' \
  -H 'Content-Type: text/plain' \
  -d @xwiki/Home.xwiki
 ```
 ### Option 2: XWiki Import
 1. Go to XWiki Administration
 2. Content → Import
 3. Upload the files
 ### Option 3: Copy/Paste
 1. Create page in XWiki
 2. Switch to Wiki editing mode
 3. Paste converted content
 ## Wiki Links
 The Markdown files use `[[Page Name]]` wiki-link syntax. Pandoc converts these to XWiki's `[[Page Name]]` format.
 If your XWiki uses a different space structure, you may need to adjust links:
 ```
 [[Installation]]           → [[Docs.Installation]]
 [[architecture/Overview]]  → [[Docs.Architecture.Overview]]
 ```
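A link adjustment like this can be scripted. The sketch below is one possible approach, assuming a single target space named `Docs` and that path segments map to nested spaces with their first letter capitalized, as in the two examples; `add_space_prefix` is a hypothetical helper and is not part of the conversion scripts.

```python
import re

# Matches [[Target]] links without a pipe-separated label.
WIKI_LINK = re.compile(r"\[\[([^\]|]+)\]\]")


def add_space_prefix(text: str, space: str = "Docs") -> str:
    """Rewrite [[Page]] links to [[Docs.Page]], mapping '/' to nested spaces."""

    def rewrite(match):
        target = match.group(1).strip()
        if target.startswith(space + "."):
            return match.group(0)  # already prefixed; leave untouched
        # Capitalize only the first letter of each path segment.
        segments = [seg[:1].upper() + seg[1:] for seg in target.split("/")]
        return "[[" + ".".join([space, *segments]) + "]]"

    return WIKI_LINK.sub(rewrite, text)


print(add_space_prefix("See [[Installation]] and [[architecture/Overview]]."))
```

Page names that contain dots or other reference separators are not handled by this sketch.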
 ## Updating Documentation
 1. Edit the Markdown files in this directory
 2. Run the conversion script
 3. Upload to XWiki
 Keep Markdown as the source of truth for version control.
--- a/docs/wiki/Why-XML.md
+++ b/docs/wiki/Why-XML.md
@@ -0,0 +1,254 @@
 # Why XML?
 XML is the right format for a sovereign, attack-resistant message bus in a multi-agent system. JSON is not.
 ## The Short Answer
 | Feature | XML | JSON |
 |---------|-----|------|
 | Schema validation | XSD (built-in, precise) | JSON Schema (optional, lossy) |
 | Namespaces | Native support | None |
 | Canonicalization | C14N standard | No standard |
 | Repair tolerance | lxml recover mode | Parser fails |
 | Comments | Supported | Forbidden |
 | Mixed content | Native | Fragile |
 ## JSON's Origins
 JSON (JavaScript Object Notation) was invented in the early 2000s as a subset of JavaScript literal syntax for simple data exchange in web browsers. It was never designed as a general-purpose format—just a quick way to serialize objects for Ajax calls.
 It became popular because:
 - Simple for JavaScript developers
 - Human-readable
 - Web API boom (REST over SOAP)
 - Low barrier to entry
 ## Why JSON Fails for Multi-Agent Systems
 ### No Schema Enforcement
 JSON Schema exists but is:
 - Optional (rarely enforced on wire)
 - Lossy (can't express all constraints)
 - Inconsistently implemented
 Result: Messages accepted without validation, bugs discovered at runtime.
 ### No Namespaces
 Can't safely mix vocabularies:
 ```json
 {
  "name": "Alice",      // User name? Product name?
  "type": "admin"       // User type? Message type?
 }
 ```
 ### No Canonicalization
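 No widely adopted standard for normalizing before signing (RFC 8785, the JSON Canonicalization Scheme, exists but is informational rather than standards-track):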
 ```json
 {"a": 1, "b": 2}
 {"b": 2, "a": 1}
 ```
 Same data? Different bytes. Can't sign reliably.
 ### No Repair Tolerance
 One syntax error → entire payload rejected:
 ```json
 {"name": "Alice",}     // Trailing comma → FAIL
 ```
 ### Escaping Hell
 Strings with special characters are fragile:
 ```json
 {"message": "She said \"hello\""}   // Manual escaping
 ```
 Easy to break, security vulnerability vector.
 ## Why JSON Fails for LLM Integration
 ### Hallucination Fragility
 LLMs routinely produce invalid JSON:
 - Trailing commas
 - Missing quotes
 - Wrong nesting
 - Comments (forbidden!)
 Result: Massive prompt bloat ("You MUST output valid JSON, NO trailing commas EVER...") and post-processing parsers.
 ### No Graceful Degradation
 One parse error → entire response lost. No partial recovery.
 ### Injection Attacks
 User input in strings can break JSON structure:
 ```json
 {"user_input": "Alice", "role": "admin"}
 ```
 If user provides `", "role": "admin"` in their name → injection.
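The attack arises when a payload is assembled by string concatenation instead of a serializer. The sketch below reproduces it with Python's standard `json` module, which keeps the last value when a key is duplicated; `build_naive` and `build_escaped` are hypothetical helpers, and the key order differs from the example above so that the injected key wins.

```python
import json


def build_naive(name: str) -> str:
    # Unsafe: user input is spliced into the document without escaping.
    return '{"role": "guest", "user_input": "' + name + '"}'


def build_escaped(name: str) -> str:
    # A serializer escapes the quotes, so the input stays inside its string.
    return json.dumps({"role": "guest", "user_input": name})


malicious = 'Alice", "role": "admin'

print(json.loads(build_naive(malicious))["role"])
print(json.loads(build_escaped(malicious))["role"])
```

Nothing in the format itself rejects the naive document: it parses cleanly, and which duplicate key wins depends on the parser.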
 ## Why XML Succeeds
 ### Schema as Contract
 XSD enforces exact structure on the wire:
 ```xml
 <xs:element name="greeting">
  <xs:complexType>
    <xs:sequence>
      <xs:element name="name" type="xs:string"/>
    </xs:sequence>
  </xs:complexType>
 </xs:element>
 ```
 Every message validated before processing. No ambiguity.
 ### Namespaces
 Safe vocabulary mixing:
 ```xml
 <message xmlns="https://xml-pipeline.org/ns/envelope/v1">
  <user:profile xmlns:user="https://example.org/user">
    <user:name>Alice</user:name>
  </user:profile>
 </message>
 ```
 ### Canonicalization (C14N)
 Deterministic representation for signing:
 ```python
 c14n_bytes = etree.tostring(tree, method='c14n')
 signature = sign(c14n_bytes)
 ```
 Same logical content → same bytes → verifiable signatures.
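The property can be checked without lxml. The sketch below uses the standard library's `xml.etree.ElementTree.canonicalize` (C14N 2.0, Python 3.8+) as a stand-in for the lxml call, and a SHA-256 digest as a stand-in for a real signature; the `digest` helper is hypothetical.

```python
import hashlib
from xml.etree.ElementTree import canonicalize

# Same logical document: attribute order, quoting, and tag whitespace differ.
doc_a = '<greeting lang="en" formal="true"><name>Alice</name></greeting>'
doc_b = "<greeting formal='true'  lang='en' ><name>Alice</name></greeting>"


def digest(xml_text: str) -> str:
    """Hash the canonical form so equivalent documents hash identically."""
    canonical = canonicalize(xml_text)  # returns a str when no `out` is given
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


print(digest(doc_a) == digest(doc_b))
```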
 ### Repair Tolerance
 lxml recover mode fixes common issues:
 ```python
 parser = etree.XMLParser(recover=True)
 tree = etree.fromstring(broken_xml, parser)
 ```
 Partial documents, encoding issues, missing tags → recovered.
 ### Self-Describing
 Elements carry meaning:
 ```xml
 <greeting>
  <name>Alice</name>
 </greeting>
 ```
 vs JSON:
 ```json
 ["Alice"]  // What is this?
 ```
 ## LLM + XML = Reliable
 ### Natural Streaming
 XML streams naturally (can process before complete).
 ### Repair on Output
 LLM produces broken XML? lxml fixes it:
 ```python
 from lxml import etree
 parser = etree.XMLParser(recover=True)
 tree = etree.fromstring(llm_output, parser)
 # Works even with minor errors
 ```
 ### Schema-Guided Generation
 XSD tells LLM exactly what to produce:
 ```
 Generate XML matching this schema:
 <greeting><name>string</name></greeting>
 ```
 Clear contract, fewer hallucinations.
 ### Graceful Validation
 Validation errors become helpful feedback:
 ```xml
 <huh>
  <error>Element 'greeting' missing required element 'name'</error>
 </huh>
 ```
 LLM can self-correct.
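Producing such feedback could look like the sketch below. It assumes a simple required-children check in place of full XSD validation and uses the standard library's ElementTree; `validation_feedback` is a hypothetical helper, and only the `<huh>`/`<error>` shape is taken from the example above.

```python
from typing import Optional
from xml.etree import ElementTree as ET


def validation_feedback(payload_xml: str, root_tag: str,
                        required: list) -> Optional[str]:
    """Return a <huh> error document, or None if the payload is acceptable."""
    root = ET.fromstring(payload_xml)
    errors = []
    if root.tag != root_tag:
        errors.append(f"Expected root element '{root_tag}', got '{root.tag}'")
    for name in required:
        if root.find(name) is None:
            errors.append(
                f"Element '{root.tag}' missing required element '{name}'"
            )
    if not errors:
        return None
    huh = ET.Element("huh")
    for message in errors:
        ET.SubElement(huh, "error").text = message
    return ET.tostring(huh, encoding="unicode")


print(validation_feedback("<greeting/>", "greeting", ["name"]))
```

The returned document can be sent back to the agent as its next input so it can retry with the missing element.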
 ## The Trade-Offs
 ### XML is More Verbose
 ```xml
 <greeting><name>Alice</name></greeting>
 ```
 vs
 ```json
 {"name": "Alice"}
 ```
 **But:** Compression removes most of this overhead on the wire, and the verbosity aids debugging.
 ### XML Parsing is Slower
 Microseconds more than JSON parsing.
 **But:** Network latency dominates. And lxml is highly optimized.
 ### XML is "Old"
 True. Also mature, battle-tested, standards-based.
 ## Conclusion
 JSON won the web because it was "good enough" for stateless HTTP requests.
 XML wins for multi-agent systems because:
 - Security requires schema enforcement
 - Signing requires canonicalization
 - LLMs require repair tolerance
 - Complexity requires namespaces
 **JSON won the web. XML wins the swarm.**
 ## Further Reading
 - [W3C XML Schema](https://www.w3.org/XML/Schema)
 - [Exclusive XML Canonicalization](https://www.w3.org/TR/xml-exc-c14n/)
 - [lxml Documentation](https://lxml.de/)
--- a/docs/wiki/Writing-Handlers.md
+++ b/docs/wiki/Writing-Handlers.md
@@ -0,0 +1,308 @@
 # Writing Handlers
 Handlers are async Python functions that process messages. This guide covers payload definitions, handler metadata, response types, common handler patterns, error handling, and registration.
 ## Basic Handler Structure
 Every handler follows this pattern:
 ```python
 from dataclasses import dataclass
 from third_party.xmlable import xmlify
 from xml_pipeline.message_bus.message_state import HandlerMetadata, HandlerResponse
 # 1. Define your payload (input)
@xmlify
@dataclass
 class MyPayload:
    """Description of what this payload represents."""
    field1: str
    field2: int
 # 2. Define your response (output)
@xmlify
@dataclass
 class MyResponse:
    """Description of the response."""
    result: str
 # 3. Write the handler
 async def my_handler(payload: MyPayload, metadata: HandlerMetadata) -> HandlerResponse:
    """Process the payload and return a response."""
    result = f"Processed {payload.field1} with {payload.field2}"
    return HandlerResponse(
        payload=MyResponse(result=result),
        to="next-listener",  # Where to send the response
    )
 ```
 ## The @xmlify Decorator
 The `@xmlify` decorator enables automatic XML serialization and XSD generation:
 ```python
 from dataclasses import dataclass
 from third_party.xmlable import xmlify
@xmlify
@dataclass
 class Greeting:
    name: str                    # Required field
    formal: bool = False         # Optional with default
    count: int = 1               # Optional with default
 ```
 This generates XML like:
 ```xml
 <greeting>
  <name>Alice</name>
  <formal>true</formal>
  <count>3</count>
 </greeting>
 ```
 ### Supported Types
 | Python Type | XML Representation |
 |-------------|-------------------|
 | `str` | Text content |
 | `int` | Integer text |
 | `float` | Decimal text |
 | `bool` | `true` / `false` |
 | `list[T]` | Repeated elements |
 | `Optional[T]` | Optional element |
 | `@xmlify` class | Nested element |
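The mapping in this table can be approximated with the standard library. The sketch below is a stand-in written for illustration, not the `xmlable` implementation; `to_element` is a hypothetical function, and naming the root tag after the lowercased class name is an assumption based on the `Greeting` example.

```python
from dataclasses import dataclass, fields, is_dataclass
from xml.etree import ElementTree as ET


def to_element(obj, tag=None) -> ET.Element:
    """Serialize a dataclass instance following the type table above."""
    root = ET.Element(tag or type(obj).__name__.lower())
    for field in fields(obj):
        value = getattr(obj, field.name)
        if value is None:
            continue  # Optional[T] without a value: element omitted
        items = value if isinstance(value, list) else [value]
        for item in items:  # list[T]: one element per item
            if is_dataclass(item):
                root.append(to_element(item, field.name))  # nested element
                continue
            child = ET.SubElement(root, field.name)
            # bool needs explicit handling: str(True) would give "True"
            child.text = str(item).lower() if isinstance(item, bool) else str(item)
    return root


@dataclass
class Greeting:
    name: str
    formal: bool = False
    count: int = 1


print(ET.tostring(to_element(Greeting("Alice", True, 3)), encoding="unicode"))
```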
 ### Nested Payloads
 ```python
@xmlify
@dataclass
 class Address:
    street: str
    city: str
@xmlify
@dataclass
 class Person:
    name: str
    address: Address  # Nested payload
 ```
 ## HandlerMetadata
 The `metadata` parameter provides context about the message:
 ```python
@dataclass
 class HandlerMetadata:
    thread_id: str              # Opaque thread UUID
    from_id: str                # Who sent this message
    own_name: str | None        # This listener's name (agents only)
    is_self_call: bool          # True if message is from self
    usage_instructions: str     # Peer schemas for LLM prompts
    todo_nudge: str             # System reminders about pending todos
 ```
 ### Usage Examples
 ```python
 async def my_handler(payload: MyPayload, metadata: HandlerMetadata) -> HandlerResponse:
    # Log who sent the message
    print(f"Received from: {metadata.from_id}")
    # Check if this is a self-call (agent iterating)
    if metadata.is_self_call:
        print("This is a self-call")
    # For agents: use peer schemas in LLM prompts
    if metadata.usage_instructions:
        system_prompt = metadata.usage_instructions + "\n\nYour custom instructions..."
        # Pass system_prompt to the LLM call here
    # Return a response so the handler matches its declared return type
    return HandlerResponse.respond(payload=MyResponse(result="done"))
 ```
 ## Response Types
 ### Forward to Target
 Send the response to a specific listener:
 ```python
 return HandlerResponse(
    payload=MyResponse(result="done"),
    to="next-listener",
 )
 ```
 ### Respond to Caller
 Return to whoever sent the message:
 ```python
 return HandlerResponse.respond(
    payload=ResultPayload(value=42)
 )
 ```
 This uses the thread registry to route back up the call chain.
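One way to picture this routing is a private map from opaque thread IDs to call chains. The sketch below is a simplified illustration under that assumption; the `ThreadRegistry` class and its methods are hypothetical and are not the library's actual registry.

```python
import uuid


class ThreadRegistry:
    """Maps opaque thread IDs to private call chains (illustrative only)."""

    def __init__(self):
        self._chains = {}  # thread_id -> list of listener names

    def start(self, caller: str, callee: str) -> str:
        """Open a new thread for `caller` invoking `callee`."""
        thread_id = str(uuid.uuid4())
        self._chains[thread_id] = [caller, callee]
        return thread_id

    def extend(self, thread_id: str, callee: str) -> str:
        """Record a nested call and return a fresh ID for the longer chain."""
        new_id = str(uuid.uuid4())
        self._chains[new_id] = self._chains[thread_id] + [callee]
        return new_id

    def respond_target(self, thread_id: str) -> str:
        """Who receives the response from the last listener in the chain."""
        return self._chains[thread_id][-2]


registry = ThreadRegistry()
outer = registry.start("console", "researcher")
inner = registry.extend(outer, "calculator")
print(registry.respond_target(inner))
```

Handlers only ever see the UUID, so the chain itself stays hidden from them.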
 ### Terminate Chain
 End processing with no response:
 ```python
 return None
 ```
 Use this for terminal handlers (logging, display, etc.).
 ## Handler Patterns
 ### Simple Tool
 A stateless transformation:
 ```python
@xmlify
@dataclass
 class AddPayload:
    a: int
    b: int
@xmlify
@dataclass
 class AddResult:
    sum: int
 async def add_handler(payload: AddPayload, metadata: HandlerMetadata) -> HandlerResponse:
    result = payload.a + payload.b
    return HandlerResponse.respond(payload=AddResult(sum=result))
 ```
 ### LLM Agent
 An agent that uses language models:
 ```python
 async def research_handler(payload: ResearchQuery, metadata: HandlerMetadata) -> HandlerResponse:
    from xml_pipeline.platform.llm_api import complete
    # Build prompt with peer awareness
    system_prompt = f"""
 {metadata.usage_instructions}
 You are a research agent. Answer the query using available tools.
 """
    response = await complete(
        model="grok-4.1",
        messages=[
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": payload.query},
        ],
    )
    return HandlerResponse(
        payload=ResearchResult(answer=response.content),
        to="summarizer",
    )
 ```
 ### Terminal Handler
 A handler that displays output and ends the chain:
 ```python
 async def console_output(payload: TextOutput, metadata: HandlerMetadata) -> None:
    print(f"[{payload.source}] {payload.text}")
    return None  # Chain ends here
 ```
 ### Self-Iterating Agent
 An agent that calls itself to continue reasoning:
 ```python
 async def thinking_agent(payload: ThinkPayload, metadata: HandlerMetadata) -> HandlerResponse:
    # Check if we should continue thinking
    if payload.iteration >= 5:
        return HandlerResponse(
            payload=FinalAnswer(answer=payload.current_answer),
            to="output",
        )
    # Continue thinking by calling self
    return HandlerResponse(
        payload=ThinkPayload(
            iteration=payload.iteration + 1,
            current_answer=f"Refined: {payload.current_answer}",
        ),
        to=metadata.own_name,  # Self-call
    )
 ```
 ## Error Handling
 Handlers should handle errors gracefully:
 ```python
 import logging
 logger = logging.getLogger(__name__)
 # risky_operation, ValidationError, SuccessResult, and ErrorResult are placeholders
 # for your own operation, exception type, and payload classes.
 async def safe_handler(payload: MyPayload, metadata: HandlerMetadata) -> HandlerResponse:
    try:
        result = await risky_operation(payload)
        return HandlerResponse.respond(payload=SuccessResult(data=result))
    except ValidationError as e:
        return HandlerResponse.respond(payload=ErrorResult(error=str(e)))
    except Exception as e:
        # Log and return generic error
        logger.exception("Handler failed")
        return HandlerResponse.respond(payload=ErrorResult(error="Internal error"))
 ```
 ## Registration
 Register handlers in `organism.yaml`:
 ```yaml
 listeners:
  - name: calculator.add
    payload_class: handlers.calc.AddPayload
    handler: handlers.calc.add_handler
    description: "Adds two numbers and returns the sum"
 ```
 The `description` is important—it's used in auto-generated tool prompts for LLM agents.
 ## CPU-Bound Handlers
 For computationally expensive handlers, mark them as `cpu_bound`:
 ```yaml
 listeners:
  - name: analyzer
    payload_class: handlers.analyze.AnalyzePayload
    handler: handlers.analyze.analyze_handler
    description: "Heavy document analysis"
    cpu_bound: true
 ```
 These run in a separate process pool to avoid blocking the event loop.
 ## Security Notes
 Handlers are **untrusted code**. The system enforces:
 1. **Identity injection** — `<from>` is always set by the pump, never by handlers
 2. **Thread isolation** — Handlers see only opaque UUIDs
 3. **Peer constraints** — Agents can only send to declared peers
 Even compromised handlers cannot:
 - Forge sender identity
 - Access other threads
 - Discover organism topology
 - Route to undeclared peers
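Identity injection and peer constraints can both be enforced at the point where the pump wraps a handler's response. The sketch below illustrates this with the standard library's ElementTree, assuming the envelope layout shown in the Quick Start; `build_envelope` and the `peers` mapping are hypothetical, not the pump's real API.

```python
from xml.etree import ElementTree as ET


def build_envelope(sender: str, target: str, thread_id: str,
                   payload: ET.Element, peers: dict) -> ET.Element:
    """Wrap a handler's payload; the pump, not the handler, supplies <from>."""
    # Peer constraint: the sender may only address its declared peers.
    if target not in peers.get(sender, set()):
        raise PermissionError(f"{sender!r} may not send to {target!r}")
    message = ET.Element("message")
    meta = ET.SubElement(message, "meta")
    # Identity comes from the registered listener name, never from the payload.
    ET.SubElement(meta, "from").text = sender
    ET.SubElement(meta, "to").text = target
    ET.SubElement(meta, "thread").text = thread_id
    message.append(payload)
    return message


peers = {"greeter": {"output"}}
envelope = build_envelope("greeter", "output", "uuid-123",
                          ET.Element("greeting"), peers)
print(ET.tostring(envelope, encoding="unicode"))
```

Because the handler only returns a payload and a target, it has no field through which to claim a different sender.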
 ## See Also
 - [[Handler Contract]] — Complete handler specification
 - [[Configuration]] — Registering handlers
 - [[LLM Router]] — Using language models
--- a/docs/wiki/architecture/Message-Pump.md
+++ b/docs/wiki/architecture/Message-Pump.md
@@ -0,0 +1,346 @@
 # Message Pump
 The Message Pump (StreamPump) is the heart of xml-pipeline. It orchestrates message flow from ingress through processing to handler dispatch and response handling.
 ## Overview
 The pump uses [aiostream](https://aiostream.readthedocs.io/) for stream-based processing with concurrent fan-out capabilities.
 ```python
 from xml_pipeline.message_bus.stream_pump import StreamPump
 pump = StreamPump(config)
 await pump.start()
 # Inject a message
 await pump.inject(raw_bytes, from_id="console")
 await pump.shutdown()
 ```
 ## Architecture
 ```
                    ┌─────────────────────┐
                    │   Message Source    │
                    │ (Console, WebSocket)│
                    └──────────┬──────────┘
                               │
                               ▼
 ┌──────────────────────────────────────────────────────────────┐
 │                      INGRESS PIPELINE                         │
 │                                                              │
 │  ┌─────────┐   ┌──────┐   ┌──────────┐   ┌─────────────┐   │
 │  │ Repair  │ → │ C14N │ → │ Envelope │ → │   Payload   │   │
 │  │  Step   │   │ Step │   │ Validate │   │  Extraction │   │
 │  └─────────┘   └──────┘   └──────────┘   └─────────────┘   │
 │                                                              │
 │  ┌──────────┐   ┌─────────┐   ┌─────────────┐              │
 │  │  Thread  │ → │   XSD   │ → │ Deserialize │              │
 │  │  Assign  │   │ Validate│   │   to class  │              │
 │  └──────────┘   └─────────┘   └─────────────┘              │
 │                                                              │
 └──────────────────────────────────────────────────────────────┘
                               │
                               ▼
                    ┌─────────────────────┐
                    │   ROUTING TABLE     │
                    │                     │
                    │  root_tag → [       │
                    │    Listener1,       │
                    │    Listener2,       │
                    │  ]                  │
                    └──────────┬──────────┘
                               │
              ┌────────────────┼────────────────┐
              ▼                ▼                ▼
    ┌─────────────────┐ ┌─────────────┐ ┌─────────────────┐
    │   Handler A     │ │  Handler B  │ │   Handler C     │
    │  (async/main)   │ │ (cpu_bound) │ │   (async)       │
    └────────┬────────┘ └──────┬──────┘ └────────┬────────┘
             │                 │                  │
             └─────────────────┼──────────────────┘
                               │
                               ▼
                    ┌─────────────────────┐
                    │  RESPONSE HANDLER   │
                    │                     │
                    │  • Serialize        │
                    │  • Wrap envelope    │
                    │  • Inject <from>    │
                    │  • Re-inject        │
                    └─────────────────────┘
 ```
 ## Pipeline Steps
 ### repair_step
 Fixes malformed XML using lxml's recover mode:
 ```python
 from lxml import etree
 async def repair_step(state: MessageState) -> MessageState:
    # recover=True makes lxml salvage what it can instead of raising
    parser = etree.XMLParser(recover=True)
    state.envelope_tree = etree.fromstring(state.raw_bytes, parser)
    return state
 ```
 Handles:
 - Missing closing tags
 - Invalid characters
 - Encoding issues
 ### c14n_step
 Canonicalizes XML using Exclusive C14N:
 ```python
 async def c14n_step(state: MessageState) -> MessageState:
    # exclusive=True selects Exclusive C14N; plain method='c14n' is inclusive
    c14n_bytes = etree.tostring(state.envelope_tree, method='c14n', exclusive=True)
    state.envelope_tree = etree.fromstring(c14n_bytes)
    return state
 ```
 Ensures deterministic representation for signing.
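To see why canonicalization matters for signing, the sketch below hashes two documents that differ byte-for-byte but describe the same element. It is only an illustration: it uses the standard library's `xml.etree.ElementTree.canonicalize` (C14N 2.0) as a stand-in for the lxml call above, so its exact output may differ from Exclusive C14N, and `digest_of` is a hypothetical helper, not part of xml-pipeline.

```python
import hashlib
import xml.etree.ElementTree as ET


def digest_of(xml_text: str) -> str:
    """Return the SHA-256 hex digest of the canonical form of xml_text."""
    canonical = ET.canonicalize(xml_text)  # stdlib C14N 2.0, returns a str
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# Same element, but different attribute order, quoting and empty-element syntax.
variant_a = '<greeting lang="en" tone="warm"/>'
variant_b = "<greeting   tone='warm' lang='en'></greeting>"

raw_same = variant_a == variant_b                     # raw text differs
same = digest_of(variant_a) == digest_of(variant_b)   # canonical digests agree
```

A signature computed over the canonical bytes therefore survives harmless re-serialization, which a signature over the raw bytes would not.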
 ### envelope_validation_step
 Validates against `envelope.xsd`:
 ```xml
 <message xmlns="https://xml-pipeline.org/ns/envelope/v1">
  <meta>
    <from>...</from>
    <to>...</to>        <!-- optional -->
    <thread>...</thread> <!-- optional -->
  </meta>
  <!-- payload element -->
 </message>
 ```
 ### payload_extraction_step
 Extracts the payload element:
 ```python
 async def payload_extraction_step(state: MessageState) -> MessageState:
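    # Find the first element child that is not <meta>
    for child in state.envelope_tree:
        # Skip comments/PIs (non-string tags). Compare local names because
        # lxml reports namespaced tags as "{namespace}meta".
        if isinstance(child.tag, str) and etree.QName(child).localname != 'meta':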
            state.payload_tree = child
            break
    return state
 ```
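Because the envelope declares a default namespace, parsers report child tags in `{namespace}name` form. The following is a minimal sketch of the same extraction that compares local names. It assumes the standard library's ElementTree rather than lxml, and `local_name` and `extract_payload` are hypothetical helpers introduced only for illustration.

```python
import xml.etree.ElementTree as ET
from typing import Optional

ENVELOPE_NS = "https://xml-pipeline.org/ns/envelope/v1"


def local_name(tag: str) -> str:
    """Strip a leading '{namespace}' from an ElementTree tag."""
    return tag.rsplit("}", 1)[-1]


def extract_payload(envelope: ET.Element) -> Optional[ET.Element]:
    """Return the first child of the envelope that is not <meta>."""
    for child in envelope:
        if local_name(child.tag) != "meta":
            return child
    return None


raw = f"""
<message xmlns="{ENVELOPE_NS}">
  <meta><from>console</from><to>greeter</to></meta>
  <greeting><name>Alice</name></greeting>
</message>
""".strip()

payload = extract_payload(ET.fromstring(raw))
payload_name = local_name(payload.tag)
```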
 ### thread_assignment_step
 Assigns or inherits thread UUID:
 ```python
 async def thread_assignment_step(state: MessageState) -> MessageState:
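    # The envelope uses a default namespace, so the lookup must be qualified
    ns = {'env': 'https://xml-pipeline.org/ns/envelope/v1'}
    thread_elem = state.envelope_tree.find('env:meta/env:thread', ns)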
    if thread_elem is not None:
        state.thread_id = thread_elem.text
    else:
        # New thread - assign UUID
        state.thread_id = str(uuid.uuid4())
    return state
 ```
 ### xsd_validation_step
 Validates payload against listener's schema:
 ```python
 async def xsd_validation_step(state: MessageState) -> MessageState:
    listener = find_listener_for_payload(state.payload_tree)
    schema = load_schema(listener.schema_path)
    if not schema.validate(state.payload_tree):
        state.error = "XSD validation failed"
    return state
 ```
 ### deserialization_step
 Converts XML to typed dataclass:
 ```python
 async def deserialization_step(state: MessageState) -> MessageState:
    listener = find_listener_for_payload(state.payload_tree)
    state.payload = xmlify_deserialize(
        state.payload_tree,
        listener.payload_class
    )
    return state
 ```
 ## Routing
 ### Root Tag Derivation
 Root tag = `{listener_name}.{dataclass_name}` (lowercase):
 ```
 Listener: greeter
 Dataclass: Greeting
 Root tag: greeter.greeting
 ```
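The derivation rule can be written as a one-line function. This is a sketch of the rule as stated here, not the project's actual code; `derive_root_tag` is a hypothetical name.

```python
from dataclasses import dataclass


@dataclass
class Greeting:
    name: str


def derive_root_tag(listener_name: str, payload_class: type) -> str:
    """Join the listener name and the payload class name, lowercased."""
    return f"{listener_name}.{payload_class.__name__}".lower()


root_tag = derive_root_tag("greeter", Greeting)
```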
 ### Routing Table
 ```python
 routing_table = {
    "greeter.greeting": [greeter_listener],
    "calculator.add": [calculator_listener],
    "search.query": [google_listener, bing_listener],  # Broadcast
 }
 ```
 ### Broadcast
 Multiple listeners can share a root tag when each of them sets `broadcast: true`:
 ```yaml
 listeners:
  - name: search.google
    broadcast: true
    # ...
  - name: search.bing
    broadcast: true
    # ...
 ```
 All matching listeners execute concurrently.
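Concurrent fan-out can be illustrated with plain `asyncio`. The pump itself is described as using aiostream, so treat this as a simplified stand-in: `dispatch`, the two listener coroutines, and the string payload are all assumptions made for the example.

```python
import asyncio


async def google_listener(query: str) -> str:
    await asyncio.sleep(0.01)  # simulate network I/O
    return f"google:{query}"


async def bing_listener(query: str) -> str:
    await asyncio.sleep(0.01)  # simulate network I/O
    return f"bing:{query}"


routing_table = {"search.query": [google_listener, bing_listener]}


async def dispatch(root_tag: str, payload: str) -> list:
    """Run every listener registered for root_tag concurrently."""
    listeners = routing_table.get(root_tag, [])
    # gather() schedules all coroutines at once and preserves listener order
    return await asyncio.gather(*(listener(payload) for listener in listeners))


results = asyncio.run(dispatch("search.query", "xml"))
unknown = asyncio.run(dispatch("no.such.tag", "xml"))
```

Both listeners wait at the same time, so the total latency is roughly that of the slowest listener rather than the sum.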
 ## Handler Dispatch
 ### Async Handlers (Default)
 Run in the main event loop:
 ```python
 async def _dispatch_async(self, state, listener):
    metadata = build_metadata(state, listener)
    response = await listener.handler(state.payload, metadata)
    await self._process_response(response, listener, state)
 ```
 ### CPU-Bound Handlers
 Dispatched to ProcessPoolExecutor:
 ```python
 async def _dispatch_to_process_pool(self, state, listener):
    metadata = build_metadata(state, listener)
    # Store data in shared backend
    payload_uuid, metadata_uuid = store_task_data(
        self._backend, state.payload, metadata
    )
    # Submit to pool
    task = WorkerTask(
        thread_uuid=state.thread_id,
        payload_uuid=payload_uuid,
        handler_path=listener.handler_path,
        metadata_uuid=metadata_uuid,
    )
    loop = asyncio.get_running_loop()
    result = await loop.run_in_executor(
        self._process_pool, execute_handler, task
    )
    # Fetch response from backend
    response = fetch_response(self._backend, result.response_uuid)
    await self._process_response(response, listener, state)
 ```
 ## Response Processing
 When a handler returns `HandlerResponse`:
 ```python
 async def _process_response(self, response, listener, state):
    if response is None:
        return  # Chain terminates
    # Validate target
    if listener.is_agent and listener.peers:
        if response.to not in listener.peers:
            await self._emit_error(state, "Routing error")
            return
    # Build new envelope
    payload_xml = xmlify_serialize(response.payload)
    new_state = MessageState(
        raw_bytes=wrap_in_envelope(
            payload_xml,
            from_id=listener.name,  # SYSTEM INJECTS
            thread_id=get_new_thread_id(response, state),
        ),
    )
    # Re-inject
    await self._process(new_state)
 ```
 ## Error Handling
 ### Pipeline Errors
 If any step sets `state.error`, processing stops and `<huh>` is emitted:
 ```xml
 <huh xmlns="https://xml-pipeline.org/ns/core/v1">
  <error>XSD validation failed</error>
  <original-attempt>base64-encoded-bytes</original-attempt>
 </huh>
 ```
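A `<huh>` element of this shape could be assembled as follows. This is a sketch based only on the example above, using the standard library's ElementTree; `build_huh` is a hypothetical helper and the real pump may build the element differently.

```python
import base64
import xml.etree.ElementTree as ET

CORE_NS = "https://xml-pipeline.org/ns/core/v1"


def build_huh(error: str, original: bytes) -> bytes:
    """Build a <huh> element carrying the error text and the base64 input."""
    huh = ET.Element(f"{{{CORE_NS}}}huh")
    ET.SubElement(huh, f"{{{CORE_NS}}}error").text = error
    attempt = ET.SubElement(huh, f"{{{CORE_NS}}}original-attempt")
    attempt.text = base64.b64encode(original).decode("ascii")
    # default_namespace emits xmlns="..." instead of generated ns0: prefixes
    return ET.tostring(huh, default_namespace=CORE_NS)


bad_input = b"<greeting><name>Alice</greeting>"
huh_xml = build_huh("XSD validation failed", bad_input)

# Round trip: the sender can recover exactly the bytes it originally sent.
parsed = ET.fromstring(huh_xml)
error_text = parsed.find(f"{{{CORE_NS}}}error").text
recovered = base64.b64decode(parsed.find(f"{{{CORE_NS}}}original-attempt").text)
```

Base64 keeps the failed input intact even when it is not well-formed XML, so it can be embedded safely inside the error message.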
 ### Routing Errors
 If an agent tries to route to an undeclared peer:
 ```xml
 <SystemError>
  <code>routing</code>
  <message>Message could not be delivered.</message>
  <retry-allowed>true</retry-allowed>
 </SystemError>
 ```
 ## Lifecycle
 ```python
 # Start
 pump = StreamPump(config)
 await pump.start()
 # Inject messages
 await pump.inject(raw_bytes, from_id="console")
 # Shutdown (graceful)
 await pump.shutdown()
 ```
 ## Configuration
 ```yaml
 process_pool:
  workers: 4
  max_tasks_per_child: 100
 backend:
  type: redis
  redis_url: "redis://localhost:6379"
 ```
 ## See Also
 - [[Architecture Overview]] — High-level view
 - [[Thread Registry]] — Thread tracking
 - [[Shared Backend]] — Cross-process state
 - [[Writing Handlers]] — Handler patterns
--- a/docs/wiki/architecture/Overview.md
+++ b/docs/wiki/architecture/Overview.md
@ -0,0 +1,256 @@
 # Architecture Overview
 xml-pipeline implements a stream-based message pump where all communication flows through validated XML envelopes. The architecture enforces strict isolation between handlers (untrusted code) and the system (trusted zone).
 ## High-Level Architecture
 ```
 ┌─────────────────────────────────────────────────────────────────────┐
 │                        TRUSTED ZONE (System)                        │
 │  • Thread registry (UUID ↔ call chain mapping)                      │
 │  • Listener registry (name → peers, schema)                         │
 │  • Envelope injection (<from>, <thread>, <to>)                      │
 │  • Peer constraint enforcement                                      │
 └─────────────────────────────────────────────────────────────────────┘
                               ↕
                    Coroutine Capture Boundary
                               ↕
 ┌─────────────────────────────────────────────────────────────────────┐
 │                      UNTRUSTED ZONE (Handlers)                      │
 │  • Receive typed payload + metadata                                 │
 │  • Return HandlerResponse or None                                   │
 │  • Cannot forge identity, escape thread, or probe topology          │
 └─────────────────────────────────────────────────────────────────────┘
 ```
 ## Core Components
 ### Message Pump (StreamPump)
 The central orchestrator that:
 1. Receives raw XML bytes
 2. Runs messages through preprocessing pipeline
 3. Routes to appropriate handlers
 4. Processes responses and re-injects
 See [[Message Pump]] for details.
 ### Pipeline Steps
 Messages flow through ordered processing stages:
 ```
 Raw Bytes
    │
    ▼
 ┌─────────────────┐
 │  repair_step    │  Fix malformed XML (lxml recover mode)
 └────────┬────────┘
         ▼
 ┌─────────────────┐
 │   c14n_step     │  Canonicalize XML (Exclusive C14N)
 └────────┬────────┘
         ▼
 ┌─────────────────┐
 │ envelope_valid  │  Validate against envelope.xsd
 └────────┬────────┘
         ▼
 ┌─────────────────┐
 │ payload_extract │  Extract payload from envelope
 └────────┬────────┘
         ▼
 ┌─────────────────┐
 │ thread_assign   │  Assign or inherit thread UUID
 └────────┬────────┘
         ▼
 ┌─────────────────┐
 │  xsd_validate   │  Validate against listener's XSD
 └────────┬────────┘
         ▼
 ┌─────────────────┐
 │ deserialize     │  XML → @xmlify dataclass
 └────────┬────────┘
         ▼
 ┌─────────────────┐
 │    routing      │  Match to listener(s)
 └────────┬────────┘
         ▼
    Handler
 ```
 ### Thread Registry
 Maps opaque UUIDs to call chains:
 ```
 UUID: 550e8400-e29b-41d4-...
 Chain: system.organism.console.greeter.calculator
        │       │        │       │        │
        │       │        │       │        └─ Current handler
        │       │        │       └─ Previous hop
        │       │        └─ Entry point
        │       └─ Organism name
        └─ Root
 ```
 Handlers only see the UUID. The actual chain is private to the system.
 See [[Thread Registry]] for details.
 ### Listener Registry
 Tracks registered listeners:
 ```
 name: "greeter"
  ├── payload_class: Greeting
  ├── handler: handle_greeting
  ├── description: "Friendly greeting handler"
  ├── agent: true
  ├── peers: [shouter, calculator]
  └── schema: schemas/greeter/v1.xsd
 ```
 ### Context Buffer
 Stores message history per thread:
 ```
 Thread: uuid-123
  ├── Slot 0: Greeting(name="Alice") from console
  ├── Slot 1: GreetingResponse(message="Hello!") from greeter
  └── Slot 2: ShoutResponse(text="HELLO!") from shouter
 ```
 Slots are append-only and immutable. A thread's slots are garbage-collected automatically when the thread is pruned.
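The append-only, immutable behavior can be sketched with a frozen dataclass. `ToyContextBuffer` and `ToySlot` are illustrative names, not the library's classes, and the real buffer stores more per slot than shown here.

```python
from dataclasses import dataclass, FrozenInstanceError
from typing import Any


@dataclass(frozen=True)
class ToySlot:
    index: int
    from_id: str
    payload: Any


class ToyContextBuffer:
    def __init__(self) -> None:
        self._threads = {}  # thread_id -> list of ToySlot

    def append(self, thread_id: str, from_id: str, payload: Any) -> ToySlot:
        """Add a slot at the end of the thread's history; never overwrite."""
        slots = self._threads.setdefault(thread_id, [])
        slot = ToySlot(index=len(slots), from_id=from_id, payload=payload)
        slots.append(slot)
        return slot

    def get_thread(self, thread_id: str) -> tuple:
        """Return a read-only snapshot of the thread's slots."""
        return tuple(self._threads.get(thread_id, ()))

    def prune(self, thread_id: str) -> None:
        """Drop the whole history when the thread is pruned."""
        self._threads.pop(thread_id, None)


buffer = ToyContextBuffer()
buffer.append("uuid-123", "console", "Greeting(name='Alice')")
buffer.append("uuid-123", "greeter", "GreetingResponse(message='Hello!')")
history = buffer.get_thread("uuid-123")

try:
    history[0].from_id = "attacker"  # frozen dataclass rejects assignment
    mutated = True
except FrozenInstanceError:
    mutated = False

buffer.prune("uuid-123")
after_prune = buffer.get_thread("uuid-123")
```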
 ## Message Flow
 ### 1. Message Arrival
 External message arrives (console, WebSocket, etc.):
 ```xml
 <message xmlns="https://xml-pipeline.org/ns/envelope/v1">
  <meta>
    <from>console</from>
    <to>greeter</to>
  </meta>
  <greeting>
    <name>Alice</name>
  </greeting>
 </message>
 ```
 ### 2. Pipeline Processing
 Message flows through pipeline steps. Each step transforms `MessageState`:
 ```python
@dataclass
 class MessageState:
    # Every field defaults to None so a state can start from raw_bytes alone
    raw_bytes: bytes | None = None          # Input
    envelope_tree: Element | None = None    # After repair
    payload_tree: Element | None = None     # After extraction
    payload: Any | None = None              # After deserialization
    thread_id: str | None = None            # After assignment
    from_id: str | None = None              # Sender
    target_listeners: list | None = None    # After routing
    error: str | None = None                # If step fails
 ```
 ### 3. Handler Dispatch
 Handler receives typed payload + metadata:
 ```python
 async def handle_greeting(payload: Greeting, metadata: HandlerMetadata):
    # payload.name == "Alice"
    # metadata.thread_id == "uuid-123"
    # metadata.from_id == "console"
    ...  # handler logic goes here; return a HandlerResponse or None
 ```
 ### 4. Response Processing
 Handler returns `HandlerResponse`:
 ```python
 return HandlerResponse(
    payload=GreetingResponse(message="Hello, Alice!"),
    to="shouter",
 )
 ```
 System:
 1. Validates `to` against peer list
 2. Serializes payload to XML
 3. Creates new envelope with injected `<from>`
 4. Re-injects into pipeline
 ## Trust Boundaries
 ### What the System Controls
 | Aspect | System Responsibility |
 |--------|----------------------|
 | `<from>` | Always injected from listener.name |
 | `<thread>` | Managed by thread registry |
 | `<to>` validation | Checked against peers list |
 | Schema enforcement | XSD validation on every message |
 | Call chain | Private, never exposed to handlers |
 ### What Handlers Control
 | Aspect | Handler Capability |
 |--------|-------------------|
 | Payload content | Full control |
 | Target selection | Via `HandlerResponse.to` (validated) |
 | Response/no response | Return value |
 | Self-iteration | Call own name |
 ### What Handlers Cannot Do
 - Forge sender identity
 - Access other threads
 - Discover topology
 - Route to undeclared peers
 - Modify message history
 - Access other handlers' state
 ## Multiprocess Architecture
 For CPU-bound handlers:
 ```
 ┌─────────────────────────────────────────────────────────────────┐
 │  Main Process (StreamPump)                                       │
 │  - Ingress pipeline                                             │
 │  - Routing decisions                                            │
 │  - Response re-injection                                        │
 └───────────────────────────┬─────────────────────────────────────┘
                            │ UUID + handler_path (minimal IPC)
              ┌─────────────┼─────────────┐
              ▼             ▼             ▼
 ┌─────────────────┐ ┌─────────────┐ ┌─────────────────┐
 │ Python Async    │ │ ProcessPool │ │ (Future: WASM)  │
 │ (main process)  │ │ (N workers) │ │                 │
 │ - Default mode  │ │ - cpu_bound │ │                 │
 └────────┬────────┘ └──────┬──────┘ └────────┬────────┘
         │                 │                  │
         └─────────────────┼──────────────────┘
                           ▼
 ┌─────────────────────────────────────────────────────────────────┐
 │  Shared Backend (Redis / Manager / Memory)                       │
 │  - Context buffer slots                                         │
 │  - Thread registry mappings                                     │
 └─────────────────────────────────────────────────────────────────┘
 ```
 See [[Shared Backend]] for details.
 ## See Also
 - [[Message Pump]] — Detailed pump architecture
 - [[Thread Registry]] — Call chain tracking
 - [[Shared Backend]] — Cross-process state
 - [[Handler Contract]] — Handler specification
--- a/docs/wiki/architecture/Shared-Backend.md
+++ b/docs/wiki/architecture/Shared-Backend.md
@ -0,0 +1,339 @@
 # Shared Backend
 The Shared Backend enables cross-process state sharing for multiprocess deployments. It provides storage for the Context Buffer and Thread Registry.
 ## Overview
 By default, xml-pipeline uses in-memory storage (single process). For CPU-bound handlers running in separate processes, you need shared state:
 ```
 ┌────────────────────┐     ┌────────────────────┐
 │   Main Process     │     │  Worker Process    │
 │   (StreamPump)     │     │  (cpu_bound)       │
 └─────────┬──────────┘     └──────────┬─────────┘
          │                           │
          └───────────┬───────────────┘
                      │
                      ▼
          ┌─────────────────────┐
          │   Shared Backend    │
          │  (Redis/Manager)    │
          └─────────────────────┘
 ```
 ## Backend Types
 ### InMemoryBackend (Default)
 Single-process, thread-safe storage using Python dictionaries.
 ```python
 from xml_pipeline.memory import get_shared_backend, BackendConfig
 config = BackendConfig(backend_type="memory")
 backend = get_shared_backend(config)
 ```
 **Use when:**
 - Single process deployment
 - Development/testing
 - No CPU-bound handlers
 ### ManagerBackend
 Uses `multiprocessing.Manager` for local multi-process sharing.
 ```python
 config = BackendConfig(backend_type="manager")
 backend = get_shared_backend(config)
 ```
 **Use when:**
 - Local deployment with CPU-bound handlers
 - No Redis available
 - Single machine, multiple processes
 ### RedisBackend
 Distributed storage with TTL-based auto-cleanup.
 ```python
 config = BackendConfig(
    backend_type="redis",
    redis_url="redis://localhost:6379",
    redis_prefix="xp:",
    redis_ttl=86400,  # 24 hours
 )
 backend = get_shared_backend(config)
 ```
 **Use when:**
 - Distributed deployment
 - Multiple machines
 - Need persistence
 - Production environments
 ## Configuration
 ### Via organism.yaml
 ```yaml
 backend:
  type: redis                          # memory | manager | redis
  redis_url: "redis://localhost:6379"  # Redis connection URL
  redis_prefix: "xp:"                  # Key prefix for multi-tenancy
  redis_ttl: 86400                     # Key TTL in seconds
 ```
 ### Programmatic
 ```python
 from xml_pipeline.memory import get_shared_backend, BackendConfig
 config = BackendConfig(
    backend_type="redis",
    redis_url="redis://localhost:6379",
    redis_prefix="myapp:",
    redis_ttl=3600,
 )
 backend = get_shared_backend(config)
 ```
 ## Storage Schema
 ### Context Buffer
 Stores message history per thread.
 **In-Memory/Manager:**
 ```python
 _buffers = {
    "thread-uuid-1": [slot_bytes_0, slot_bytes_1, ...],
    "thread-uuid-2": [...],
 }
 ```
 **Redis:**
 ```
 {prefix}buffer:{thread_id} → LIST of pickled BufferSlots
 ```
 ### Thread Registry
 Maps UUIDs to call chains.
 **In-Memory/Manager:**
 ```python
 _chain_to_uuid = {"console.greeter": "uuid-123"}
 _uuid_to_chain = {"uuid-123": "console.greeter"}
 ```
 **Redis:**
 ```
 {prefix}chain:{chain} → {uuid}
 {prefix}uuid:{uuid} → {chain}
 ```
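The key layout can be sketched with a plain dictionary standing in for the Redis keyspace. The helper names are hypothetical and no Redis client is involved. The example also shows why distinct prefixes keep two organisms apart.

```python
def buffer_key(prefix: str, thread_id: str) -> str:
    return f"{prefix}buffer:{thread_id}"


def chain_key(prefix: str, chain: str) -> str:
    return f"{prefix}chain:{chain}"


def uuid_key(prefix: str, thread_uuid: str) -> str:
    return f"{prefix}uuid:{thread_uuid}"


store = {}  # stands in for the Redis keyspace


def registry_set(prefix: str, chain: str, thread_uuid: str) -> None:
    """Write both directions of the chain <-> UUID mapping."""
    store[chain_key(prefix, chain)] = thread_uuid
    store[uuid_key(prefix, thread_uuid)] = chain


# Two organisms register the same chain under different prefixes.
registry_set("orgA:", "console.greeter", "uuid-123")
registry_set("orgB:", "console.greeter", "uuid-999")
```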
 ## API
 ### Buffer Operations
 ```python
 # Append a slot
 index = backend.buffer_append(thread_id, slot_bytes)
 # Get all slots for thread
 slots = backend.buffer_get_thread(thread_id)
 # Get specific slot
 slot = backend.buffer_get_slot(thread_id, index)
 # Check thread exists
 exists = backend.buffer_thread_exists(thread_id)
 # Delete thread
 deleted = backend.buffer_delete_thread(thread_id)
 # List all threads
 threads = backend.buffer_list_threads()
 # Clear all (testing)
 backend.buffer_clear()
 ```
 ### Registry Operations
 ```python
 # Set chain ↔ UUID mapping
 backend.registry_set(chain, uuid)
 # Get UUID from chain
 uuid = backend.registry_get_uuid(chain)
 # Get chain from UUID
 chain = backend.registry_get_chain(uuid)
 # Delete mapping
 deleted = backend.registry_delete(uuid)
 # List all mappings
 all_mappings = backend.registry_list_all()
 # Clear all (testing)
 backend.registry_clear()
 ```
 ### Serialization
 Slots are serialized using pickle:
 ```python
 from xml_pipeline.memory import serialize_slot, deserialize_slot
 # Serialize for storage
 slot_bytes = serialize_slot(buffer_slot)
 # Deserialize after retrieval
 slot = deserialize_slot(slot_bytes)
 ```
 ## Integration
 ### With ContextBuffer
 ```python
 from xml_pipeline.memory import get_context_buffer
 # Uses shared backend automatically if configured
 buffer = get_context_buffer(backend=backend)
 # Check if using shared storage
 print(buffer.is_shared)  # True
 ```
 ### With ThreadRegistry
 ```python
 from xml_pipeline.message_bus.thread_registry import get_registry
 registry = get_registry(backend=backend)
 # Check if using shared storage
 print(registry.is_shared)  # True
 ```
 ### With StreamPump
 The pump automatically uses the configured backend:
 ```yaml
 backend:
  type: redis
  redis_url: "redis://localhost:6379"
 process_pool:
  workers: 4
 listeners:
  - name: analyzer
    cpu_bound: true  # Uses shared backend for data exchange
 ```
 ## Worker Data Flow
 For CPU-bound handlers, data flows through the backend:
 ```
 1. Main Process
   ├── Serialize payload + metadata
   ├── Store in backend (payload_uuid, metadata_uuid)
   └── Submit WorkerTask to ProcessPool
 2. Worker Process
   ├── Fetch payload + metadata from backend
   ├── Execute handler
   ├── Store response in backend (response_uuid)
   └── Return WorkerResult
 3. Main Process
   ├── Fetch response from backend
   ├── Clean up temporary data
   └── Process response normally
 ```
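The three phases can be simulated in a single process. In this sketch a dictionary plays the shared backend and the worker function is called inline instead of being submitted to a process pool. `store`, `fetch`, `analyze_handler`, and the reduced `ToyWorkerTask` fields are assumptions for illustration.

```python
import pickle
import uuid
from dataclasses import dataclass

backend = {}  # stands in for the shared backend


def store(obj) -> str:
    """Pickle obj into the backend and return its lookup key."""
    key = str(uuid.uuid4())
    backend[key] = pickle.dumps(obj)
    return key


def fetch(key: str, delete: bool = False):
    """Load an object from the backend, optionally removing it."""
    data = backend.pop(key) if delete else backend[key]
    return pickle.loads(data)


@dataclass
class ToyWorkerTask:
    payload_uuid: str
    metadata_uuid: str


@dataclass
class ToyWorkerResult:
    response_uuid: str


def analyze_handler(payload: dict, metadata: dict) -> dict:
    return {"words": len(payload["text"].split()), "thread": metadata["thread_id"]}


def execute_handler(task: ToyWorkerTask) -> ToyWorkerResult:
    """Worker side: fetch inputs, run the handler, store the response."""
    payload = fetch(task.payload_uuid)
    metadata = fetch(task.metadata_uuid)
    return ToyWorkerResult(response_uuid=store(analyze_handler(payload, metadata)))


# 1. Main process: store inputs and build the task.
task = ToyWorkerTask(
    payload_uuid=store({"text": "heavy document analysis"}),
    metadata_uuid=store({"thread_id": "uuid-123"}),
)
# 2. Worker process (called inline here).
result = execute_handler(task)
# 3. Main process: fetch the response and clean up temporary data.
response = fetch(result.response_uuid, delete=True)
fetch(task.payload_uuid, delete=True)
fetch(task.metadata_uuid, delete=True)
```

Only small keys cross the process boundary. The payload, metadata, and response travel through the backend, which matches the "minimal IPC" idea described for the pump.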
 ## TTL and Cleanup
 ### Redis TTL
 Redis keys automatically expire:
 ```yaml
 backend:
  redis_ttl: 86400  # Keys expire after 24 hours
 ```
 ### Manual Cleanup
 ```python
 # Delete specific thread
 backend.buffer_delete_thread(thread_id)
 backend.registry_delete(uuid)
 # Clear all (testing only)
 backend.buffer_clear()
 backend.registry_clear()
 ```
 ## Multi-Tenancy
 Use prefixes to isolate different organisms:
 ```yaml
 # Organism A
 backend:
  type: redis
  redis_prefix: "orgA:"
 # Organism B
 backend:
  type: redis
  redis_prefix: "orgB:"
 ```
 ## Monitoring
 ### Redis Info
 ```python
 info = backend.info()
 # {'buffer_threads': 5, 'registry_entries': 12}
 ```
 ### Health Check
 ```python
 is_healthy = backend.ping()  # True if connected
 ```
 ## Testing
 ```python
 import pytest
 from xml_pipeline.memory import InMemoryBackend
@pytest.fixture
 def backend():
    backend = InMemoryBackend()
    yield backend
    backend.close()
 def test_buffer_operations(backend):
    backend.buffer_append("thread-1", b"data")
    assert backend.buffer_thread_exists("thread-1")
 ```
 ## See Also
 - [[Architecture Overview]] — High-level architecture
 - [[Message Pump]] — How the pump uses backends
 - [[Configuration]] — Backend configuration options
--- a/docs/wiki/architecture/Thread-Registry.md
+++ b/docs/wiki/architecture/Thread-Registry.md
@ -0,0 +1,261 @@
 # Thread Registry
 The Thread Registry maps opaque UUIDs to call chains, enabling thread tracking while hiding topology from handlers.
 ## Purpose
 When agents communicate, they form call chains:
 ```
 console → greeter → calculator → back to greeter → shouter
 ```
 The registry:
 1. **Tracks call chains** for routing responses
 2. **Provides opaque UUIDs** to handlers (hiding topology)
 3. **Manages chain pruning** when handlers respond
 ## Concepts
 ### Call Chain
 A dot-separated path showing message flow:
 ```
 system.organism.console.greeter.calculator
 │      │        │       │       │
 │      │        │       │       └─ Current position
 │      │        │       └─ Greeter called calculator
 │      │        └─ Console called greeter
 │      └─ Organism name
 └─ Root
 ```
 ### Opaque UUID
 What handlers actually see:
 ```
 550e8400-e29b-41d4-a716-446655440000
 ```
 Handlers never see the actual chain. This prevents:
 - Topology probing
 - Call chain forgery
 - Thread hijacking
 ## API
 ### Initialize Root
 At boot time:
 ```python
 from xml_pipeline.message_bus.thread_registry import get_registry
 registry = get_registry()
 root_uuid = registry.initialize_root("my-organism")
 # Creates: system.my-organism → uuid
 ```
 ### Get or Create
 Get UUID for a chain (creates if needed):
 ```python
 uuid = registry.get_or_create("console.greeter")
 # Returns: existing UUID or creates new one
 ```
 ### Lookup
 Get chain for a UUID:
 ```python
 chain = registry.lookup(uuid)
 # Returns: "console.greeter" or None
 ```
 ### Extend Chain
 When forwarding to a new handler:
 ```python
 new_uuid = registry.extend_chain(current_uuid, "calculator")
 # Before: console.greeter (uuid-123)
 # After: console.greeter.calculator (uuid-456)
 ```
 ### Prune for Response
 When a handler returns `.respond()`:
 ```python
 target, new_uuid = registry.prune_for_response(current_uuid)
 # Before: console.greeter.calculator (uuid-456)
 # After: console.greeter (uuid-123)
 # target: "greeter"
 ```
 ### Register External Thread
 For messages arriving with pre-assigned UUIDs:
 ```python
 registry.register_thread(
    thread_id="external-uuid",
    initiator="console",
    target="greeter"
 )
 # Creates: system.organism.console.greeter → external-uuid
 ```
 ## Thread Lifecycle
 ### Creation
 ```
 1. External message arrives without thread
   │
   ▼
 2. thread_assignment_step generates UUID
   │
   ▼
 3. Registry maps: chain → UUID
 ```
 ### Extension
 ```
 1. Handler A forwards to Handler B
   │
   ▼
 2. Pump calls extend_chain(uuid_A, "B")
   │
   ▼
 3. Registry creates: chain.B → uuid_B
 ```
 ### Pruning
 ```
 1. Handler B calls .respond()
   │
   ▼
 2. Pump calls prune_for_response(uuid_B)
   │
   ▼
 3. Registry:
   - Looks up chain: "...A.B"
   - Prunes last segment: "...A"
   - Returns target "A" and uuid_A
   │
   ▼
 4. Response routed to Handler A
 ```
 ### Cleanup
 ```
 1. Chain exhausted (root reached) or
   Handler returns None
   │
   ▼
 2. UUID mapping removed
   │
   ▼
 3. Context buffer for thread deleted
 ```
 ## Shared Backend Support
 For multiprocess deployments, the registry can use a shared backend:
 ```python
 from xml_pipeline.memory.shared_backend import get_shared_backend, BackendConfig
 from xml_pipeline.message_bus.thread_registry import get_registry
 # Use Redis for distributed deployments
 config = BackendConfig(backend_type="redis", redis_url="redis://localhost:6379")
 backend = get_shared_backend(config)
 registry = get_registry(backend=backend)
 ```
 ### Storage Schema (Redis)
 ```
 xp:chain:{chain} → {uuid}     # Chain to UUID
 xp:uuid:{uuid} → {chain}      # UUID to Chain
 ```
 ## Security Properties
 ### What Handlers See
 ```python
 metadata.thread_id = "550e8400-..."  # Opaque UUID
 metadata.from_id = "greeter"         # Only immediate caller
 ```
 ### What Handlers Don't See
 - Full call chain
 - Other thread UUIDs
 - Thread count or topology
 - Parent/child relationships
 ### Why This Matters
 Even compromised handlers cannot:
 - **Forge thread IDs** — UUIDs are cryptographically random
 - **Discover topology** — Chain hidden behind UUID
 - **Hijack threads** — Registry validates all operations
 - **Probe other threads** — No enumeration API
 ## Debugging
 For operators (not exposed to handlers):
 ```python
 # Dump all mappings
 chains = registry.debug_dump()
 # {'uuid-123': 'console.greeter', 'uuid-456': 'console.greeter.calc'}
 # Clear (testing only)
 registry.clear()
 ```
 ## Example Flow
 ```
 1. Console sends @greeter hello
   ├── UUID assigned: uuid-1
   └── Chain: system.org.console.greeter
 2. Greeter forwards to calculator
   ├── extend_chain(uuid-1, "calculator")
   ├── New UUID: uuid-2
   └── Chain: system.org.console.greeter.calculator
 3. Calculator responds
   ├── prune_for_response(uuid-2)
   ├── Target: "greeter"
   └── UUID: uuid-1 (back to greeter's context)
 4. Greeter responds
   ├── prune_for_response(uuid-1)
   ├── Target: "console"
   └── Chain exhausted → cleanup
 ```
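Steps 1 to 3 of this flow can be reproduced with a toy registry. `ToyThreadRegistry` mirrors the method names documented above but is a simplified assumption, not the real implementation. In particular it omits root handling and the cleanup that happens when a chain is exhausted.

```python
import uuid
from typing import Optional, Tuple


class ToyThreadRegistry:
    def __init__(self) -> None:
        self._chain_to_uuid = {}
        self._uuid_to_chain = {}

    def get_or_create(self, chain: str) -> str:
        """Return the UUID for chain, minting a random one if needed."""
        if chain not in self._chain_to_uuid:
            thread_id = str(uuid.uuid4())
            self._chain_to_uuid[chain] = thread_id
            self._uuid_to_chain[thread_id] = chain
        return self._chain_to_uuid[chain]

    def lookup(self, thread_id: str) -> Optional[str]:
        return self._uuid_to_chain.get(thread_id)

    def extend_chain(self, thread_id: str, next_hop: str) -> str:
        """Append next_hop to the chain and return the new chain's UUID."""
        return self.get_or_create(f"{self._uuid_to_chain[thread_id]}.{next_hop}")

    def prune_for_response(self, thread_id: str) -> Tuple[str, str]:
        """Drop the last hop; return (target, UUID of the shortened chain)."""
        chain = self._uuid_to_chain.pop(thread_id)
        del self._chain_to_uuid[chain]
        parent = chain.rsplit(".", 1)[0]
        target = parent.rsplit(".", 1)[-1]
        return target, self.get_or_create(parent)


registry = ToyThreadRegistry()
uuid_1 = registry.get_or_create("system.org.console.greeter")  # step 1
uuid_2 = registry.extend_chain(uuid_1, "calculator")           # step 2
target, back_to = registry.prune_for_response(uuid_2)          # step 3
```

After step 3 the calculator's UUID no longer resolves, and the response carries the greeter's original UUID, so the greeter resumes in its own context.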
 ## Configuration
 No explicit configuration needed. The registry:
 - Initializes automatically at pump startup
 - Uses shared backend if configured
 - Cleans up on thread termination
 ## See Also
 - [[Architecture Overview]] — High-level architecture
 - [[Message Pump]] — How the pump uses the registry
 - [[Shared Backend]] — Cross-process storage
--- a/docs/wiki/convert-to-xwiki.ps1
+++ b/docs/wiki/convert-to-xwiki.ps1
@ -0,0 +1,76 @@
 # Convert Markdown wiki docs to XWiki format using Pandoc
 #
 # Prerequisites:
 #   - Pandoc installed (https://pandoc.org/installing.html)
 #   - Run from docs/wiki directory
 #
 # Usage:
 #   .\convert-to-xwiki.ps1
 #
 # Output:
 #   Creates xwiki/ directory with converted files
 $ErrorActionPreference = "Stop"
 $ScriptDir = Split-Path -Parent $MyInvocation.MyCommand.Path
 $OutputDir = Join-Path $ScriptDir "xwiki"
 Write-Host "Converting Markdown to XWiki format..."
 Write-Host "Output directory: $OutputDir"
 Write-Host ""
 # Create output directory structure
 New-Item -ItemType Directory -Force -Path $OutputDir | Out-Null
 New-Item -ItemType Directory -Force -Path (Join-Path $OutputDir "architecture") | Out-Null
 New-Item -ItemType Directory -Force -Path (Join-Path $OutputDir "reference") | Out-Null
 New-Item -ItemType Directory -Force -Path (Join-Path $OutputDir "tutorials") | Out-Null
 # Function to convert a single file
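 function Convert-File {
    param (
        # Named InputPath/OutputPath because $Input is a PowerShell automatic variable
        [string]$InputPath,
        [string]$OutputPath
    )
    if (Test-Path $InputPath) {
        Write-Host "Converting: $InputPath"
        pandoc -f markdown -t xwiki $InputPath -o $OutputPath
    }
 }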
 # Convert root level docs
 Convert-File (Join-Path $ScriptDir "Home.md") (Join-Path $OutputDir "Home.xwiki")
 Convert-File (Join-Path $ScriptDir "Installation.md") (Join-Path $OutputDir "Installation.xwiki")
 Convert-File (Join-Path $ScriptDir "Quick-Start.md") (Join-Path $OutputDir "Quick-Start.xwiki")
 Convert-File (Join-Path $ScriptDir "Writing-Handlers.md") (Join-Path $OutputDir "Writing-Handlers.xwiki")
 Convert-File (Join-Path $ScriptDir "LLM-Router.md") (Join-Path $OutputDir "LLM-Router.xwiki")
 Convert-File (Join-Path $ScriptDir "Why-XML.md") (Join-Path $OutputDir "Why-XML.xwiki")
 # Convert architecture docs
 Convert-File (Join-Path $ScriptDir "architecture\Overview.md") (Join-Path $OutputDir "architecture\Overview.xwiki")
 Convert-File (Join-Path $ScriptDir "architecture\Message-Pump.md") (Join-Path $OutputDir "architecture\Message-Pump.xwiki")
 Convert-File (Join-Path $ScriptDir "architecture\Thread-Registry.md") (Join-Path $OutputDir "architecture\Thread-Registry.xwiki")
 Convert-File (Join-Path $ScriptDir "architecture\Shared-Backend.md") (Join-Path $OutputDir "architecture\Shared-Backend.xwiki")
 # Convert reference docs
 Convert-File (Join-Path $ScriptDir "reference\Configuration.md") (Join-Path $OutputDir "reference\Configuration.xwiki")
 Convert-File (Join-Path $ScriptDir "reference\Handler-Contract.md") (Join-Path $OutputDir "reference\Handler-Contract.xwiki")
 Convert-File (Join-Path $ScriptDir "reference\CLI.md") (Join-Path $OutputDir "reference\CLI.xwiki")
 # Convert tutorials
 Convert-File (Join-Path $ScriptDir "tutorials\Hello-World.md") (Join-Path $OutputDir "tutorials\Hello-World.xwiki")
 Write-Host ""
 Write-Host "Conversion complete!"
 Write-Host ""
 Write-Host "Files created in: $OutputDir"
 Write-Host ""
 Write-Host "Next steps:"
 Write-Host "  1. Review the converted files"
 Write-Host "  2. Upload to XWiki via REST API or import feature"
 Write-Host ""
 Write-Host "XWiki REST API example:"
 Write-Host "  curl -u admin:password -X PUT ``"
 Write-Host "    'https://xml-pipeline.org/rest/wikis/xwiki/spaces/Docs/pages/Home' ``"
 Write-Host "    -H 'Content-Type: text/plain' ``"
 Write-Host "    --data-binary @$OutputDir\Home.xwiki"
--- a/docs/wiki/convert-to-xwiki.sh
+++ b/docs/wiki/convert-to-xwiki.sh
@ -0,0 +1,75 @@
 #!/bin/bash
 # Convert Markdown wiki docs to XWiki format using Pandoc
 #
 # Prerequisites:
 #   - Pandoc installed (https://pandoc.org/installing.html)
 #   - Markdown sources live next to this script (it can be run from any directory)
 #
 # Usage:
 #   ./convert-to-xwiki.sh
 #
 # Output:
 #   Creates xwiki/ directory with converted files
 set -e
 SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
 OUTPUT_DIR="$SCRIPT_DIR/xwiki"
 echo "Converting Markdown to XWiki format..."
 echo "Output directory: $OUTPUT_DIR"
 echo ""
 # Create output directory structure
 mkdir -p "$OUTPUT_DIR"
 mkdir -p "$OUTPUT_DIR/architecture"
 mkdir -p "$OUTPUT_DIR/reference"
 mkdir -p "$OUTPUT_DIR/tutorials"
 # Function to convert a single file
 convert_file() {
    local input="$1"
    local output="$2"
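    if [ -f "$input" ]; then
        echo "Converting: $input"
        pandoc -f markdown -t xwiki "$input" -o "$output"
    else
        # Report missing sources instead of skipping them silently
        echo "Skipping (not found): $input" >&2
    fi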
 }
 # Convert root level docs
 convert_file "$SCRIPT_DIR/Home.md" "$OUTPUT_DIR/Home.xwiki"
 convert_file "$SCRIPT_DIR/Installation.md" "$OUTPUT_DIR/Installation.xwiki"
 convert_file "$SCRIPT_DIR/Quick-Start.md" "$OUTPUT_DIR/Quick-Start.xwiki"
 convert_file "$SCRIPT_DIR/Writing-Handlers.md" "$OUTPUT_DIR/Writing-Handlers.xwiki"
 convert_file "$SCRIPT_DIR/LLM-Router.md" "$OUTPUT_DIR/LLM-Router.xwiki"
 convert_file "$SCRIPT_DIR/Why-XML.md" "$OUTPUT_DIR/Why-XML.xwiki"
 # Convert architecture docs
 convert_file "$SCRIPT_DIR/architecture/Overview.md" "$OUTPUT_DIR/architecture/Overview.xwiki"
 convert_file "$SCRIPT_DIR/architecture/Message-Pump.md" "$OUTPUT_DIR/architecture/Message-Pump.xwiki"
 convert_file "$SCRIPT_DIR/architecture/Thread-Registry.md" "$OUTPUT_DIR/architecture/Thread-Registry.xwiki"
 convert_file "$SCRIPT_DIR/architecture/Shared-Backend.md" "$OUTPUT_DIR/architecture/Shared-Backend.xwiki"
 # Convert reference docs
 convert_file "$SCRIPT_DIR/reference/Configuration.md" "$OUTPUT_DIR/reference/Configuration.xwiki"
 convert_file "$SCRIPT_DIR/reference/Handler-Contract.md" "$OUTPUT_DIR/reference/Handler-Contract.xwiki"
 convert_file "$SCRIPT_DIR/reference/CLI.md" "$OUTPUT_DIR/reference/CLI.xwiki"
 # Convert tutorials
 convert_file "$SCRIPT_DIR/tutorials/Hello-World.md" "$OUTPUT_DIR/tutorials/Hello-World.xwiki"
 echo ""
 echo "Conversion complete!"
 echo ""
 echo "Files created in: $OUTPUT_DIR"
 echo ""
 echo "Next steps:"
 echo "  1. Review the converted files"
 echo "  2. Upload to XWiki via REST API or import feature"
 echo ""
 echo "XWiki REST API example:"
 echo "  curl -u admin:password -X PUT \\"
 echo "    'https://xml-pipeline.org/rest/wikis/xwiki/spaces/Docs/pages/Home' \\"
 echo "    -H 'Content-Type: text/plain' \\"
 echo "    --data-binary @$OUTPUT_DIR/Home.xwiki"
--- a/docs/wiki/reference/CLI.md
+++ b/docs/wiki/reference/CLI.md
@ -0,0 +1,129 @@
 # CLI Reference
 xml-pipeline provides a command-line interface for running and managing organisms.
 ## Commands
 ### xml-pipeline run
 Run an organism from a configuration file.
 ```bash
 xml-pipeline run [CONFIG_PATH]
 ```
 **Arguments:**
 - `CONFIG_PATH` — Path to organism.yaml (default: `config/organism.yaml`)
 **Examples:**
 ```bash
 xml-pipeline run config/organism.yaml
 xml-pipeline run                        # Uses default path
 xp run config/my-organism.yaml          # Short alias
 ```
 ### xml-pipeline init
 Create a new organism configuration from template.
 ```bash
 xml-pipeline init [NAME]
 ```
 **Arguments:**
 - `NAME` — Organism name (default: `my-organism`)
 **Creates:**
 ```
 {NAME}/
 ├── config/
 │   └── organism.yaml
 ├── handlers/
 │   └── hello.py
 └── .env.example
 ```
 **Examples:**
 ```bash
 xml-pipeline init my-agent
 xml-pipeline init                       # Uses default name
 ```
 ### xml-pipeline check
 Validate configuration without running.
 ```bash
 xml-pipeline check [CONFIG_PATH]
 ```
 **Arguments:**
 - `CONFIG_PATH` — Path to organism.yaml (default: `config/organism.yaml`)
 **Output:**
 ```
 Config valid: hello-world
  Listeners: 3
  LLM backends: 1
 ```
 **Examples:**
 ```bash
 xml-pipeline check config/organism.yaml
 xml-pipeline check                      # Uses default path
 ```
 ### xml-pipeline version
 Show version and installed features.
 ```bash
 xml-pipeline version
 ```
 **Output:**
 ```
 xml-pipeline 0.4.0
 Python 3.11.5
 Features: anthropic, console, redis, search
 ```
 ## Short Alias
 The `xp` command is a short alias for `xml-pipeline`:
 ```bash
 xp run config/organism.yaml
 xp init my-agent
 xp check
 xp version
 ```
 ## Environment Variables
 | Variable | Description |
 |----------|-------------|
 | `XAI_API_KEY` | xAI (Grok) API key |
 | `ANTHROPIC_API_KEY` | Anthropic (Claude) API key |
 | `OPENAI_API_KEY` | OpenAI API key |
 Create a `.env` file in your project root:
 ```env
 XAI_API_KEY=xai-...
 ANTHROPIC_API_KEY=sk-ant-...
 ```
 ## Exit Codes
 | Code | Meaning |
 |------|---------|
 | 0 | Success |
 | 1 | Configuration error |
 | 2 | Runtime error |
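
The exit codes in the table can be consumed from a wrapper script. The following is a minimal sketch, not part of xml-pipeline itself: `describe_exit_code` and `check_config` are hypothetical helper names, and the only assumptions about the CLI are the `check` command and the three exit codes documented here.

```python
import subprocess

# Exit codes as listed in the table above.
EXIT_MEANINGS = {
    0: "Success",
    1: "Configuration error",
    2: "Runtime error",
}


def describe_exit_code(code: int) -> str:
    """Translate a documented exit code into its meaning."""
    return EXIT_MEANINGS.get(code, f"Unknown exit code {code}")


def check_config(config_path: str = "config/organism.yaml") -> str:
    """Run `xml-pipeline check` and describe the outcome (hypothetical wrapper)."""
    completed = subprocess.run(
        ["xml-pipeline", "check", config_path],
        capture_output=True,
        text=True,
    )
    return describe_exit_code(completed.returncode)
```

Calling `check_config()` in a CI job before `xml-pipeline run` gives an early, readable failure when the configuration is invalid.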
 ## See Also
 - [[Installation]] — Installing xml-pipeline
 - [[Quick Start]] — Getting started
 - [[Configuration]] — Configuration reference
--- a/docs/wiki/reference/Configuration.md
+++ b/docs/wiki/reference/Configuration.md
@ -0,0 +1,196 @@
 # Configuration Reference
 Organisms are configured via YAML files. The default location is `config/organism.yaml`.
 ## Minimal Configuration
 ```yaml
 organism:
  name: my-organism
 listeners:
  - name: greeter
    payload_class: handlers.hello.Greeting
    handler: handlers.hello.handle_greeting
    description: Greeting handler
 ```
 ## Full Configuration Reference
 ```yaml
 # ============================================================
 # ORGANISM SECTION
 # Core identity and network settings
 # ============================================================
 organism:
  name: "my-organism"              # Human-readable name (required)
  port: 8765                       # WebSocket port (optional)
  identity: "config/identity.key"  # Ed25519 private key path (optional)
  tls:                             # TLS settings (optional)
    cert: "certs/fullchain.pem"
    key: "certs/privkey.pem"
 # ============================================================
 # LLM SECTION
 # Language model routing configuration
 # ============================================================
 llm:
  strategy: failover               # failover | round-robin | least-loaded
  retries: 3                       # Max retry attempts
  retry_base_delay: 1.0            # Base delay for exponential backoff
  retry_max_delay: 60.0            # Maximum delay between retries
  backends:
    - provider: xai                # xai | anthropic | openai | ollama
      api_key_env: XAI_API_KEY     # Environment variable name
      priority: 1                  # Lower = preferred (for failover)
      rate_limit_tpm: 100000       # Tokens per minute limit
      max_concurrent: 20           # Max concurrent requests
    - provider: anthropic
      api_key_env: ANTHROPIC_API_KEY
      priority: 2
    - provider: ollama
      base_url: http://localhost:11434
      supported_models: [llama3, mistral]
 # ============================================================
 # BACKEND SECTION (Optional)
 # Shared state for multiprocess deployments
 # ============================================================
 backend:
  type: memory                     # memory | manager | redis
  # Redis-specific settings (when type: redis)
  redis_url: "redis://localhost:6379"
  redis_prefix: "xp:"              # Key prefix for multi-tenancy
  redis_ttl: 86400                 # TTL in seconds (24 hours)
 # ============================================================
 # PROCESS POOL SECTION (Optional)
 # Worker processes for CPU-bound handlers
 # ============================================================
 process_pool:
  workers: 4                       # Number of worker processes
  max_tasks_per_child: 100         # Restart workers after N tasks
 # ============================================================
 # LISTENERS SECTION
 # Message handlers (tools and agents)
 # ============================================================
 listeners:
  # Simple tool (non-agent)
  - name: calculator.add
    payload_class: handlers.calc.AddPayload
    handler: handlers.calc.add_handler
    description: "Adds two numbers"
  # LLM Agent
  - name: researcher
    payload_class: handlers.research.ResearchQuery
    handler: handlers.research.research_handler
    description: "Research agent that searches and synthesizes"
    agent: true                    # Marks as LLM agent
    peers:                         # Allowed call targets
      - calculator.add
      - web_search
    prompt: |                      # System prompt for LLM
      You are a research assistant.
      Use tools to find information.
  # CPU-bound handler (runs in process pool)
  - name: librarian
    payload_class: handlers.librarian.Query
    handler: handlers.librarian.handle_query
    description: "Document analysis with heavy computation"
    cpu_bound: true                # Dispatch to ProcessPoolExecutor
 # ============================================================
 # GATEWAYS SECTION (Optional)
 # Federation with remote organisms
 # ============================================================
 gateways:
  - name: remote_search
    remote_url: "wss://search.example.org"
    trusted_identity: "keys/search_node.pub"
    description: "Federated search gateway"
 ```
 ## Section Details
 ### organism
 | Field | Type | Required | Description |
 |-------|------|----------|-------------|
 | `name` | string | Yes | Human-readable organism name |
 | `port` | int | No | WebSocket server port |
 | `identity` | path | No | Ed25519 private key for signing |
 | `tls.cert` | path | No | TLS certificate path |
 | `tls.key` | path | No | TLS private key path |
 ### llm.backends[]
 | Field | Type | Required | Description |
 |-------|------|----------|-------------|
 | `provider` | string | Yes | `xai`, `anthropic`, `openai`, `ollama` |
 | `api_key_env` | string | Depends | Env var containing API key |
 | `base_url` | string | No | Override API endpoint |
 | `priority` | int | No | Lower = preferred (default: 1) |
 | `rate_limit_tpm` | int | No | Tokens per minute limit |
 | `max_concurrent` | int | No | Max concurrent requests |
 | `supported_models` | list | No | Models this backend serves (ollama) |
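
To make `priority` and the retry settings concrete, here is a rough sketch of how a failover strategy could order backends and space out retries. It is an illustration under assumptions, not the router's actual code: the `Backend` class, `failover_order`, and `backoff_delay` are hypothetical names, and the doubling formula is only one common way to implement "exponential backoff" with `retry_base_delay` and `retry_max_delay`.

```python
from dataclasses import dataclass


@dataclass
class Backend:
    provider: str
    priority: int = 1  # documented default


def failover_order(backends: list[Backend]) -> list[Backend]:
    """Lowest priority number first; ties keep their configured order."""
    return sorted(backends, key=lambda backend: backend.priority)


def backoff_delay(attempt: int, base_delay: float = 1.0, max_delay: float = 60.0) -> float:
    """Delay before retry `attempt` (0-based), assuming plain doubling with a cap."""
    return min(base_delay * (2 ** attempt), max_delay)
```

With the values from the full configuration example (`retry_base_delay: 1.0`, `retry_max_delay: 60.0`), this formula yields 1, 2, and 4 seconds for the three allowed retries.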
 ### listeners[]
 | Field | Type | Required | Description |
 |-------|------|----------|-------------|
 | `name` | string | Yes | Unique listener name |
 | `payload_class` | string | Yes | Import path to `@xmlify` dataclass |
 | `handler` | string | Yes | Import path to handler function |
 | `description` | string | Yes | Short description (used in prompts) |
 | `agent` | bool | No | Is this an LLM agent? (default: false) |
 | `peers` | list | No | Allowed call targets for agents |
 | `prompt` | string | No | System prompt for LLM agents |
 | `cpu_bound` | bool | No | Run in ProcessPoolExecutor (default: false) |
 | `broadcast` | bool | No | Allow shared root tag (default: false) |
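
The required columns in this table translate into a simple pre-flight check. The sketch below is an assumption about what such a check could look like, not the logic behind `xml-pipeline check`; `listener_problems` is a hypothetical helper that operates on the already-parsed `listeners` list.

```python
# Fields marked "Yes" in the Required column of the listeners[] table.
REQUIRED_LISTENER_FIELDS = ("name", "payload_class", "handler", "description")


def listener_problems(listeners: list[dict]) -> list[str]:
    """Return human-readable problems; an empty list means the entries look complete."""
    problems = []
    seen_names = set()
    for index, listener in enumerate(listeners):
        label = listener.get("name") or f"listeners[{index}]"
        for field in REQUIRED_LISTENER_FIELDS:
            if not listener.get(field):
                problems.append(f"{label}: missing required field '{field}'")
        name = listener.get("name")
        if name:
            # Listener names must be unique.
            if name in seen_names:
                problems.append(f"{name}: duplicate listener name")
            seen_names.add(name)
    return problems
```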
 ### backend
 | Field | Type | Required | Description |
 |-------|------|----------|-------------|
 | `type` | string | No | `memory`, `manager`, `redis` (default: memory) |
 | `redis_url` | string | If redis | Redis connection URL |
 | `redis_prefix` | string | No | Key prefix (default: `xp:`) |
 | `redis_ttl` | int | No | Key TTL in seconds |
 ### process_pool
 | Field | Type | Required | Description |
 |-------|------|----------|-------------|
 | `workers` | int | No | Number of worker processes (default: CPU count) |
 | `max_tasks_per_child` | int | No | Tasks before worker restart |
 ## Environment Variables
 API keys should be stored in environment variables, referenced via `api_key_env`:
 ```env
 # .env file
 XAI_API_KEY=xai-abc123...
 ANTHROPIC_API_KEY=sk-ant-...
 OPENAI_API_KEY=sk-...
 ```
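
The indirection through `api_key_env` keeps secrets out of the YAML file. As a hedged sketch of how a loader might resolve it (the function name `resolve_api_key` and the choice to raise `KeyError` are assumptions, not the library's behavior):

```python
import os
from typing import Mapping, Optional


def resolve_api_key(backend: dict, environ: Mapping[str, str] = os.environ) -> Optional[str]:
    """Look up the key named by `api_key_env`; backends without one return None."""
    env_name = backend.get("api_key_env")
    if env_name is None:
        # e.g. the ollama backend in the example config declares no key
        return None
    key = environ.get(env_name)
    if not key:
        raise KeyError(f"Environment variable {env_name} is not set")
    return key
```

Passing `environ` explicitly makes the lookup easy to exercise in tests without touching the real environment.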
 ## Validation
 Validate your configuration without running:
 ```bash
 xml-pipeline check config/organism.yaml
 ```
 ## See Also
 - [[Quick Start]] — Get started quickly
 - [[Writing Handlers]] — Create handlers
 - [[LLM Router]] — LLM backend details
--- a/docs/wiki/reference/Handler-Contract.md
+++ b/docs/wiki/reference/Handler-Contract.md
@ -0,0 +1,293 @@
 # Handler Contract
 The complete specification for handler functions in xml-pipeline.
 ## Signature
 Every handler must be declared as:
 ```python
 async def handler(
    payload: PayloadDataclass,
    metadata: HandlerMetadata
 ) -> HandlerResponse | None:
    ...
 ```
 ### Requirements
 | Aspect | Requirement |
 |--------|-------------|
 | Function type | Must be `async def` |
 | First parameter | XSD-validated `@xmlify` dataclass |
 | Second parameter | `HandlerMetadata` (required) |
 | Return type | `HandlerResponse` or `None` |
 ## HandlerMetadata
 ```python
@dataclass
 class HandlerMetadata:
    thread_id: str              # Opaque thread UUID
    from_id: str                # Who sent this message
    own_name: str | None        # This listener's name (agents only)
    is_self_call: bool          # True if message from self
    usage_instructions: str     # Peer schemas for LLM prompts
    todo_nudge: str             # System reminders
 ```
 ### Field Details
 #### thread_id
 Opaque UUID for the current conversation thread.
 - Use for thread-scoped storage
 - Never parse or make assumptions about format
 - Maps internally to call chain (hidden)
 #### from_id
 The registered name of the listener that sent this message.
 - Only shows immediate sender, not full chain
 - Use for logging/debugging
 - Don't use for routing (use `HandlerResponse.to`)
 #### own_name
 This listener's registered name. Only set for agents (`agent: true`).
 - Enables self-referential reasoning
 - Used for self-iteration: `to=metadata.own_name`
 - `None` for non-agent listeners
 #### is_self_call
 `True` if this message was sent by this same handler.
 - Detect iteration loops
 - Handle self-messages differently if needed
 #### usage_instructions
 Auto-generated documentation of peer capabilities.
 - Contains XSD schemas of declared peers
 - Inject into LLM system prompts
 - Empty if no peers declared
 #### todo_nudge
 System-generated reminders about pending todos.
 - Contains info about raised watchers
 - Used by agents for task tracking
 - Empty if no pending todos
 ## HandlerResponse
 ```python
@dataclass
 class HandlerResponse:
    payload: Any    # @xmlify dataclass instance
    to: str         # Target listener name
 ```
 ### Construction Methods
 #### Direct Construction
 Forward to a specific listener:
 ```python
 return HandlerResponse(
    payload=MyResponse(data="result"),
    to="next-handler",
 )
 ```
 #### Respond to Caller
 Return to whoever sent the message:
 ```python
 return HandlerResponse.respond(
    payload=ResultPayload(value=42)
 )
 ```
 ### Return None
 End the chain with no response:
 ```python
 return None
 ```
 ## Return Types
 | Return | Effect |
 |--------|--------|
 | `HandlerResponse(to="x")` | Forward to listener "x" |
 | `HandlerResponse.respond()` | Return to caller (prunes chain) |
 | `None` | Terminate chain |
 ## Envelope Control
 The system enforces these rules on responses:
 | Field | Handler Control | System Override |
 |-------|-----------------|-----------------|
 | `<from>` | None | Always `listener.name` |
 | `<to>` | Via `response.to` | Validated against peers |
 | `<thread>` | None | Managed by registry |
 | `<payload>` | Full control | — |
 ## Peer Constraints
 Agents can only send to declared peers:
 ```yaml
 listeners:
  - name: greeter
    agent: true
    peers: [shouter, logger]  # Only these allowed
 ```
 ### Violation Handling
 If an agent sends to an undeclared peer:
 1. Message **blocked** (never routed)
 2. `SystemError` returned to agent
 3. Thread stays alive (can retry)
 ```xml
 <SystemError>
  <code>routing</code>
  <message>Message could not be delivered.</message>
  <retry-allowed>true</retry-allowed>
 </SystemError>
 ```
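
The rule can be pictured as a membership test performed by the system before routing. This is a simplified sketch under assumptions: `RoutingDecision` and `check_route` are hypothetical names, and the real check runs inside the message bus rather than in handler code.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RoutingDecision:
    delivered: bool
    error_code: Optional[str] = None
    retry_allowed: bool = False


def check_route(declared_peers: list[str], target: str) -> RoutingDecision:
    """Deliver only to declared peers; otherwise mirror the SystemError fields."""
    if target in declared_peers:
        return RoutingDecision(delivered=True)
    # Blocked: the message is never routed, but the thread may retry.
    return RoutingDecision(delivered=False, error_code="routing", retry_allowed=True)
```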
 ## Response Semantics
 ### Critical: Pruning on Respond
 When you call `.respond()`, the call chain is **pruned**:
 ```
 Before: console → greeter → calculator
                           ↑ (you respond here)
 After:  console → greeter
                 ↑ (response delivered here)
 ```
 **Consequences:**
 - Sub-agents you called are terminated
 - Their state/context is lost
 - You cannot call them again in this context
 **Therefore:** Complete ALL sub-tasks before responding.
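
A toy model of pruning, assuming the call chain can be represented as a list of listener names with the current handler last (the real registry keeps this structure private, and `respond_and_prune` is a hypothetical name):

```python
def respond_and_prune(chain: list[str]) -> tuple[str, list[str]]:
    """Model `.respond()` issued by the last listener in `chain`.

    Returns the listener that receives the response and the pruned chain.
    """
    if len(chain) < 2:
        raise ValueError("no caller to respond to")
    pruned = chain[:-1]        # the responder's segment is dropped
    return pruned[-1], pruned  # the response goes to the new tail


recipient, remaining = respond_and_prune(["console", "greeter", "calculator"])
print(recipient, remaining)
```

In this model the responder's entry no longer exists after the call, which is why any work it delegated has to be finished first.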
 ## Examples
 ### Simple Tool
 ```python
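 from dataclasses import dataclass
 from third_party.xmlable import xmlify
 from xml_pipeline.message_bus.message_state import HandlerMetadata, HandlerResponse
@xmlify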
@dataclass
 class AddPayload:
    a: int
    b: int
@xmlify
@dataclass
 class AddResult:
    sum: int
 async def add_handler(payload: AddPayload, metadata: HandlerMetadata) -> HandlerResponse:
    result = payload.a + payload.b
    return HandlerResponse.respond(payload=AddResult(sum=result))
 ```
 ### LLM Agent
 ```python
 async def research_handler(payload: ResearchQuery, metadata: HandlerMetadata) -> HandlerResponse:
    from xml_pipeline.platform.llm_api import complete
    response = await complete(
        model="grok-4.1",
        messages=[
            {"role": "system", "content": metadata.usage_instructions},
            {"role": "user", "content": payload.query},
        ],
    )
    return HandlerResponse(
        payload=ResearchResult(answer=response.content),
        to="summarizer",
    )
 ```
 ### Self-Iterating Agent
 ```python
 async def thinking_agent(payload: ThinkPayload, metadata: HandlerMetadata) -> HandlerResponse:
    if payload.iteration >= 5:
        # Done thinking - respond to caller
        return HandlerResponse.respond(
            payload=FinalAnswer(answer=payload.current_answer)
        )
    # Continue thinking by calling self
    return HandlerResponse(
        payload=ThinkPayload(
            iteration=payload.iteration + 1,
            current_answer=f"Refined: {payload.current_answer}",
        ),
        to=metadata.own_name,  # Self-call
    )
 ```
 ### Terminal Handler
 ```python
 async def console_output(payload: TextOutput, metadata: HandlerMetadata) -> None:
    print(f"[{payload.source}] {payload.text}")
    return None  # Chain ends
 ```
 ### Error Handling
 ```python
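 import logging
 logger = logging.getLogger(__name__)
 # ValidationError, risky_operation, and the payload classes stand in for your own code.
 async def safe_handler(payload: MyPayload, metadata: HandlerMetadata) -> HandlerResponse: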
    try:
        result = await risky_operation(payload)
        return HandlerResponse.respond(payload=SuccessResult(data=result))
    except ValidationError as e:
        return HandlerResponse.respond(payload=ErrorResult(error=str(e)))
    except Exception:
        logger.exception("Handler failed")
        return HandlerResponse.respond(payload=ErrorResult(error="Internal error"))
 ```
 ## Security Properties
 Handlers are **untrusted code**. Even compromised handlers cannot:
 - Forge sender identity
 - Access other threads
 - Discover organism topology
 - Route to undeclared peers
 - Modify message history
 ## See Also
 - [[Writing Handlers]] — Practical guide
 - [[Configuration]] — Registering handlers
 - [[Architecture Overview]] — System architecture
--- a/docs/wiki/tutorials/Hello-World.md
+++ b/docs/wiki/tutorials/Hello-World.md
@ -0,0 +1,376 @@
 # Hello World Tutorial
 Build a complete greeting agent from scratch. By the end, you'll understand payloads, handlers, configuration, and message flow.
 ## What We're Building
 A greeting system with three components:
 ```
 User Input → Greeter Agent → Shouter Tool → Output
    │            │               │            │
    │            │               │            │
  "Alice"    "Hello,        "HELLO,      Displayed
             Alice!"        ALICE!"      to user
 ```
 ## Prerequisites
 - Python 3.11+
 - xml-pipeline installed (`pip install "xml-pipeline[console]"`; the quotes keep shells such as zsh from expanding the brackets)
 ## Step 1: Create Project Structure
 ```bash
 mkdir hello-world
 cd hello-world
 mkdir -p config handlers
 ```
 ## Step 2: Define Payloads
 Create `handlers/payloads.py`:
 ```python
 from dataclasses import dataclass
 from third_party.xmlable import xmlify
@xmlify
@dataclass
 class Greeting:
    """Request to greet someone."""
    name: str
@xmlify
@dataclass
 class GreetingResponse:
    """A friendly greeting."""
    message: str
@xmlify
@dataclass
 class ShoutRequest:
    """Request to shout text."""
    text: str
@xmlify
@dataclass
 class ShoutResponse:
    """Shouted text."""
    text: str
@xmlify
@dataclass
 class ConsoleOutput:
    """Text to display."""
    text: str
    source: str = "system"
 ```
 **What's happening:**
 - `@xmlify` enables XML serialization
 - `@dataclass` provides the fields
 - Each class becomes a valid XML payload
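
To give a feel for what serialization means here, the sketch below converts a dataclass to XML and back using only the standard library. It is a stand-in for illustration, not how `@xmlify` is implemented: `to_xml` and `from_xml` are hypothetical helpers, and the lowercased class name as root tag is an assumption based on the `<greeting>` element used later in this tutorial.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, fields


@dataclass
class Greeting:
    """Request to greet someone."""
    name: str


def to_xml(instance) -> bytes:
    """Serialize a flat dataclass: one child element per field."""
    root = ET.Element(type(instance).__name__.lower())
    for field in fields(instance):
        ET.SubElement(root, field.name).text = str(getattr(instance, field.name))
    return ET.tostring(root)


def from_xml(cls, data: bytes):
    """Rebuild the dataclass; every field comes back as a string."""
    root = ET.fromstring(data)
    return cls(**{field.name: root.findtext(field.name) for field in fields(cls)})


print(to_xml(Greeting(name="Alice")))  # b'<greeting><name>Alice</name></greeting>'
```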
 ## Step 3: Write Handlers
 Create `handlers/greeter.py`:
 ```python
 from xml_pipeline.message_bus.message_state import HandlerMetadata, HandlerResponse
 from handlers.payloads import Greeting, GreetingResponse, ShoutRequest
 async def handle_greeting(payload: Greeting, metadata: HandlerMetadata) -> HandlerResponse:
    """
    Receive a greeting request, create a friendly message,
    then forward to the shouter to make it exciting.
    """
    # Create the greeting
    message = f"Hello, {payload.name}! Welcome to xml-pipeline!"
    # Forward to shouter; its result goes on to console-output, not back to us
    return HandlerResponse(
        payload=ShoutRequest(text=message),
        to="shouter",
    )
 ```
 Create `handlers/shouter.py`:
 ```python
 from xml_pipeline.message_bus.message_state import HandlerMetadata, HandlerResponse
 from handlers.payloads import ShoutRequest, ConsoleOutput
 async def handle_shout(payload: ShoutRequest, metadata: HandlerMetadata) -> HandlerResponse:
    """
    Take text and SHOUT IT!
    Then send to console for display.
    """
    shouted = payload.text.upper() + "!!!"
    return HandlerResponse(
        payload=ConsoleOutput(text=shouted, source="shouter"),
        to="console-output",
    )
 ```
 Create `handlers/output.py`:
 ```python
 from xml_pipeline.message_bus.message_state import HandlerMetadata
 from handlers.payloads import ConsoleOutput
 async def handle_output(payload: ConsoleOutput, metadata: HandlerMetadata) -> None:
    """
    Display text to console.
    Returns None to end the message chain.
    """
    print(f"\n[{payload.source}] {payload.text}\n")
    return None  # Chain ends here
 ```
 ## Step 4: Configure the Organism
 Create `config/organism.yaml`:
 ```yaml
 organism:
  name: hello-world
 listeners:
  # The greeter agent
  - name: greeter
    payload_class: handlers.payloads.Greeting
    handler: handlers.greeter.handle_greeting
    description: "Greets people by name"
    peers:
      - shouter
  # The shouter tool
  - name: shouter
    payload_class: handlers.payloads.ShoutRequest
    handler: handlers.shouter.handle_shout
    description: "Makes text LOUD"
    peers:
      - console-output
  # Output handler
  - name: console-output
    payload_class: handlers.payloads.ConsoleOutput
    handler: handlers.output.handle_output
    description: "Displays text to console"
 ```
 ## Step 5: Verify Configuration
 ```bash
 xml-pipeline check config/organism.yaml
 ```
 Expected output:
 ```
 Config valid: hello-world
  Listeners: 3
  LLM backends: 0
 ```
 ## Step 6: Create Test Script
 Create `test_greeting.py`:
 ```python
 import asyncio
 from xml_pipeline.message_bus.stream_pump import StreamPump
 from xml_pipeline.config.loader import load_config
 async def main():
    # Load configuration
    config = load_config("config/organism.yaml")
    # Create and start the pump
    pump = StreamPump(config)
    await pump.start()
    print("Organism started! Injecting greeting...")
    # Create a greeting message
    greeting_xml = b"""<?xml version="1.0"?>
    <message xmlns="https://xml-pipeline.org/ns/envelope/v1">
      <meta>
        <from>test</from>
        <to>greeter</to>
      </meta>
      <greeting>
        <name>Alice</name>
      </greeting>
    </message>
    """
    # Inject the message
    await pump.inject(greeting_xml, from_id="test")
    # Give it time to process
    await asyncio.sleep(1)
    # Shutdown
    await pump.shutdown()
    print("Done!")
 if __name__ == "__main__":
    asyncio.run(main())
 ```
 ## Step 7: Run It
 ```bash
 python test_greeting.py
 ```
 Expected output:
 ```
 Organism started! Injecting greeting...
 [shouter] HELLO, ALICE! WELCOME TO XML-PIPELINE!!!!
 Done!
 ```
 ## What Just Happened?
 Let's trace the message flow:
 ### 1. Message Injection
 ```xml
 <greeting>
  <name>Alice</name>
 </greeting>
 ```
 Injected with `from=test`, `to=greeter`.
 ### 2. Pipeline Processing
 ```
 Raw bytes
    ↓
 repair_step      → Valid XML tree
    ↓
 c14n_step        → Canonicalized
    ↓
 envelope_valid   → Matches envelope.xsd
    ↓
 payload_extract  → Extracts <greeting>
    ↓
 thread_assign    → UUID: abc-123
    ↓
 xsd_validate     → Matches Greeting schema
    ↓
 deserialize      → Greeting(name="Alice")
    ↓
 routing          → target: greeter
 ```
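
The stages can be approximated in a few lines of standard-library Python. This is a heavily simplified sketch, assuming the envelope layout from the test script: it skips repair, canonicalization, and XSD validation, and `process`, `local_name`, and `child` are hypothetical names rather than xml-pipeline functions.

```python
import uuid
import xml.etree.ElementTree as ET
from dataclasses import dataclass

ENVELOPE_NS = "https://xml-pipeline.org/ns/envelope/v1"


@dataclass
class Greeting:
    name: str


def local_name(element: ET.Element) -> str:
    """Strip the '{namespace}' prefix ElementTree puts on tags."""
    return element.tag.rsplit("}", 1)[-1]


def child(parent: ET.Element, name: str) -> ET.Element:
    """First direct child with the given local name."""
    for candidate in parent:
        if local_name(candidate) == name:
            return candidate
    raise ValueError(f"<{local_name(parent)}> has no <{name}> child")


def process(raw: bytes) -> tuple[str, str, Greeting]:
    root = ET.fromstring(raw)                        # parse (no repair in this sketch)
    if root.tag != f"{{{ENVELOPE_NS}}}message":      # crude envelope check
        raise ValueError("not an envelope")
    target = child(child(root, "meta"), "to").text   # routing target
    payloads = [el for el in root if local_name(el) != "meta"]
    if len(payloads) != 1:                           # payload_extract
        raise ValueError("expected exactly one payload")
    thread_id = str(uuid.uuid4())                    # thread_assign
    if local_name(payloads[0]) != "greeting":        # stand-in for xsd_validate
        raise ValueError("unknown payload")
    greeting = Greeting(name=child(payloads[0], "name").text)  # deserialize
    return target, thread_id, greeting
```

Each real stage is a separate step that can reject the message; collapsing them into one function is only meant to show the order of operations.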
 ### 3. Handler Dispatch
 ```python
 # greeter receives:
 payload = Greeting(name="Alice")
 metadata.thread_id = "abc-123"
 metadata.from_id = "test"
 ```
 ### 4. Response Processing
 Greeter returns:
 ```python
 HandlerResponse(
    payload=ShoutRequest(text="Hello, Alice!..."),
    to="shouter",
 )
 ```
 System:
 1. Validates `shouter` is in greeter's peers ✓
 2. Serializes ShoutRequest to XML
 3. Wraps in envelope with `from=greeter`
 4. Re-injects into pipeline
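
The four steps can be sketched as a single function. This is an illustration under assumptions, not the system's code: `wrap_response` is a hypothetical name, the envelope namespace is omitted for brevity, and the payload is passed in already serialized as an element. Note that `from` is supplied by the system from the listener's registered name, never by the handler.

```python
import xml.etree.ElementTree as ET


def wrap_response(listener_name: str, peers: list[str], target: str,
                  payload: ET.Element) -> bytes:
    """Validate the target, then wrap the payload in a new envelope."""
    if target not in peers:                           # step 1: peer check
        raise PermissionError(f"{target} is not a declared peer of {listener_name}")
    message = ET.Element("message")                   # step 3: envelope
    meta = ET.SubElement(message, "meta")
    ET.SubElement(meta, "from").text = listener_name  # set by the system
    ET.SubElement(meta, "to").text = target
    message.append(payload)                           # step 2 result: serialized payload
    return ET.tostring(message)                       # bytes ready for re-injection (step 4)
```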
 ### 5. Chain Continues
 Shouter receives ShoutRequest, returns ConsoleOutput to `console-output`.
 ### 6. Chain Terminates
 `handle_output` returns `None` → chain ends.
 ## Exercises
 ### Exercise 1: Add a Counter
 Modify the shouter to count how many times it's been called:
 ```python
 # Hint: Use a module-level variable (simple) or
 # the context buffer (proper way)
 ```
 ### Exercise 2: Add Error Handling
 What happens if someone sends an empty name? Add validation:
 ```python
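 # Requires importing ConsoleOutput from handlers.payloads and adding
 # console-output to greeter's peers in config/organism.yaml.
 async def handle_greeting(payload: Greeting, metadata: HandlerMetadata):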
    if not payload.name.strip():
        return HandlerResponse(
            payload=ConsoleOutput(text="Error: Name required", source="greeter"),
            to="console-output",
        )
    # ... rest of handler
 ```
 ### Exercise 3: Make Greeter an LLM Agent
 Convert greeter to use an LLM for creative greetings:
 ```python
 from xml_pipeline.platform.llm_api import complete
 async def handle_greeting(payload: Greeting, metadata: HandlerMetadata):
    response = await complete(
        model="grok-4.1",
        messages=[
            {"role": "system", "content": "Generate a creative, friendly greeting."},
            {"role": "user", "content": f"Greet someone named {payload.name}"},
        ],
    )
    return HandlerResponse(
        payload=ShoutRequest(text=response.content),
        to="shouter",
    )
 ```
 Don't forget to add LLM configuration:
 ```yaml
 llm:
  backends:
    - provider: xai
      api_key_env: XAI_API_KEY
 ```
 ## Summary
 You've learned:
 - **Payloads**: Define messages with `@xmlify` dataclasses
 - **Handlers**: Async functions that process and respond
 - **Configuration**: Wire everything in organism.yaml
 - **Message Flow**: How messages traverse the pipeline
 - **Chaining**: Handlers forward to each other via `HandlerResponse`
 ## Next Steps
 - [[Writing Handlers]] — More handler patterns
 - [[Configuration]] — Full configuration reference
 - [[Architecture Overview]] — Deep dive into internals