fixing docs

2026-01-06 19:28:25 -08:00 · 2026-01-06 19:28:25 -08:00 · 86ad3c7eb2
commit 86ad3c7eb2
parent e6aa9cbd61
4 changed files with 0 additions and 396 deletions
--- a/README_v2.0.md
+++ b/README_v2.0.md
@ -1,94 +0,0 @@
 # AgentServer — The Living Substrate (v2.0)
 ***"It just works... safely."***
 **January 03, 2026**  
 **Architecture: Autonomous Schema-Driven, Turing-Complete Multi-Agent Organism**
 ## The Rant
 **Why XML?**  
 [Why not JSON?](docs/why-not-json.md)
 XML is the sovereign wire format — standards-based, self-describing, attack-resistant, and evolvable without drift. JSON was a quick hack that escaped into the wild and became the default for everything, including AI tool calling, where its brittleness causes endless prompt surgery and validation headaches.
 This project chooses XML deliberately. The organism enforces contracts exactly (XSD validation, no transcription bugs), tolerates dirty streams (repair + dummy extraction), and keeps reasoning visible. No fragile conventions. No escaping hell. Just bounded, auditable computation.
 Read the full rant [here](docs/why-not-json.md) for the history, pitfalls, and why XML wins permanently.
 ## What It Is
 AgentServer is a production-ready substrate for the `xml-pipeline` nervous system. Version 2.0 stabilizes the design around exact XSD validation, typed dataclass handlers, mandatory hierarchical threading, and strict out-of-band privileged control.
 See [Core Architectural Principles](docs/core-principles-v2.0.md) for the single canonical source of truth.
 ## Core Philosophy
 - **Autonomous DNA:** Listeners declare their contract via `@xmlify` dataclasses; the organism auto-generates XSDs, examples, and tool prompts.
 - **Schema-Locked Intelligence:** Payloads validated directly against XSD (lxml) → deserialized to typed instances → pure handlers.
 - **Multi-Response Tolerance:** Handlers return raw bytes; bus wraps in `<dummy></dummy>` and extracts multiple payloads (perfect for parallel tool calls or dirty LLM output).
 - **Computational Sovereignty:** Turing-complete via self-calls, subthreading primitives, and visible reasoning — all bounded by thread hierarchy and local-only control.
 ## Developer Experience — Create a Listener in 12 Lines
 **No manual schemas. No brittle JSON conventions. No hand-written prompts.**  
 Just declare a dataclass contract and a one-line human description. The organism handles validation, XSD, examples, and tool prompts automatically.
 ```python
 from xmlable import xmlify
 from dataclasses import dataclass
 from xml_pipeline import Listener, bus  # bus is the global MessageBus
@xmlify
@dataclass
 class AddPayload:
    a: int
    b: int
 def add_handler(payload: AddPayload) -> bytes:
    result = payload.a + payload.b
    return f"<result>{result}</result>".encode("utf-8")
 Listener(
    payload_class=AddPayload,
    handler=add_handler,
    name="calculator.add",
    description="Adds two integers and returns their sum."
 ).register()  # ← Boom: XSD, example, prompt auto-generated + registered
 ```
 The organism now speaks `<add>` — fully validated, typed, and discoverable.<br/>
 Unlike rigid platforms requiring custom mappings or fragile item structures, this is pure Python — typed, testable, and sovereign.
 ## Key Features
 ### 1. The Autonomous Schema Layer
 - Dataclass → cached XSD + example + rich tool prompt (mandatory description + field docs).
 - Namespaces: `https://xml-pipeline.org/ns/<category>/<name>/v1` (served live via domain for discoverability).
 ### 2. Thread-Based Lifecycle & Reasoning
 - Mandatory `<thread/>` with hierarchical IDs for reliable subthreading and audit trails.
 - LLM agents reason via open self-calls and optional `<todo-until/>`.
 - All thought steps visible as messages — no hidden state.
 ### 3. Message Pump
 - Single linear pipeline with repair, C14N, XSD validation, deserialization, handler execution, and multi-payload extraction.
 - Supports clean tools and forgiving LLM streams alike.
 - Thread-base message queue with bounded memory.
 ### 4. Structural Control
 - Bootstrap from `organism.yaml`.
 - Runtime changes (hot-reload, add/remove listeners) via local-only OOB channel (localhost WSS or Unix socket — GUI-ready).
 - Main bus oblivious to privileged ops.
 ### 5. Federation & Introspection
 - YAML-declared gateways with trusted keys.
 - Controlled meta queries (schema/example/prompt/capability list).
 ## Technical Stack
 - **Validation & Parsing:** lxml (XSD, C14N, repair) + xmlable (round-trip).
 - **Protocol:** Mandatory WSS (TLS) + TOTP on main port.
 - **Identity:** Ed25519 (signing, federation, privileged).
 - **Format:** Exclusive C14N XML (wire sovereign).
 ## Why This Matters
 AgentServer v2.0 is a bounded, auditable, owner-controlled organism where the **XSD is the security**, the **thread is the memory**, and the **OOB channel is the sovereignty**.
 One port. Many bounded minds. Autonomous yet obedient evolution. 🚀
 ---
 *XML wins. Safely. Permanently.*
--- a/docs/core-principles-v2.0.md
+++ b/docs/core-principles-v2.0.md
@ -1,103 +0,0 @@
 # AgentServer v2.0 — Core Architectural Principles
 **January 03, 2026**  
 **Architecture: Autonomous Schema-Driven, Turing-Complete Multi-Agent Organism**
 These principles are the single canonical source of truth for the project. All documentation, code, and future decisions must align with this file.
 ## Identity & Communication
 - All traffic uses the universal `<message>` envelope defined in `envelope.xsd` (namespace `https://xml-pipeline.org/ns/envelope/v1`).
 - Mandatory `<from/>` and `<thread/>` (convo_id string, supports hierarchical dot notation for subthreading, e.g., "root.1.research").
 - Optional `<to/>` (rare direct routing; most flows use payload namespace/root).
 - Exclusive C14N on ingress and egress.
 - Malformed XML repaired on ingress; repairs logged in `<huh/>` metadata.
 ## Identity Injection & Handler Purity
 - Handlers are pure, stateless functions with no knowledge of routing, thread context, or their own registered name.
 - On ingress (external or gateway messages): <from> is provided and authenticated by the client/gateway (enforced by envelope validation).
 - On response generation (after handler execution and multi-payload extraction):
  - The dispatcher injects <from> using the executing listener's registered name (e.g., "calculator.add" or "researcher").
  - For meta/primitive responses: <from> is injected as "core".
 - &ltthread&gt is inherited from the incoming message (or assigned/updated for primitives like spawn-thread).
 - &ltto&gt remains optional and rarely used.
 - This ensures every enveloped message has a trustworthy, auditable <from> without handler involvement, preventing spoofing and keeping capability code minimal/testable.
 ## Configuration & Composition
 - YAML file (`organism.yaml`) is the bootstrap source of truth, loaded at startup.
 - Defines initial listeners, agents, gateways, meta privileges, and OOB channel configuration.
 - Runtime structural changes (add/remove listeners, rewire agents, etc.) via local-only privileged commands on the dedicated OOB channel (hot-reload capability).
 - No remote or unprivileged structural changes ever.
 ## Autonomous Schema Layer
 - Listeners defined by `@xmlify`-decorated dataclass (payload contract) + pure handler function.
 - Mandatory human-readable description string (short "what this does" blurb for tool prompt lead-in).
 - Registration (at startup or via hot-reload) automatically generates:
  - XSD cached on disk (`schemas/<name>/v1.xsd`)
  - Example XML
  - Tool description prompt fragment (includes description, params with field docs if present, example input)
 - All capability namespaces under `https://xml-pipeline.org/ns/<category>/<name>/v1`.
 - Root element derived from payload class name (lowercase) or explicit.
 ## Message Pump
 - Single linear pipeline on main port: ingress → repair → C14N → envelope validation → payload routing.
 - Routing key = (payload namespace, root element); unique per listener.
 - Meta requests (`https://xml-pipeline.org/ns/meta/v1`) handled by privileged core handler.
 - User payloads:
  - Validated directly against listener's cached XSD (lxml)
  - On success → deserialized to typed dataclass instance (`xmlable.from_xml`)
  - Handler called with instance → returns raw bytes (XML fragment, possibly dirty/multi-root)
  - Bytes wrapped in `<dummy></dummy>` → repaired/parsed → all top-level payload elements extracted
  - Each extracted payload wrapped in separate response envelope (inherits thread/from, optional new subthread if primitive used)
  - Enveloped responses buffered and sent sequentially
 - Supports single clean response, multi-payload emission (parallel tools/thoughts), and dirty LLM output tolerance.
 - Message pump tracks token budgets per agent and thread, enforcing token limits and preventing abuse. The LLM abstraction layer informs the message bus on the actual token usage.
 - Message pump uses asynchronous non-blocking I/O for maximum throughput.
 ## Reasoning & Iteration
 - LLM agents iterate via open self-calls (same root tag, same thread ID).
 - Conversation thread = complete memory and audit trail (all messages logged).
 - Subthreading natively supported via hierarchical thread IDs and primitives (e.g., reserved payload to spawn "parent.sub1").
 - Optional structured constructs like `<todo-until/>` for visible planning.
 - No hidden loops or state machines; all reasoning steps are visible messages.
 - Thread management follows the dynamic call tracing model (see thread-management.md). Paths are built by appending target listener names on emission, with automatic popping on responses. Agents remain oblivious, enabling natural delegation and parallelism.
 ## Security & Sovereignty
 - Privileged messages (per `privileged-msg.xsd`) handled exclusively on dedicated OOB channel.
 - OOB channel bound to localhost by default (safe for local GUI); separate port/socket from main bus.
 - Main MessageBus and pump oblivious to privileged operations — no routing or handling for privileged roots.
 - Remote privileged attempts impossible (channel not exposed); any leak to main port logged as security event and dropped.
 - Ed25519 identity key used for envelope signing, federation auth, and privileged command verification.
 - No agent may modify organism structure, register listeners, or access host resources beyond declared scope.
 - “No Paperclippers” manifesto injected as first system message for every LLM-based listener.
 ## Federation
 - Gateways declared in YAML with trusted remote public key.
 - Remote tools referenced by gateway name in agent tool lists.
 - Regular messages flow bidirectionally; privileged messages never forwarded or accepted.
 ## Introspection (Meta)
 - Controlled via YAML flags (`allow_list_capabilities`, `allow_schema_requests`, etc.).
 - Supports `request-schema`, `request-example`, `request-prompt`, `list-capabilities`.
 - Remote meta queries optionally allowed per YAML (federation peers).
 ## Technical Constraints
 - Mandatory WSS (TLS) + TOTP on main port.
 - OOB channel WSS or Unix socket, localhost-default.
 - Internal: lxml trees → XSD validation → xmlable deserialization → dataclass → handler → bytes → dummy extraction.
 - Single process, async non-blocking.
 - XML is the sovereign wire format; everything else is implementation detail.
 ## Scheduled Computation
 - Timers and delays implemented as normal listeners using async sleeps.
 - Caller idles naturally; wakeup messages bubble back via standard tracing.
 - Enables recurrent tasks (e.g., periodic monitoring) without blocking or external schedulers.
 ## Bounded Stateful Listeners
 - Pure tools remain stateless.
 - Stateful capabilities (e.g., calculator memory, game state) store data per thread path UUID.
 - Ensures isolation across conversations, automatic cleanup on idle, and minimal mutable state.
 - Handler closes over or receives UUID for access — still oblivious to readable path.
 ## Resource Stewardship 
 - The Message Pump ensures fair execution and prevents "Paperclip" runaway scenarios via internal Thread-Level Scheduling. Every thread is subject to Token-Rate Monitoring and Fair-Share Queuing, ensuring that a high-volume agent (like a deep-thinking LLM) cannot block high-priority system events or starve simpler organs (like tools).
 These principles are now locked. All existing docs will be updated to match this file exactly. Future changes require explicit discussion and amendment here first.
--- a/docs/message-pump-v1.0.md
+++ b/docs/message-pump-v1.0.md
@ -1,60 +0,0 @@
 # Message Pump — End-to-End Flow (v2.0)
 The AgentServer message pump processes individual messages through a single, linear, attack-resistant pipeline. The outer dispatcher runs a continuous async loop, draining per-thread message buffers (queues) until empty — enabling persistent, branched reasoning without artificial limits.
 ```mermaid
 flowchart TD
    A["WebSocket Ingress\n(enqueue to thread buffer)"] --> B["Dispatcher Loop:\nSelect next message\n(per thread_scheduling strategy)"]
    B --> C["Repair + Exclusive C14N"]
    C --> D["Envelope Validation (lxml)"]
    D --> E["Extract Payload Tree"]
    E --> F{"Payload Namespace?"}
    F -->|meta/v1| G["Core Meta Handler\n(introspection & reserved primitives)"]
    F -->|capability| H["Route by (namespace, root)"]
    H --> I["Validate Payload vs Listener XSD (lxml)"]
    I --> J["Deserialize to Dataclass Instance (xmlable)"]
    J --> K["Call handler(instance) → raw bytes"]
    K --> L["Wrap bytes in <dummy></dummy>"]
    L --> M["Repair/Parse → Extract all top-level payloads"]
    M --> N["Wrap each payload in separate envelope\n(enqueue to target thread buffers)"]
    G --> N
    N --> O["Exclusive C14N + Sign"]
    O --> P["WebSocket Egress\n(sequential per connection)"]
    P --> B["Continue dispatcher loop if buffers non-empty"]
 ```
 ## Detailed Stages (Per-Message)
 1. **Ingress/Enqueue**: Raw bytes → repair → preliminary tree → enqueue to target thread buffer.
 2. **Dispatcher Loop**: Single async non-blocking loop selects next message from per-thread queues (breadth-first default for fairness).
 3. **Processing**:
   - Full repair + C14N.
   - Envelope validation.
   - Routing decision:
     - **Meta Branch** (`https://xml-pipeline.org/ns/meta/v1` namespace): Handled directly by privileged core handler (no listener lookup or XSD validation needed).
       - Purpose: Introspection and reserved organism primitives.
       - Examples:
         - `request-schema`, `request-example`, `request-prompt`, `list-capabilities` (returns XSD bytes, example XML, prompt fragment, or capability list).
         - Thread primitives like `spawn-thread`, `clear-context`.
       - Privileged: Controlled via YAML `meta` flags (e.g., `allow_schema_requests: "admin"` or "none"). Remote queries optional.
       - Why separate: Faster, safer (no user listener involved), topology privacy preserved.
     - Capability namespace → normal listener route (XSD validation + deserialization).
   - Typed handler call → raw bytes.
 4. **Response Handling**:
   - Dummy wrap → extract multi-payloads.
   - Each enqueued as new message(s) in appropriate thread buffer(s).
 5. **Egress**: Dequeue → C14N/sign → send.
 ## Key Properties
 - Continuous looping until all thread buffers empty — natural iteration/subthreading without one-shot constraints.
 - Multi-payload enqueues enable parallel branches/thoughts.
 - Scheduling balances deep dives vs fair exploration.
 - Attack-resistant at every step.
 XML in → queued → processed → multi-out → re-queued. Loops forever if needed. Safely. Permanently.
--- a/docs/message-pump-v2.0.md
+++ b/docs/message-pump-v2.0.md
@ -1,139 +0,0 @@
 # Message Pump — End-to-End Flow (v2.0)
 The AgentServer message pump processes individual messages through a single, linear, attack-resistant pipeline. The outer dispatcher runs a continuous async loop, draining per-thread message buffers (queues) until empty — enabling persistent, branched reasoning without artificial limits.
 ```mermaid
 flowchart TD
    subgraph Ingress
        A[Raw Bytes] --> B[Repair + C14N]
        B --> C[Enqueue to Thread Queue]
    end
    subgraph DispatcherLoop
        D[Dequeue Next Message] --> E{Envelope Valid?}
        E -->|No| F[Discard / System Error]
        E -->|Yes| G{Payload Namespace?}
        G -->|Meta| H["Core Handler<br>(raw payload)"]
        G -->|Normal| I[Validate Payload<br>vs Cached XSD]
        I -->|Fail| F
        I -->|Pass| J["Deserialize to<br>@xmlify Dataclass"]
        J --> K["Call Handler<br>(typed instance → bytes)"]
        H --> L["Wrap bytes in<br>&ltdummy&gt&lt/dummy&gt"]
        K --> L
    end
    subgraph Response
        L --> M[Repair + Extract<br>Multi-Payloads]
        M --> N{Extracted Payloads?}
        N -->|0| O["Optional: Inject<br>&ltagent-error&gt or Idle"]
        N -->|1 or more| P[For Each Payload:]
        P --> Q[Determine Target Listener]
        Q --> R["Append Target Name<br>to Current Path<br>(new thread ID)"]
        R --> S["Inject <from><br>(listener name)"]
        S --> T["Enqueue New Message(s)<br>to Deepened Path(s)<br>(parallel if multi)"]
        T --> U[On Response Bubbling:<br>Pop Last Segment<br>Route to Parent Path]
    end
    C --> DispatcherLoop
    DispatcherLoop --> Response
    Response --> DispatcherLoop
    style Ingress fill:#f0f8ff
    style DispatcherLoop fill:#f0fff0
    style Response fill:#fff0f0
 ```
 ```mermaid
 flowchart TD
    subgraph MessagePump
    subgraph Init
    start([Start])
    wrapstart["Boot Msg<br>&ltmessage&gt{...}&lt/message&gt"]
    end
    enq1([QUEUE 1])
    rawwaiting{Raw<br>Msg<br>Waiting?}
    waitRaw([Wait])
    subgraph Process
    extract["Extract<br>Tree"]
    split["Split<br>Tree"]
    subgraph foreach [For Each Message]
        getmsg[Get Msg]
        badTo{&ltTo&gt Missing?}
        endnoto([Discard])
        addfrom["Add .from"]
        repair[Repair + C14N]
        validate[Validate]
        invalidMsg{Bad<br>Message?}
        badmsg([Discard])
        more{More?}
    end
    enqueue([QUEUE 2])
    xmlWaiting{XML<br>Waiting?}
    waitXml([Wait])
    subgraph Async
    lookup[Lookup Listener] 
    route[Route]
    wait[await Response]
    wrap["Wrap<br>&ltfrom&gt{...}&lt/from&gt"]
    end
    end
    end
    start --> wrapstart --> enq1 --> rawwaiting 
    rawwaiting --> |NO| waitRaw
    rawwaiting ---> |YES| extract
    extract --> split --> foreach
    getmsg --> badTo
    badTo --> |YES| endnoto
    badTo --> |NO| addfrom --> repair --> validate --> invalidMsg
    invalidMsg ---> |NO| more --> |Yes| getmsg
    invalidMsg --> |YES| badmsg
    more --> |NO| enqueue
    enqueue --> xmlWaiting
    xmlWaiting --> |NO| waitXml
    xmlWaiting ---> |YES| lookup --> route --> wait --> wrap --> enq1
 ```
 ## Detailed Stages (Per-Message)
 ### Boot Message
   - Since all agents are listeners, there would be no way for a client to initiate a message pump.
   - The boot message is a dummy message that is enqueued to the root thread buffer. Any listener may chose to register a root tag for it.
   - The root thread buffer is the only one that is drained by the dispatcher loop.
   - if a listener (like a human agent) is registered for the boot message, it will receive the boot message and then async await for keyboard input.
 ### Queue 1
   - The first buffer holds raw unprocessed messages from the network.
 ### Queue 2
   - The second buffer holds messages that have been processed and are ready to be sent back to the network.
   - 
 1. **Ingress/Enqueue**: Raw bytes → repair → preliminary tree → enqueue to target thread buffer.
 2. **Dispatcher Loop**: Single async non-blocking loop selects next message from per-thread queues (breadth-first default for fairness).
 3. **Processing**:
   - Full repair + C14N.
   - Envelope validation.
   - Routing decision:
     - **Meta Branch** (`https://xml-pipeline.org/ns/meta/v1` namespace): Handled directly by privileged core handler (no listener lookup or XSD validation needed).
       - Purpose: Introspection and reserved organism primitives.
       - Examples:
         - `request-schema`, `request-example`, `request-prompt`, `list-capabilities` (returns XSD bytes, example XML, prompt fragment, or capability list).
         - Thread primitives like `spawn-thread`, `clear-context`.
       - Privileged: Controlled via YAML `meta` flags (e.g., `allow_schema_requests: "admin"` or "none"). Remote queries optional.
       - Why separate: Faster, safer (no user listener involved), topology privacy preserved.
     - Capability namespace → normal listener route (XSD validation + deserialization).
   - Typed handler call → raw bytes.
 4. **Response Handling**:
   - Dummy wrap → extract multi-payloads.
   - Each enqueued as new message(s) in appropriate thread buffer(s).
 5. **Egress**: Dequeue → C14N/sign → send.
 ## Key Properties
 - Continuous looping until all thread buffers empty — natural iteration/subthreading without one-shot constraints.
 - Multi-payload enqueues enable parallel branches/thoughts.
 - Scheduling balances deep dives vs fair exploration.
 - Attack-resistant at every step.
 XML in → queued → processed → multi-out → re-queued. Loops forever if needed. Safely. Permanently.