add system overview: how and why the pieces fit together

2026-02-28 02:03:52 -08:00
parent e638376a1f
commit c700721806
1 changed files with 419 additions and 0 deletions
--- a/OPENCLAW_TRICKS.md
+++ b/OPENCLAW_TRICKS.md
@@ -6,6 +6,425 @@ production setup that's been running since early 2026.

 ---

+## How This System Evolved
+
+This section explains what we built, why, and how the pieces fit together. The
+rest of the document has the specific prompts and schemas — this part gives you
+the big picture so you understand the design decisions.
+
+### The Starting Point
+
+OpenClaw gives you a persistent workspace directory and the ability to run
+tools (shell, web, browser, messaging). Out of the box, the agent wakes up
+fresh every session with no memory of what happened before. The core challenge
+is: **how do you turn a stateless LLM into a stateful personal assistant that
+knows who you are, where you are, what you need, and what it was working on?**
+
+The answer is files. The workspace IS the agent's brain. Every piece of state,
+every rule, every memory lives in files that the agent reads on startup and
+updates as things change.
+
+### The Workspace Files — Separation of Concerns
+
+We started with one big AGENTS.md that had everything: rules, procedures,
+medication instructions, git workflow, travel preferences. It worked at first
+but quickly became unwieldy. The context window was getting eaten by
+instructions the agent didn't need for most tasks.
+
+The solution was **factoring out rulesets into focused files:**
+
+- **AGENTS.md** — the master file. Responsibilities, session startup
+  procedure, high-level rules. Think of it as the table of contents that
+  points to everything else.
+- **SOUL.md** — personality, tone, values. Separated because it's
+  philosophical, not procedural. Also: it's fun to let the agent evolve this
+  one over time.
+- **USER.md** — info about the human. Timezone, communication preferences,
+  basics the agent needs every session.
+- **MEMORY.md** — curated long-term memory. Only loaded in private sessions
+  (security — prevents leaking personal context in group chats).
+- **HEARTBEAT.md** — what to check on periodic heartbeat polls. Kept small
+  intentionally to minimize token burn on frequent checks.
+- **TOOLS.md** — environment-specific notes (hostnames, device names, channel
+  IDs). Separated from skills because this is YOUR setup, skills are shared.
+
+Then the procedural rulesets got their own files too:
+
+- **`memory/medications-instructions.md`** — full medication protocol. Dosages,
+  schedules, safety rules, overdose prevention logic. Only loaded when
+  medication tasks come up.
+- **`memory/checklist-pr.md`** — PR quality gate. What to verify before pushing,
+  after sub-agent pushes, before assigning to the human for review.
+- **`memory/checklist-flights.md`** — flight time verification, prep block
+  creation, landing checklist triggers.
+- **`memory/checklist-medications.md`** — pre-action verification before
+  reporting medication status or sending reminders.
+- **`memory/checklist-messaging.md`** — rules for every outgoing message: URL
+  verification, timezone conversion, status claim verification.
+- **`memory/landing-checklist.md`** — post-flight procedures (update location,
+  timezone, check meds, sync calendar).
+
+The key insight: **MEMORY.md has a "checklists" section at the very top** that
+says "before doing X, read file Y." The agent reads MEMORY.md at session start,
+sees the checklist index, and knows which file to consult before any action.
+This way the detailed instructions aren't always in context — they're loaded
+on-demand.
+
+```markdown
+## ⛔ CHECKLISTS (read the relevant file before acting)
+
+- **Any PR/code work** → `memory/checklist-pr.md`
+- **Any message to the user** → `memory/checklist-messaging.md`
+- **Medications** → `memory/checklist-medications.md`
+- **Flights/travel** → `memory/checklist-flights.md`
+```
+
+### The Daily Context State File
+
+This was probably the single most impactful addition. It's a JSON file
+(`memory/daily-context.json`) that every session reads on every message. It
+tracks the current state of the human: where they are, what timezone they're
+in, whether they're sleeping, when they last took meds, when they last sent a
+message.
+
+Why JSON instead of markdown? Because it's machine-readable. The agent can
+update individual fields programmatically, and the structure is unambiguous.
+Markdown is great for instructions; JSON is great for state.
+
+The daily context file means:
+- The agent always knows the human's timezone (critical for a frequent
+  traveler)
+- Medication reminders fire based on actual timestamps, not guesses
+- Sleep predictions inform when to send alerts vs stay quiet
+- Location tracking happens automatically
+
+### The Two-Tier Memory System
+
+Daily files (`memory/YYYY-MM-DD.md`) are raw logs — what happened each day.
+Topic tables, decisions, lessons, notes. Think of these as a daily journal.
+
+MEMORY.md is curated long-term memory — the distilled essence. Standing rules,
+key facts, lessons learned. Think of this as what a human would "just know"
+about their life.
+
+During heartbeats, the agent periodically reviews recent daily files and
+promotes significant items to MEMORY.md, and removes stale info. It's
+explicitly modeled on how human memory works: raw experience gets processed
+into lasting knowledge, and irrelevant details fade.
+
+The security model: MEMORY.md is only loaded in private (1:1 DM) sessions.
+In group chats, the agent works from daily files and daily-context.json only.
+This prevents personal context from leaking into shared channels.
+
+### Medication Tracking
+
+This is a safety-critical system. The design is deliberately paranoid:
+
+- **CSV as source of truth.** Not the daily-context boolean, not the agent's
+  memory — the CSV log file is authoritative.
+- **Double-verification before any action.** The daily context has a
+  `hasTakenDailyMedsToday` boolean AND a `dailyMedsTimestamp`. A midnight
+  cron resets the boolean. But if the human is in a timezone ahead of the
+  server, the reset happens at the wrong time. So the rule is: always verify
+  the timestamp falls on today's date in the human's timezone. The boolean
+  is a convenience hint, never the source of truth.
+- **Interval medications anchored to actual doses.** Some meds are "every 5
+  days" or "every 14 days." The next dose date is calculated from the last
+  actual dose timestamp, not from the intended schedule. If a dose is missed,
+  the next date is unknown until the missed dose is taken and logged.
+- **Overdose prevention.** The agent blocks logging if it detects a same-day
+  duplicate batch. This is the highest-priority safety rule — a duplicate
+  daily batch of certain medications could cause cardiac arrest.
+- **Escalating alerts.** If daily meds are >26h since last dose, the agent
+  escalates aggressively — ntfy push notification if chat messages go
+  unacknowledged.
+
+The medication instructions file is loaded on-demand (not every session),
+keeping context costs low for non-medication conversations.
+
+### Sleep Tracking
+
+The agent infers sleep from activity gaps rather than requiring explicit
+"I'm going to sleep" statements. On every message, it checks: was there a
+gap since the last message that overlaps with the predicted sleep window? If
+so, log sleep start = last activity before gap, wake = first activity after.
+
+Activity isn't just chat messages — it includes Gitea commits, comments,
+scheduled flight departures, and any other observable actions.
+
+Sleep predictions drift over time (the human in this setup tends to sleep
+~30min later each day), so the agent tracks the trend and extrapolates.
+Caffeine intake adjusts predictions forward. The predictions feed into:
+- When to send alerts vs stay quiet
+- Sitrep "sleep conflict" warnings (appointment during predicted sleep)
+- General awareness of the human's schedule
+
+### Location & Timezone Tracking
+
+The agent always knows where the human is. This matters because:
+- Times must always be displayed in the human's local timezone, not the
+  server's timezone
+- "Today" and "tomorrow" mean different things in different timezones
+- Medication timing needs to account for timezone changes
+- Flight prep needs local airport info
+
+The landing checklist (triggered automatically after every flight) updates
+location, timezone, nearest airport, and lodging in the daily context file.
+It also checks if any cron jobs have hardcoded timezones that need updating.
+
+### The Gitea Notification Poller
+
+OpenClaw has heartbeats, but those are periodic (every ~30min). For Gitea
+issues and PRs, we wanted near-realtime response. The solution: a tiny Python
+script that polls the Gitea notifications API every 2 seconds and wakes the
+agent via OpenClaw's `/hooks/wake` endpoint when new notifications arrive.
+
+Key design decisions:
+- **The poller never marks notifications as read.** That's the agent's job
+  after it processes them. This prevents the poller and agent from racing.
+- **It tracks notification IDs, not counts.** This way it only fires on
+  genuinely new notifications, not re-reads of existing ones.
+- **The wake message tells the agent to route output to Gitea/Mattermost,
+  not to DM.** This prevents chatty notification processing from disturbing
+  the human.
+- **Zero dependencies.** Just Python stdlib (`urllib`, `json`, `time`). Runs
+  anywhere.
+
+Here's the full source:
+
+```python
+#!/usr/bin/env python3
+"""
+Gitea notification poller.
+Polls for unread notifications and wakes OpenClaw when the count
+changes. The AGENT marks notifications as read after processing —
+the poller never marks anything as read.
+
+Required env vars:
+  GITEA_URL        - Gitea instance URL
+  GITEA_TOKEN      - Gitea API token
+  HOOK_TOKEN       - OpenClaw hooks auth token
+
+Optional env vars:
+  GATEWAY_URL      - OpenClaw gateway URL (default: http://127.0.0.1:18789)
+  POLL_DELAY       - Delay between polls in seconds (default: 2)
+"""
+
+import json
+import os
+import sys
+import time
+import urllib.request
+import urllib.error
+
+GITEA_URL = os.environ.get("GITEA_URL", "").rstrip("/")
+GITEA_TOKEN = os.environ.get("GITEA_TOKEN", "")
+GATEWAY_URL = os.environ.get("GATEWAY_URL", "http://127.0.0.1:18789").rstrip(
+    "/"
+)
+HOOK_TOKEN = os.environ.get("HOOK_TOKEN", "")
+POLL_DELAY = int(os.environ.get("POLL_DELAY", "2"))
+
+
+def check_config():
+    missing = []
+    if not GITEA_URL:
+        missing.append("GITEA_URL")
+    if not GITEA_TOKEN:
+        missing.append("GITEA_TOKEN")
+    if not HOOK_TOKEN:
+        missing.append("HOOK_TOKEN")
+    if missing:
+        print(
+            f"ERROR: Missing required env vars: {', '.join(missing)}",
+            file=sys.stderr,
+        )
+        sys.exit(1)
+
+
+def gitea_unread_ids():
+    """Return set of unread notification IDs."""
+    req = urllib.request.Request(
+        f"{GITEA_URL}/api/v1/notifications?status-types=unread",
+        headers={"Authorization": f"token {GITEA_TOKEN}"},
+    )
+    try:
+        with urllib.request.urlopen(req, timeout=10) as resp:
+            notifs = json.loads(resp.read())
+            return {n["id"] for n in notifs}
+    except Exception as e:
+        print(
+            f"WARN: Gitea API failed: {e}", file=sys.stderr, flush=True
+        )
+        return set()
+
+
+def wake_openclaw(count):
+    text = (
+        f"[Gitea Notification] {count} new notification(s). "
+        "Check your Gitea notification inbox via API, process them, "
+        "and mark as read when done. "
+        "Route all output to Gitea comments or Mattermost #git/#claw. "
+        "Do NOT reply to this session — respond with NO_REPLY."
+    )
+    payload = json.dumps({"text": text, "mode": "now"}).encode()
+    req = urllib.request.Request(
+        f"{GATEWAY_URL}/hooks/wake",
+        data=payload,
+        headers={
+            "Authorization": f"Bearer {HOOK_TOKEN}",
+            "Content-Type": "application/json",
+        },
+        method="POST",
+    )
+    try:
+        with urllib.request.urlopen(req, timeout=5) as resp:
+            status = resp.status
+        print(f"  Wake responded: {status}", flush=True)
+        return True
+    except Exception as e:
+        print(
+            f"WARN: Failed to wake OpenClaw: {e}",
+            file=sys.stderr,
+            flush=True,
+        )
+        return False
+
+
+def main():
+    check_config()
+    print(
+        f"Gitea notification poller started (delay={POLL_DELAY}s)",
+        flush=True,
+    )
+
+    last_seen_ids = gitea_unread_ids()
+    print(
+        f"Initial unread: {len(last_seen_ids)} notification(s)", flush=True
+    )
+
+    while True:
+        time.sleep(POLL_DELAY)
+
+        current_ids = gitea_unread_ids()
+        new_ids = current_ids - last_seen_ids
+
+        if not new_ids:
+            last_seen_ids = current_ids
+            continue
+
+        ts = time.strftime("%H:%M:%S")
+        print(
+            f"[{ts}] {len(new_ids)} new notification(s) "
+            f"({len(current_ids)} total unread), waking agent",
+            flush=True,
+        )
+
+        wake_openclaw(len(new_ids))
+        last_seen_ids = current_ids
+
+
+if __name__ == "__main__":
+    main()
+```
+
+Run it as a background service (launchd on macOS, systemd on Linux) with the
+env vars set. It's intentionally simple — no frameworks, no async, no
+dependencies.
+
+### The Daily Diary
+
+Every day gets a `memory/YYYY-MM-DD.md` file. The agent appends a topic
+summary to a table after every meaningful conversation:
+
+```markdown
+## Topics
+
+| Time (TZ) | Topic                                              |
+| ---------- | ------------------------------------------------- |
+| 14:30      | Discussed PR review workflow                      |
+| 16:00      | Medication logged                                 |
+| 18:45      | Flight prep blocks created for tomorrow's flight  |
+```
+
+This serves multiple purposes:
+- Any session can see what's been discussed today without loading full
+  conversation history
+- The agent can use `sessions_history` to get details on a specific topic
+  if needed
+- During memory maintenance, the agent reviews these to decide what's worth
+  promoting to MEMORY.md
+- It's a simple audit trail of what happened
+
+### The Requirement Capture Rule
+
+One of the most important rules: **every single requirement, preference, or
+instruction the human provides MUST be captured in a file immediately.** Not
+"mentally noted" — written to disk. Because the agent wakes up fresh every
+session, a "mental note" is worthless. If it's not in a file, it didn't
+happen.
+
+This applies to everything: project rules ("no mocks in tests"), workflow
+preferences ("fewer PRs, don't over-split"), corrections, new policies.
+Immediate write to the daily file, and to MEMORY.md if it's a standing rule.
+
+### PII-Aware Output Routing
+
+A lesson learned the hard way: **the audience determines what you can say,
+not who asked.** If the human asks for a medication status report in a group
+channel, the agent can't just dump it there — other people can read it. The
+rule: if the output would contain PII and the channel isn't private, redirect
+to DM and reply in-channel with "sent privately."
+
+This is enforced at multiple levels:
+- AGENTS.md has a warning banner at the top
+- The checklist system catches it before action
+- Channel-specific rule files (like our `memory/JAMES_CHAT.md`) define what's
+  shareable per-channel
+
+### Sub-Agent Isolation
+
+When spawning coding sub-agents for PR work, each one MUST clone to a fresh
+temporary directory. Never share git clones between agents — dirty working
+directories cause false CI results, merge conflicts, and wrong file state.
+The rule: `cd $(mktemp -d) && git clone <url> . && ...`
+
+### The Heartbeat System
+
+OpenClaw polls the agent periodically with a configurable prompt. The agent
+reads HEARTBEAT.md and decides what to do. We keep HEARTBEAT.md small
+(focused checklist) to minimize token burn on these frequent checks.
+
+The heartbeat handles:
+- Gitea inbox triage (check for new assignments)
+- Flight prep block creation (look ahead 7 days)
+- Open project review (can I take a step on anything?)
+- Workspace sync (commit and push changes)
+- Periodic memory maintenance
+
+State tracking in `memory/heartbeat-state.json` prevents redundant checks
+(e.g., don't re-check email if you checked 10 minutes ago).
+
+The key output rule: heartbeats should either be `HEARTBEAT_OK` (nothing to
+do) or a direct alert. Work narration goes to a designated status channel,
+never to the human's DM.
+
+### Putting It All Together
+
+The system works as a loop:
+
+1. **Session starts** → read SOUL, USER, daily-context, today's daily file
+2. **Message arrives** → check daily-context for state changes (sleep gap?
+   location change? meds overdue?), respond to the message, update state
+3. **Heartbeat fires** → check inbox, projects, flights, sync workspace
+4. **External event** (Gitea poller wake) → process notification, respond
+   via appropriate channel
+5. **Session ends** → state persists in files for next session
+
+The files are the continuity. The agent is stateless; the workspace is not.
+
+---
+
 ## Table of Contents

 1. [Workspace Bootstrapping](#workspace-bootstrapping)