Email Agent System

An email-based personal AI agent built on Val Town. Users interact entirely via email — sending messages, tasks, ideas, and memories to memory-do@valtown.email. The agent classifies, stores, executes, and replies.

Architecture

email-agent/
  CLAUDE.md             — This file. System instructions and conventions.
  main.ts               — HTTP endpoint for status/admin/debugging (type: http)
  email-handler.ts      — Email ingestion point (type: email, address: memory-do@valtown.email)
  task-runner.ts        — Interval that processes pending/recurring tasks (type: interval)
  lib/
    classifier.ts       — Claude-powered email classification → unified entries
    scheduler.ts        — Natural language → ISO datetime conversion
    agent.ts            — Task execution engine with per-user system prompt
    attachments.ts      — Inbound/outbound attachment storage (blob + DB)
  db/
    schema.ts           — SQLite schema (unified entries + FTS5 + entry links + migration)
    queries.ts          — Unified CRUD, FTS search, user context builder

How It Works

Email Flow (email-handler.ts)

Email arrives at memory-do@valtown.email
Sender is identified/created as a user
Interaction is logged as an entry (type: interaction)
Email is classified by Claude into unified entries — any type: memories, tasks, ideas, people, meetings, projects, or anything new
System instructions are extracted and persisted to the user's personal system prompt
All entries are stored in the unified entries table with FTS synced
Entries are cross-linked to each other and to the source interaction
Instant tasks execute immediately via the agent; results are emailed back
Future/recurring tasks are queued with computed next_run_at timestamps
Acknowledgment email is sent summarizing what was captured, with follow-up questions

Task Runner (task-runner.ts)

Runs on an interval (every 15 minutes)
Finds task entries where next_run_at <= now and status = 'pending'
Executes each via the agent with full user context (including per-user system prompt)
Emails results to the user
Recurring tasks get rescheduled to their next occurrence
Failed tasks still get rescheduled if recurring

HTTP Admin (main.ts)

GET / or GET /stats — system-wide counts, entries by type, link counts
GET /user/:email — full user profile with entries grouped by type and links
GET /migrate — one-time migration from old tables to unified schema

Database

Uses project-scoped SQLite (std/sqlite/main.ts). Do NOT change to std/sqlite or std/sqlite/global.ts — this project uses its own database, not the user's global one.

Unified Model

Everything is an entry with a type column. No new tables needed for new content types — just use a new type string.

Tables

users — unique sender emails, optional name, per-user system_prompt (their personal "CLAUDE.md")
entries — all content: memories, tasks, ideas, interactions, people, meetings, projects, and any future type
entry_links — relationship graph between entries (cross-references, co-extraction, parent-child, etc.)

Entry Types (non-exhaustive — new types can be added without schema changes)

Type	Purpose	Key metadata fields
`memory`	Long-term facts/preferences	`{ category: "preference" \| "fact" \| "context" \| "general" }`
`task`	Actionable items	`{ task_type: "instant" \| "future" \| "recurring", result: "..." }`
`idea`	Captured ideas (explicit only)	`{ status: "seed" \| "exploring" \| ... }`
`interaction`	Raw email log	`{ direction: "inbound" \| "outbound" }`
`person`	People mentioned by the user	`{ relationship: "...", contact: { ... } }`
`meeting`	Scheduled conversations	`{ with: [...], date: "...", agenda: [...] }`
`project`	Active codebases/builds	`{ repo: "...", url: "...", tech: [...] }`

Entry Fields

id — primary key
user_id — owner
type — the content type (any string)
status — lifecycle: active, pending, in_progress, completed, failed, archived
title — optional, used by tasks/ideas/meetings
content — the main body text (always present)
tags — comma-separated, searchable via FTS
metadata — JSON blob for type-specific fields
source_entry_id — which entry spawned this (creates a provenance chain)
next_run_at — for scheduled entries (tasks, meeting reminders)
schedule_description — natural language schedule for recurring entries
created_at, updated_at — timestamps

Entry Links (the Graph)

The entry_links table creates a relationship graph between entries:

from_entry_id → to_entry_id with a relationship label
Relationships: related, created, co_extracted, parent, blocks, became, etc.
Enables: "This idea became this project", "These entries were extracted from the same email", "This person is connected to this meeting"

FTS5

Single entries_fts table covers all entry types. Indexes title, content, and tags. Synced manually on every insert in queries.ts.

Indexes

idx_entries_user_type — entries by user + type
idx_entries_user_status — entries by user + status
idx_entries_pending_run — partial index for due task lookup
idx_entries_type_created — entries by type + creation date
idx_entry_links_from/to — link traversal

Per-User System Prompt

Each user has a system_prompt column on the users table — their personal "CLAUDE.md" that controls how the agent behaves for them.

How it works:

User emails: "Be more casual and always use emojis. Call me Paul."
Classifier extracts system_instructions: ["Use casual, informal tone with emojis", "Address user as Paul"]
Email handler calls updateSystemPrompt() — appends new instructions, deduplicates
Every future agent call loads this prompt and injects it into the system prompt as "User's Personal Instructions"
User can see their prompt via the admin endpoint, or say "show me my preferences"

What it captures:

Tone/style preferences ("be concise", "use bullet points")
Name/addressing preferences ("call me Captain")
Format preferences ("always include a summary section")
Any standing instruction the user gives

Attachments

Full bidirectional attachment support — receive files from users and send generated files back.

Inbound (Receiving)

Email attachments arrive as File objects in email.attachments[]
Each is stored in blob storage with key attachments/{userId}/{entryId}/{filename}
Metadata is tracked in the attachments table (filename, MIME type, size, blob key, is_text flag)
Text-readable files (txt, csv, json, md, html, xml, code files) have their content extracted and:
- Fed to the classifier as part of the email body
- Available to the agent when executing tasks via getTextAttachmentContents()
Binary files (images, PDFs, zips) are stored but content is not extracted

Outbound (Generating)

The agent can generate files by including <attachment> tags in its response:

<attachment filename="report.csv" mime_type="text/csv">
col1,col2
val1,val2
</attachment>

These are parsed from the response, stored in blob storage, and sent as email attachments
Both email-handler.ts (instant tasks) and task-runner.ts (scheduled tasks) handle outbound attachments
Outbound interaction entries record attachment_count and attachment_names in metadata

Storage

Blob storage: @valtown/sdk blob API — actual file content
attachments table: metadata linking blobs to entries and users
Max inbound text extraction: 100KB per file (configurable in lib/attachments.ts)

Tables

attachments (
  id, entry_id, user_id, filename, mime_type, size_bytes,
  blob_key, is_text, created_at
)

Environment Variables

ANTHROPIC_API_KEY — required. Used by all Claude API calls (classifier, scheduler, agent). Accessed implicitly by the Anthropic SDK.

Dependencies

npm:@anthropic-ai/sdk@0.39.0 — Claude API client (pin the version)
https://esm.town/v/std/sqlite/main.ts — project-scoped SQLite
https://esm.town/v/std/email — Val Town email sending

Conventions

Code Style

TypeScript everywhere (.ts files only)
File and val names in kebab-case (e.g., email-handler.ts, task-runner.ts)
Interfaces for all data shapes crossing boundaries (API responses, DB rows, function params)
Always use parameterized queries (args) — never interpolate user input into SQL
Emoji prefixes in console.log for easy scanning: 📧 📝 🧠 📊 💾 💡 ⏰ 🔄 ⚡ ✅ ❌ 📤 🔧 📋 🧬 👤

Error Handling

Every email processing step is wrapped in try/catch
Task failures still send a notification email to the user
Recurring tasks get rescheduled even on failure
FTS search failures are non-fatal (empty tables can cause issues)
JSON parse failures in classifier/scheduler fall back to sensible defaults

LLM Calls

All Claude calls use claude-sonnet-4-20250514 — keep this consistent
System prompts are defined as module-level constants (not inline)
Per-user system prompt is injected dynamically from the users.system_prompt column
Responses are parsed via regex (/\{[\s\S]*\}/) to extract JSON from potential markdown code blocks
Always have a fallback if JSON parsing fails

Database Patterns

initDatabase() is called at the top of every entry point — safe to call multiple times (CREATE TABLE IF NOT EXISTS)
FTS inserts happen immediately after main table inserts in addEntry()
Use Promise.all for independent parallel queries
getUserContext() is the canonical way to build a user context bundle for the agent
metadata is stored as JSON string, parsed in parseEntryRow()

Classifier Output

Returns a unified entries[] array — each entry has a type and optional metadata
system_instructions[] — persistent instructions extracted for the user's prompt
follow_up_questions[] — natural follow-ups included in the acknowledgment email
New entry types can be output by the classifier without any code changes
Voice input tolerance: the classifier prompt explicitly handles transcription errors

Working Style

Don't break the email flow — email-handler.ts is the critical path. Changes there should be tested carefully. A broken email handler means users get no response.
Idempotent schema — initDatabase() must always be safe to re-run. Use IF NOT EXISTS on everything.
Keep classifier prompts tight — The classification prompt directly controls what gets stored. Be precise about what counts as each type. Over-extraction creates noise.
Ideas require explicit intent — The classifier is specifically instructed to only capture ideas the user explicitly asks to store. Don't weaken this.
Agent has full context — When executing tasks, the agent gets memories, active tasks, recent ideas, recent conversations, FTS search results, AND the user's personal system prompt.
Multiple Claude calls per task execution — The agentic loop may make several calls (one initial + N tool-use rounds + 1 subject line). Log toolCallCount and iterationCount to monitor costs. The subject line is still a separate call.
Voice input tolerance — Users may send emails via voice transcription. Expect imperfect input. Interpret intent rather than demanding precision.
Cross-link everything — Every entry should be linked to its source and to related entries. The graph is what makes context retrieval powerful.
Follow-up questions — After processing, include 1-2 natural follow-ups in the reply to help deepen the conversation. Don't force them.

Agentic Tool-Use Loop

The agent (lib/agent.ts) uses an iterative tool-use loop, not a one-shot request-reply.

How it works:

Build initial context (memories, tasks, ideas, conversations, attachments, FTS search)
Send to Claude with tool definitions — Claude can call tools mid-execution
Loop: If Claude returns tool_use blocks, execute each tool and feed results back
Repeat until Claude returns a final end_turn response (or max 15 iterations)
If max iterations hit, ask Claude to wrap up with what it has

Available Tools:

Tool	Purpose	DB function used
`search_memories`	Full-text search across all entry types	`searchEntries()`
`get_entries`	Browse entries by type/status	`getEntries()`
`get_linked_entries`	Traverse the entry relationship graph	`getLinkedEntries()`
`store_memory`	Save a new memory discovered during task execution	`addEntry()`
`create_task`	Create follow-up/sub-tasks (instant, future, recurring)	`addEntry()` + `parseSchedule()`
`fetch_url`	Fetch live data from the web (news, APIs, weather, etc.)	`fetch()`

Constants:

MAX_TOOL_ITERATIONS = 15 — safety limit on the loop
fetch_url timeout: 15 seconds, response limit: 50KB
HTML responses are auto-stripped of tags for readability

AgentResult includes:

response — the final text response
suggestedSubject — email subject line (separate Claude call)
attachments — generated file attachments
toolCallCount — total number of tool calls made
iterationCount — total number of loop iterations

Known Limitations & Future Improvements

No conversation threading — Each email is processed independently. The agent has recent interaction history but doesn't maintain true conversation threads.
No email HTML rendering — Replies are plain text only.
No auth on HTTP admin — The endpoints are open. Add authentication if this becomes public-facing.
No rate limiting — No protection against flooding.
No memory deduplication — Same fact can be stored multiple times. Could use semantic similarity to deduplicate.
Single Claude model — All calls use the same model. Could use a lighter model for scheduling/subject lines.
No system prompt reset — User can add instructions but can't easily remove or reset them yet. Add "reset my preferences" as a recognized command.

Evolving the System

This agent should grow organically based on usage patterns. When you notice recurring needs:

New entry types need zero code changes — just use a new type string in the classifier. The unified model handles it.
Refine classification — If the classifier is miscategorizing things, tighten the prompt. Log examples of misclassifications.
New link relationships — If new relationships emerge between entries, add relationship types to entry_links.
Proactive suggestions — The agent should notice patterns and suggest improvements, like the journal system's "Claude Tasks" concept.
Keep this file updated — When the system changes, update CLAUDE.md. This is the source of truth.

Migration

The system was migrated from separate tables (memories, tasks, ideas, interactions) to a unified entries table. The old tables still exist but are no longer used. The /migrate endpoint runs the migration. Old FTS tables (memories_fts, tasks_fts, ideas_fts, interactions_fts) are superseded by entries_fts.

paulkinlan

email-agent

Email Agent System

Architecture

How It Works

Email Flow (email-handler.ts)

Task Runner (task-runner.ts)

HTTP Admin (main.ts)

Database

Unified Model

Tables

Entry Types (non-exhaustive — new types can be added without schema changes)

Entry Fields

Entry Links (the Graph)

FTS5

Indexes

Per-User System Prompt

How it works:

What it captures:

Attachments

Inbound (Receiving)

Outbound (Generating)

Storage

Tables

Environment Variables

Dependencies

Conventions

Code Style

Error Handling

LLM Calls

Database Patterns

Classifier Output

Working Style

Agentic Tool-Use Loop

How it works:

Available Tools:

Constants:

AgentResult includes:

Known Limitations & Future Improvements

Evolving the System

Migration