Last Updated: 2026-02-02
Status: Production-ready v4 (with Browser CLI)
SlimArmor is a mini vector database for Val Town that provides semantic search capabilities using SQLite with libSQL/Turso vector extensions.
- Stores text with AI-generated embeddings (4096 dimensions by default)
- Enables semantic search (search by meaning, not keywords)
- Returns distance scores for ranking results
- Supports any OpenAI-compatible embedding API
```
┌─────────────────────────────────────────────────────────────┐
│                      HTTP API (api.ts)                      │
│     /upsert, /search, /delete, /stats, /calibrate, etc.     │
└─────────────────────────────────────────────────────────────┘
                               │
                               ▼
┌─────────────────────────────────────────────────────────────┐
│                Vector DB Core (vectordb.ts)                 │
│  - setup(), upsert(), search(), remove(), stats()           │
│  - Content hash check (skip re-embedding unchanged text)    │
│  - Dimension assertion (fail fast on mismatch)              │
└─────────────────────────────────────────────────────────────┘
             │                                 │
             ▼                                 ▼
┌──────────────────────────┐   ┌────────────────────────────────┐
│   Embedding Provider     │   │   Val Town SQLite (Turso)      │
│   (OpenAI-compatible)    │   │   - F32_BLOB vector columns    │
│   - Nebius (default)     │   │   - libsql_vector_idx (DiskANN)│
│   - OpenAI               │   │   - vector_top_k queries       │
│   - OpenRouter           │   │   - vector_distance_cos        │
│   - Custom               │   └────────────────────────────────┘
└──────────────────────────┘
```
A terminal-style web interface for interacting with the vector database.
Features:
- Monospace terminal aesthetic (GitHub dark theme)
- Command history (↑/↓ arrows)
- Clickable IDs in results
- Color-coded distance scores (green=good, orange=medium, red=poor)
- Session-based auth token storage
- Mobile responsive
Access: GET /ui
The main library that can be imported into other vals.
Key exports:
- `setup()` - Creates table and index
- `upsert(id, text, meta?)` - Insert/update with smart re-embedding
- `search(query, k?, maxDistance?)` - Semantic search
- `remove(id)` - Delete record
- `stats()` - Count and storage estimate
- `get(id)` - Get single record
- `listIds(limit?)` - List all IDs
- `reindex()` - Recreate index
- `getProviderInfo()` - Current embedding config
Configuration via env vars:
- `EMBEDDING_PROVIDER` - Preset: `nebius`, `openai`, `openrouter`
- `EMBEDDING_API_URL` - Custom API URL
- `EMBEDDING_API_KEY` - API key
- `EMBEDDING_MODEL` - Model name
- `EMBEDDING_DIM` - Vector dimensions
RESTful API layer with admin/testing tools.
Core endpoints:
- `POST /upsert` - Insert/update record
- `POST /search` - Semantic search
- `POST /delete` - Delete record
- `GET /get?id=...` - Get record
- `GET /list` - List IDs
Admin endpoints:
- `GET /` - API info + provider config
- `GET /ping` - Health check
- `GET /stats` - Detailed storage stats
- `GET /seed?n=100` - Seed synthetic data
- `GET /calibrate?q=...` - Threshold suggestions
- `POST /reindex` - Recreate index
- `POST /clear?confirm=yes` - Delete all records
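A minimal client-side sketch of calling the search endpoint. The field names (`query`, `k`, `maxDistance`) are assumed from this document, and any auth header the deployment requires is omitted:

```typescript
// Hypothetical helper: build the POST /search request for the HTTP API.
// Field names (query, k, maxDistance) are assumptions from this doc.
function buildSearchRequest(
  baseUrl: string,
  query: string,
  k = 10,
  maxDistance?: number,
): Request {
  return new Request(new URL("/search", baseUrl).toString(), {
    method: "POST",
    headers: { "content-type": "application/json" },
    body: JSON.stringify({ query, k, maxDistance }),
  });
}

const req = buildSearchRequest(
  "https://kamenxrider--95fbe492ffe111f0bee942dde27851f2.web.val.run",
  "machine learning",
  5,
  0.64,
);
// const hits = await (await fetch(req)).json();
```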
```sql
CREATE TABLE vectordb (
  id         TEXT PRIMARY KEY,
  text       TEXT NOT NULL,
  text_hash  TEXT NOT NULL,     -- SHA-256 for change detection
  embedding  F32_BLOB(4096),    -- Vector column (dimension varies by provider)
  meta_json  TEXT,              -- Optional JSON metadata
  updated_at INTEGER NOT NULL   -- Unix timestamp (ms)
);

CREATE INDEX vectordb_embedding_idx
  ON vectordb (libsql_vector_idx(embedding, 'metric=cosine', 'max_neighbors=64', 'compress_neighbors=float8'));
```
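Searches against this index follow Turso's documented `vector_top_k` pattern: the table-valued function walks the DiskANN index and returns row ids, which are joined back to the table and scored with `vector_distance_cos`. A sketch of that query shape (the actual SQL in `vectordb.ts` may differ in detail):

```typescript
// ANN search query shape per the Turso/libSQL vector docs:
// vector_top_k() yields candidate rowids; the join re-scores them.
// Bindings: [queryVecJson, queryVecJson, k], where queryVecJson is
// JSON.stringify(embedding), e.g. "[0.12, -0.03, ...]".
const SEARCH_SQL = `
  SELECT v.id, v.text, v.meta_json,
         vector_distance_cos(v.embedding, vector32(?)) AS distance
  FROM vector_top_k('vectordb_embedding_idx', vector32(?), ?) AS t
  JOIN vectordb AS v ON v.rowid = t.id
  ORDER BY distance ASC
`;
```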
We tested and applied these Turso-documented optimizations:
| Setting | Value | Why |
|---|---|---|
| `metric=cosine` | Cosine distance | Standard for text embeddings |
| `max_neighbors=64` | 64 neighbors | Down from default ~192; saves storage |
| `compress_neighbors=float8` | 1 byte/dim | ~75% less index storage |
Trade-off: Slightly lower recall accuracy, significantly lower storage.
| Metric | Value |
|---|---|
| Storage per record | ~22 KB |
| Estimated max records/GB | ~47,500 |
| Embedding latency (Nebius) | ~460ms |
| Search latency | <100ms |
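The storage numbers above follow from simple arithmetic: a 4096-dimension float32 vector alone is 16 KB, and text, hash, metadata, and the float8-compressed index node make up the rest of the ~22 KB per record:

```typescript
// Rough arithmetic behind the table above (22 KB/record is the observed
// figure, not derived here).
const embeddingBytes = 4096 * 4;           // 16384 bytes for the raw vector
const perRecordBytes = 22 * 1024;          // ~22 KB observed per record
const recordsPerGB = Math.floor(1024 ** 3 / perRecordBytes);
console.log(recordsPerGB); // ≈ 47,500 as quoted above
```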
From calibration with "machine learning" query:
- Min: 0.46 (highly relevant)
- Median: 0.64
- Max: 0.67 (least relevant in top 20)
Recommended thresholds:
- Tight: 0.5 (top 3 only)
- Balanced: 0.64 (top 10)
- Loose: 0.7 (include all)
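One way `/calibrate` can derive such thresholds is by taking percentiles over the distances of a sample query's top-k results. This is a hypothetical sketch of that idea, not the exact implementation; the percentile choices are illustrative:

```typescript
// Hypothetical calibration: derive threshold suggestions from the sorted
// distance distribution of a sample query's results.
function calibrate(distances: number[]) {
  const d = [...distances].sort((a, b) => a - b);
  const pct = (p: number) => d[Math.min(d.length - 1, Math.floor(p * d.length))];
  return {
    min: d[0],
    median: pct(0.5),
    max: d[d.length - 1],
    tight: pct(0.15),    // keep only the closest few hits
    balanced: pct(0.5),  // roughly the top half
  };
}

const c = calibrate([0.46, 0.5, 0.52, 0.6, 0.62, 0.64, 0.65, 0.66, 0.67, 0.67]);
// With this sample, min/median/max match the figures above (0.46 / 0.64 / 0.67).
```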
- **Single embedding dimension** - The table is created with a fixed dimension; changing providers requires clearing data.
- **No chunking** - Each record = one embedding; long documents must be pre-chunked by the user.
- **No hybrid search** - Pure vector search, no FTS fallback. Could be added later.
- **Sync embedding calls** - Each upsert calls the embedding API synchronously; batch support is not implemented.
- **No pagination** - Search returns up to `k` results, with no cursor-based pagination.
If continuing development, consider:
- **Chunking support** - Auto-split long documents, store as `docId::chunkN`
- **Hybrid search** - Add an FTS5 table, merge vector + keyword results
- **Batch embeddings** - Embed multiple texts in one API call
- **Background indexing** - Queue-based async embedding
- **Metadata filtering** - SQL WHERE clauses on `meta_json` fields
- **Multi-index** - Support different embedding models in the same DB
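The chunking idea above could start as simply as a fixed-size splitter that derives ids in the proposed `docId::chunkN` scheme. A naive sketch (real chunking would split on sentence or token boundaries, not raw character offsets):

```typescript
// Naive sketch of the proposed chunking scheme: split text into fixed-size
// pieces and derive ids as `docId::chunkN` for individual upserts.
function chunkDocument(
  docId: string,
  text: string,
  size = 800,
): Array<{ id: string; text: string }> {
  const out: Array<{ id: string; text: string }> = [];
  for (let i = 0, n = 0; i < text.length; i += size, n++) {
    out.push({ id: `${docId}::chunk${n}`, text: text.slice(i, i + size) });
  }
  return out;
}

const chunks = chunkDocument("doc-42", "x".repeat(2000), 800);
// -> ids doc-42::chunk0, doc-42::chunk1, doc-42::chunk2
```

Search results could then be grouped back to the parent document by splitting ids on `::`.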
- `GET /test` - Inserts 5 demo records, runs searches, shows results.
- `GET /seed?n=1000` - Seeds 1000 synthetic records (takes ~8 minutes).
- `GET /calibrate?q=your+query` - Analyzes the distance distribution and suggests thresholds.
| Variable | Required | Default | Description |
|---|---|---|---|
| `EMBEDDING_PROVIDER` | No | `nebius` | Preset: `nebius`, `openai`, `openrouter` |
| `NEBIUS_API_KEY` | If nebius | - | Nebius API key |
| `OPENAI_API_KEY` | If openai | - | OpenAI API key |
| `OPENROUTER_API_KEY` | If openrouter | - | OpenRouter API key |
| `EMBEDDING_API_URL` | No | (from preset) | Custom API URL |
| `EMBEDDING_API_KEY` | No | - | Generic API key fallback |
| `EMBEDDING_MODEL` | No | (from preset) | Override model name |
| `EMBEDDING_DIM` | No | (from preset) | Override dimensions |
- **Dimension mismatch error** - The provider returned a different dimension than expected. Check that the `EMBEDDING_DIM` env var matches your model.
- **Missing API key** - Set the appropriate env var for your provider.
- **Irrelevant search results** - Lower `maxDistance` (try 0.5 instead of 0.7).
- **Slow upserts** - Normal: each insert requires an embedding API call (~460 ms). Batch support is not implemented.
- **Switching providers or models** - Clear data with `POST /clear?confirm=yes` and re-insert.
- TypeScript throughout
- Proper error handling with typed errors
- Parameterized SQL (no injection risk)
- Content hash prevents unnecessary re-embedding
- Dimension assertion fails fast on mismatch
- 30s timeout on embedding API calls
- AbortController for cancellation
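The dimension assertion above is worth making concrete, since it is the main guard against silently storing vectors that do not fit the `F32_BLOB(4096)` column. A sketch (names hypothetical; the real check lives in `vectordb.ts`):

```typescript
// Sketch of the fail-fast dimension assertion: reject an embedding whose
// length differs from the table's F32_BLOB dimension before any SQL runs.
const EXPECTED_DIM = 4096; // from EMBEDDING_DIM / the provider preset

function assertDimension(embedding: number[], expected = EXPECTED_DIM): void {
  if (embedding.length !== expected) {
    throw new Error(
      `Embedding dimension mismatch: got ${embedding.length}, ` +
        `expected ${expected}. Check EMBEDDING_DIM against your model.`,
    );
  }
}

assertDimension(new Array(4096).fill(0)); // passes silently
```

Failing here, rather than on insert, yields a clear error message instead of an opaque SQLite type error.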
- Verified Val Town SQLite supports vectors - F32_BLOB, libsql_vector_idx, vector_top_k all work
- Tested Nebius embedding API - Qwen3-Embedding-8B returns 4096 dims
- Built core vectordb.ts - upsert, search, delete, stats
- Added optimizations - compress_neighbors=float8, max_neighbors=64
- Added distance scores - Returns cosine distance in results
- Added maxDistance filter - Filter out low-relevance results
- Added admin tools - /seed, /calibrate, /stats, /clear
- Made multi-provider - Nebius, OpenAI, OpenRouter, custom
- Documented everything - README, GUIDE, HANDOVER
- Val: https://www.val.town/x/kamenxrider/slimarmor
- Endpoint: https://kamenxrider--95fbe492ffe111f0bee942dde27851f2.web.val.run
- Module: https://esm.town/v/kamenxrider/slimarmor/vectordb.ts
This document is for the next developer/AI continuing work on SlimArmor.