🛡️ SlimArmor - Mini Vector DB for Val Town

A lightweight, optimized vector database built on Val Town's SQLite (powered by Turso/libSQL). Supports any OpenAI-compatible embedding provider.

Features

✅ Semantic search with cosine similarity + distance scores
✅ Multi-provider support - Nebius, OpenAI, OpenRouter, or any OpenAI-compatible API
✅ Smart re-embedding - only re-embeds when text changes (content hash)
✅ Optimized storage - float8 compression, tuned DiskANN index
✅ Scale testing tools - seed data, calibrate thresholds, detailed stats
✅ Metadata filtering - filter by meta fields on search
✅ Hybrid search - vector + keyword boosting
✅ Batch upserts - submit many records in one call
✅ Chunking helper - split long docs into chunks
✅ Export/import - migrate data between providers
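
The "smart re-embedding" feature above can be pictured with a small sketch. This is not SlimArmor's actual code: the function names are hypothetical, and the hash is a simple 31-multiplier polynomial string hash used as a stand-in, since this README does not say which hash the library uses. The idea is to store a hash next to each record and skip the embedding call when the hash of the incoming text matches.

```typescript
// Hypothetical sketch of hash-based change detection (not the library's real code).
// Stand-in hash: 31-multiplier polynomial over UTF-16 code units, kept in 32 bits.
function contentHash(text: string): string {
  let hash = 0;
  for (let i = 0; i < text.length; i++) {
    // ">>> 0" wraps the running value back into the unsigned 32-bit range.
    hash = (hash * 31 + text.charCodeAt(i)) >>> 0;
  }
  return hash.toString(16);
}

// Returns true when the record is new or its text changed since the last upsert.
function needsEmbedding(storedHash: string | null, text: string): boolean {
  return storedHash === null || storedHash !== contentHash(text);
}
```

A production version would more likely use a cryptographic hash such as SHA-256 to make accidental collisions negligible.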

Supported Embedding Providers

Provider	Env Vars	Default Model	Dimensions
Nebius (default)	`NEBIUS_API_KEY`	Qwen3-Embedding-8B	4096
OpenAI	`EMBEDDING_PROVIDER=openai` + `OPENAI_API_KEY`	text-embedding-3-small	1536
OpenRouter	`EMBEDDING_PROVIDER=openrouter` + `OPENROUTER_API_KEY`	openai/text-embedding-3-small	1536
Custom	See below	(configurable)	(configurable)

Custom Provider Configuration

EMBEDDING_API_URL=https://your-api.com/v1/embeddings
EMBEDDING_API_KEY=your-key
EMBEDDING_MODEL=your-model-name
EMBEDDING_DIM=1536
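
For reference, "OpenAI-compatible" here means the endpoint accepts the OpenAI embeddings request shape (`model` plus `input`) and returns vectors under `data[].embedding`. The sketch below builds such a request from the variables above and validates the response length against `EMBEDDING_DIM`. The function and interface names are hypothetical and this is not SlimArmor's internal code; it only illustrates what a custom provider has to support.

```typescript
// Hypothetical helper types and names; only the request/response shape is standard.
interface EmbeddingEnv {
  EMBEDDING_API_URL: string;
  EMBEDDING_API_KEY: string;
  EMBEDDING_MODEL: string;
}

// Builds (but does not send) an OpenAI-style embeddings request.
function buildEmbeddingRequest(env: EmbeddingEnv, texts: string[]) {
  return {
    url: env.EMBEDDING_API_URL,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        "Authorization": `Bearer ${env.EMBEDDING_API_KEY}`,
      },
      body: JSON.stringify({ model: env.EMBEDDING_MODEL, input: texts }),
    },
  };
}

// Extracts vectors from an OpenAI-style response and checks their length.
function parseEmbeddingResponse(
  json: { data: { embedding: number[] }[] },
  expectedDim: number,
): number[][] {
  return json.data.map((item, i) => {
    if (item.embedding.length !== expectedDim) {
      throw new Error(
        `embedding ${i} has ${item.embedding.length} dims, expected ${expectedDim}`,
      );
    }
    return item.embedding;
  });
}
```

The request object can be passed to `fetch(req.url, req.init)`; a dimension mismatch usually means `EMBEDDING_DIM` does not match the chosen model.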

Quick Stats (Nebius/4096 dims)

Metric	Value
Storage per record	~22 KB
Max records per 1GB	~47,500
Avg embedding time	~460ms
Recommended maxDistance	0.6 - 0.65
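
The capacity figure above can be reproduced with a quick calculation. The sketch assumes binary units (1 GB = 1024³ bytes, 1 KB = 1024 bytes) and, for the raw vector size, that embeddings are stored as 4-byte floats; both are assumptions, not something this README states. The function names are illustrative only.

```typescript
const KIB = 1024;
const GIB = 1024 * 1024 * 1024;

// Raw size of one embedding, assuming 4-byte (float32) components.
function rawVectorBytes(dimensions: number): number {
  return dimensions * 4;
}

// How many records fit in a storage budget at a given per-record cost.
function estimateMaxRecords(budgetBytes: number, bytesPerRecord: number): number {
  return Math.floor(budgetBytes / bytesPerRecord);
}

const vectorKiB = rawVectorBytes(4096) / KIB;           // 16 KiB of the ~22 KB per record
const maxRecords = estimateMaxRecords(GIB, 22 * KIB);   // 47662, in line with ~47,500
```

Under these assumptions the raw vector accounts for about 16 of the ~22 KB per record; the remainder is presumably index, text, and metadata overhead. Providers with 1536-dimension models should need noticeably less space per record.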

API Endpoints

Core API

Method	Endpoint	Description
`POST`	`/upsert`	Insert/update `{id, text, meta?}`, or an array of such records (batch)
`POST`	`/search`	Search `{query, k?, maxDistance?, filters?, hybrid?, offset?}`
`POST`	`/delete`	Delete `{id}`
`GET`	`/get?id=...`	Get single record
`GET`	`/list?limit=...&offset=...&prefix=...`	List record IDs
`POST`	`/upsert_chunked`	Chunk + upsert `{id, text, meta?, chunkSize?, overlap?}`
`GET`	`/export?limit=...&offset=...`	Export records
`POST`	`/import`	Import records (batch upsert)

Admin / Testing

Method	Endpoint	Description
`GET`	`/`	API info + current provider
`GET`	`/ping`	Health check
`GET`	`/stats`	Detailed storage stats
`GET`	`/seed?n=100`	Seed N synthetic records
`GET`	`/calibrate?q=...`	Suggest distance thresholds
`GET`	`/validate`	Self-checks (optional write tests)
`POST`	`/reindex`	Recreate optimized index
`POST`	`/clear?confirm=yes`	Delete ALL records

Usage Examples

Upsert a record

curl -X POST https://YOUR_ENDPOINT/upsert \
  -H "Content-Type: application/json" \
  -d '{"id": "doc-1", "text": "Dogs are loyal pets", "meta": {"category": "animals"}}'

Search

curl -X POST https://YOUR_ENDPOINT/search \
  -H "Content-Type: application/json" \
  -d '{"query": "furry pets", "k": 5, "maxDistance": 0.64}'

Search with filters + hybrid boost

curl -X POST https://YOUR_ENDPOINT/search \
  -H "Content-Type: application/json" \
  -d '{
    "query": "machine learning",
    "k": 10,
    "filters": { "category": "tech" },
    "hybrid": { "enabled": true, "alpha": 0.25 }
  }'
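
How `alpha` blends the two signals is not spelled out in this README. One common approach, shown here purely as an assumption, is a weighted sum of vector similarity (1 minus cosine distance) and a keyword score. The names `keywordScore` and `hybridScore` are hypothetical and the formula may differ from what SlimArmor does.

```typescript
// Fraction of query terms that appear in the text (case-insensitive substring match).
function keywordScore(query: string, text: string): number {
  const terms = query.toLowerCase().split(/\s+/).filter((t) => t.length > 0);
  if (terms.length === 0) return 0;
  const haystack = text.toLowerCase();
  const hits = terms.filter((t) => haystack.indexOf(t) !== -1).length;
  return hits / terms.length;
}

// Assumed blend: alpha = 0 is pure vector ranking, alpha = 1 is pure keyword ranking.
function hybridScore(distance: number, keyword: number, alpha: number): number {
  const similarity = 1 - distance;
  return (1 - alpha) * similarity + alpha * keyword;
}
```

With `alpha: 0.25`, a result at distance 0.5 that contains every query term would score 0.625 under this formula, versus 0.375 for one at the same distance with no keyword matches.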

Pagination

curl -X POST https://YOUR_ENDPOINT/search \
  -H "Content-Type: application/json" \
  -d '{"query": "notes", "k": 10, "offset": 10}'

Batch upsert

curl -X POST https://YOUR_ENDPOINT/upsert \
  -H "Content-Type: application/json" \
  -d '[
    {"id":"doc-1","text":"A short note","meta":{"category":"notes"}},
    {"id":"doc-2","text":"Another note","meta":{"category":"notes"}}
  ]'

Chunked upsert

curl -X POST https://YOUR_ENDPOINT/upsert_chunked \
  -H "Content-Type: application/json" \
  -d '{"id":"doc-long","text":"...long text...","chunkSize":800,"overlap":100}'
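
To show what `chunkSize` and `overlap` mean, here is a minimal character-based sliding-window chunker. It assumes both values are measured in characters and that each chunk repeats the last `overlap` characters of the previous one; the real helper may split on tokens or sentence boundaries instead, and `chunkText` is a hypothetical name.

```typescript
// Splits text into windows of chunkSize characters, each starting
// (chunkSize - overlap) characters after the previous one.
function chunkText(text: string, chunkSize = 800, overlap = 100): string[] {
  if (chunkSize <= 0 || overlap < 0 || overlap >= chunkSize) {
    throw new Error("require chunkSize > 0 and 0 <= overlap < chunkSize");
  }
  const chunks: string[] = [];
  const step = chunkSize - overlap;
  for (let start = 0; start < text.length; start += step) {
    chunks.push(text.slice(start, start + chunkSize));
    // Stop once a window has reached the end of the text.
    if (start + chunkSize >= text.length) break;
  }
  return chunks;
}

const demoChunks = chunkText("abcdefghij", 4, 1); // ["abcd", "defg", "ghij"]
```

Larger overlaps preserve more context across chunk boundaries at the cost of more chunks, and therefore more embedding calls and storage.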

Export / Import

curl "https://YOUR_ENDPOINT/export?limit=200&offset=0"

curl -X POST https://YOUR_ENDPOINT/import \
  -H "Content-Type: application/json" \
  -d '{"records":[{"id":"doc-1","text":"hello"}]}'

Calibrate threshold

curl "https://YOUR_ENDPOINT/calibrate?q=machine+learning"

Distance Score Guide

Distance	Meaning	Recommendation
0.0 - 0.4	Very similar	Always include
0.4 - 0.6	Related	Include (tight mode)
0.6 - 0.7	Somewhat related	Include (balanced mode)
0.7+	Likely unrelated	Filter out

Default: Use maxDistance: 0.64 for balanced results.
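
For intuition about these numbers: cosine distance is 1 minus cosine similarity, so identical directions give 0, orthogonal vectors give 1, and opposite vectors give 2. The sketch below computes it and maps a distance onto the bands in the table. The function names are hypothetical and the handling of exact boundary values (for example, whether 0.6 counts as "related" or "somewhat related") is a choice made here, not something the table specifies.

```typescript
// Cosine distance = 1 - (a · b) / (|a| * |b|).
function cosineDistance(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("vectors must have equal length");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) throw new Error("zero vector has no direction");
  return 1 - dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Maps a distance to the bands in the table (lower bound inclusive, by assumption).
function describeDistance(distance: number): string {
  if (distance < 0.4) return "very similar";
  if (distance < 0.6) return "related";
  if (distance < 0.7) return "somewhat related";
  return "likely unrelated";
}
```

These bands were measured for one model; run `/calibrate` against your own data before relying on them with a different provider.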

Import as Module

import * as db from "https://esm.town/v/kamenxrider/slimarmor/vectordb.ts";

// Check provider configuration
console.log(db.getProviderInfo());
// → { provider: "nebius", model: "Qwen3-Embedding-8B", dimensions: 4096, ... }

// Setup (creates table + index)
await db.setup();

// Upsert
await db.upsert("doc-1", "Your text here", { category: "notes" });

// Search
const results = await db.search("search query", 10, 0.64);
// → [{ id, text, meta, distance }, ...]

// Delete
await db.remove("doc-1");

// Stats
const stats = await db.stats();
// → { count: 105, estimated_storage_mb: 2.26 }

Switching Providers

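⚠️ Important: Embeddings from different providers or models are not comparable (they differ in dimensions and vector space), so existing vectors cannot be searched with a new model. If you switch:

1. Export any data you need: GET /export
2. Clear the database: POST /clear?confirm=yes
3. Update environment variables
4. Re-insert your data (POST /import or /upsert) so it is embedded with the new provider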

Tip: You can set EMBEDDING_DIM=auto to auto-detect dimensions on first setup.

Validation (Built-in Self-Checks)

Use the self-check endpoint to validate core behavior:

curl https://YOUR_ENDPOINT/validate

Optional write tests require an admin token, an embedding API key, and ALLOW_WRITE_TESTS=1:

curl -H "Authorization: Bearer $ADMIN_TOKEN" \
  "https://YOUR_ENDPOINT/validate?write=yes"

If you can’t send headers (for example, from an embedded preview), temporarily set ALLOW_WRITE_TESTS_NOAUTH=1, call the endpoint without the header, and unset the variable afterwards:

curl "https://YOUR_ENDPOINT/validate?write=yes"

Latest validation run (2026-02-03):

Status: OK in 3597 ms
Provider: Nebius Qwen/Qwen3-Embedding-8B (4096 dims)
Tests: auth, stats, list, search passed; write tests passed (write_upsert, filter_search, hybrid_search, write_delete)

Index Optimizations

SlimArmor uses optimized DiskANN settings for storage efficiency:

Setting	Value	Effect
`metric`	cosine	Cosine similarity
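`max_neighbors`	64	~67% fewer neighbors than the libSQL default (3√D, i.e. 192 at 4096 dims)
`compress_neighbors`	float8	~75% less storage for neighbor vectors vs float32 (1 byte per dimension instead of 4)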

Index Tuning (Optional)

INDEX_METRIC=cosine            # cosine (default) or l2
INDEX_MAX_NEIGHBORS=64         # 8-256
INDEX_COMPRESS_NEIGHBORS=float8 # float8 (default), float16, floatb16, float32, float1bit, or none

# DiskANN knobs (optional; omit to use libSQL defaults)
INDEX_ALPHA=1.2                # >= 1; lower = sparser graph (faster, less accurate)
INDEX_SEARCH_L=200             # query-time effort (higher = more accurate, slower)
INDEX_INSERT_L=70              # insert-time effort (higher = better graph, slower inserts)

After changing index settings, rebuild the index:

curl -X POST -H "Authorization: Bearer $ADMIN_TOKEN" https://YOUR_ENDPOINT/reindex
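
For readers curious how these settings reach the database: libSQL takes index options as quoted `key=value` strings passed to `libsql_vector_idx(...)` inside `CREATE INDEX`. The sketch below assembles such a statement. The table, column, and index names are hypothetical, and treating `none` as "omit the parameter" is an assumption; SlimArmor's actual statement may differ.

```typescript
interface IndexSettings {
  metric: string;            // "cosine" or "l2"
  maxNeighbors: number;
  compressNeighbors: string; // e.g. "float8"; "none" is assumed to mean "omit"
  alpha?: number;
  searchL?: number;
  insertL?: number;
}

// Builds a libSQL vector index statement. Identifiers are interpolated as-is,
// so only pass trusted, hard-coded names.
function buildCreateIndexSql(
  indexName: string,
  table: string,
  column: string,
  s: IndexSettings,
): string {
  const params = [`metric=${s.metric}`, `max_neighbors=${s.maxNeighbors}`];
  if (s.compressNeighbors !== "none") {
    params.push(`compress_neighbors=${s.compressNeighbors}`);
  }
  if (s.alpha !== undefined) params.push(`alpha=${s.alpha}`);
  if (s.searchL !== undefined) params.push(`search_l=${s.searchL}`);
  if (s.insertL !== undefined) params.push(`insert_l=${s.insertL}`);
  const args = params.map((p) => `'${p}'`).join(", ");
  return `CREATE INDEX IF NOT EXISTS ${indexName} ON ${table} (libsql_vector_idx(${column}, ${args}))`;
}
```

Because these options are fixed when the index is created, changing them requires dropping and recreating the index, which is what `/reindex` is for.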

Files

File	Description
`vectordb.ts`	Core library - import this in your vals
`api.ts`	HTTP API endpoints
`README.md`	This documentation
`GUIDE.md`	Step-by-step beginner guide
`HANDOVER.md`	Technical handover notes

Tech Stack

Runtime: Val Town (Deno)
Database: Val Town SQLite (Turso/libSQL)
Vector Index: DiskANN with cosine similarity
Embeddings: Any OpenAI-compatible API

License

MIT