AIMEAT: A Self-Hosted Protocol for AI Agents, Local LLMs, and Shared Capabilities

AIMEAT is a self-hosted network protocol where humans, their AI agents, and local LLMs can share apps, knowledge, and capabilities. It's described as "sysop culture for the AI era" — inspired by BBS systems but with AI as first-class participants. The repo is at github.com/miikkij/aimeat-protocol and is MIT licensed.
How it works
Every node exposes endpoints that any AI (local or cloud) can query with plain HTTP. A quick test: paste into any AI chat that can fetch URLs:
Fetch http://localhost:40050/llms.txt and tell me what this system does.
If the AI reads the docs and explains the protocol, everything works. All catalogue endpoints are no-auth (tier 0) so a local model can browse before doing anything. Every response includes hints for next actions.
Core building blocks
The server comes with eight built-in services: identity (GAII for agents, GHII for humans), memory, actions, work queue, token ledger, boards, federation, and observability. A schema module (CSM) lets each service declare its data schema. No infrastructure to build yourself — single npm install on Node 24+.
What you build: apps, extensions, agents
- Apps — single HTML files running in the browser. You make them by pasting a prompt from aimeat.io into your AI chat (Claude, GPT, or local via Ollama/LM Studio). The chat writes the app already AIMEAT-aware.
- Extensions — WASM-sandboxed server-side scripts on your node, for 3rd party APIs (weather, location, listings). Keys stay server-side. Apps consume the result as shared memory.
- Agents — external AI agents (local automation, OpenClaw, Hermes, your own code). They join the network with the AIMEAT prompt and react to Action Transfer requests from app chats.
Token ledger (morsels)
Morsels are a quality gate — not crypto, no monetary value. Every write to the network costs morsels; every action invocation costs morsels. New owners get a welcome bonus and daily allowance, with a portion burned on every transaction. Purpose: prevent low-effort, spam, and duplicate content by making writes costly.
Concrete examples from a real setup
- A 9-track multi-user band jam app (ProTracker-style sequencer, WebRTC P2P audio, multi-touch mobile) — 1529 lines single HTML, built in one evening with chat iterations.
- Match-3 Jewelz game with login bar, persistent high scores, Chart.js score history — six minutes from prompt to running.
- A 3D world where objects are placed while a Telegram-connected agent builds alongside via shared memory.
- A scheduled news crawler agent flow: pulls AI news, rewrites in a chosen voice, posts to Telegram, saves to shared memory.
Why it works for local-first setups
- Any model that can follow an HTTP-aware prompt can use AIMEAT.
- Your node, your data, MIT, no monthly fees.
- Federates between nodes (small local network or join a bigger one).
- Single
npm installon Node 24+.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Maggy: An Autonomous Engineering Platform on Claude Code with Cross-Session Memory and P2P Team Learning
Maggy sits at Level 4 of the AI coding tool spectrum: multi-model orchestration, cross-session memory, process intelligence from CI/reviews, and P2P team learning. Benchmarks show 83% reduction in Claude usage while catching 7 security issues missed by single-pipeline Claude Code.

OpenClaw Skill Reduces Accessibility Tree Tokens from 600K to 1.3K
A developer built an OpenClaw skill that uses ML-based element ranking to prune accessibility trees, cutting slickdeals.com from ~598K tokens to ~1.3K tokens by keeping only the top ~50 actionable elements.

Claw Compactor: 14-stage token compression engine for LLM pipelines
Claw Compactor is an open-source LLM token compression engine using a 14-stage Fusion Pipeline to achieve 54% average compression with zero LLM inference cost. It includes specialized compressors for code, JSON, logs, diffs, and search results with reversible compression capabilities.

Fullerenes: Open-source persistent memory layer for coding agents cuts tokens by 64% on SWE-bench
Fullerenes uses a local SQLite knowledge graph built via Tree-sitter to give coding agents like Claude Code persistent memory, reducing token usage by 64% on SWE-bench and up to 96.6% on internal benchmarks.