SpruceChat Runs 0.5B LLM On-Device on Miyoo Handhelds via llama.cpp

What This Is
SpruceChat is a project that runs the Qwen2.5-0.5B language model entirely on-device on several handheld gaming consoles using llama.cpp. It requires no cloud connection or WiFi after the initial setup.
Key Details
The model lives in RAM after the first boot, and tokens stream in one by one during generation. It runs on the Miyoo A30, Miyoo Flip, Trimui Brick, and Trimui Smart Pro.
Performance on the Miyoo A30 (which has a Cortex-A7 quad-core processor):
- Model load: ~60 seconds on first boot
- Generation speed: ~1-2 tokens per second
- Prompt evaluation: ~3 tokens per second
The developer notes it's not fast, but it streams so you can watch it think. They mention 64-bit devices are quicker.
The AI is described as having "the personality of a spruce tree: patient, unhurried, quietly amazed by everything."
If the device is on WiFi, you can also hit the llama-server from a browser on a phone or laptop to chat with a real keyboard.
The repository is at https://github.com/RED-BASE/SpruceChat. The project was built with help from Claude, and there's already a collaborator working on expanding device support. The first release is up with both armhf and aarch64 binaries, and the model is included.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Contextium: Open-Source Persistent Context Framework for Claude Code
Contextium is a structured git repo framework that provides persistent context for Claude Code sessions, using a CLAUDE.md file as a context router to lazy-load relevant markdown files. The open-source version includes a template with 6 sample apps and 27 integration docs.

Need MCP Server Provides Semantic Tool Discovery for AI Agents
An MCP server called Need enables semantic search over 10,000+ tools from brew, npm, pip, and cargo. When an agent requests a task like 'compress these PNGs,' it finds pngquant, installs it, runs it, and reports back on success.

Vibe Hosting: Claude Code MCP Integration for AI-Assisted Deployment
NameOcean's Vibe Hosting platform integrates Claude Code MCP to build and deploy projects through natural language commands. The service offers free SSL, domains, DNS, and VPS setup for static sites, Node.js, Python, Django, and Go applications.

Claudius: Open-Source Embeddable AI Chat Widget for Claude
Claudius is an open-source, self-hosted chat widget powered by Claude that can be embedded on any website with one script tag. It runs on Cloudflare Workers with a React frontend and includes features like custom system prompts, rate limiting, and accessibility compliance.