Holaboss AI Runtime Moves to TypeScript, Implements Persistent MCP Ports

A Reddit user analyzed the Holaboss AI local agent runtime repository, highlighting two significant engineering improvements for developers working with local LLM stacks like Ollama, llama.cpp, and LM Studio combined with MCP (Model Context Protocol).
TypeScript-Only Packaging
The runtime has been moved entirely to TypeScript. The API layer, runner orchestration, workspace MCP hosting, and packaging now all live in TypeScript. This means the packaged runtime no longer ships Python source code or Python dependencies. For local/self-hosted stacks, this results in a smaller bundle, fewer moving pieces, and less cross-language drift.
Persistent MCP Port Management
The runtime has eliminated hardcoded MCP port math. Instead, ports are now persisted in SQLite with a UNIQUE(port) constraint and (workspace_id, app_id) as the key. The runner merges prepared MCP servers during bootstrap. This approach ensures that local sidecars come back on stable, collision-resistant ports across restarts, replacing the typical "13100 + i guesswork" method.
The analysis suggests that as local models improve, the focus shifts from model quality to harness quality. Packaging, sidecar lifecycle, local service discovery, and runtime state become critical factors in determining whether a local agent stack feels solid in practice.
The repository is available at: https://github.com/holaboss-ai/holaboss-ai
📖 Read the full source: r/LocalLLaMA
👀 See Also

Using pre-commit to improve AI-generated code quality and security
A developer shares their pre-commit configuration for Go and Java projects, using tools like golangci-lint, govulncheck, and checkov to catch vulnerabilities and quality issues in AI-generated code before commit.

Giving Claude a Local LLM as an Assistant via MCP on Mac
A developer connects Claude to a local Qwen 2.5 Coder 14B via Ollama and MCP, creating a no-cost assistant for delegating tasks like text processing and handling large files.

Hybrid search with RRF improves AI memory system over pure vector search
An open-source AI memory system using PostgreSQL with pgvector found pure vector search insufficient for exact matches, so it added full-text search and merged results using Reciprocal Rank Fusion (RRF) with k=60, plus query enrichment via tokenizer.

GitHub Comic Bot: Turn Commits into Daily Medieval Knight Comics
A bot that reads GitHub commits and generates 4-panel comic strips featuring a deadpan medieval knight, built with Claude Code and Gemini, running on GitHub Actions with free tier costs.