TextGen (text-generation-webui) Becomes Native Desktop App with Portable Builds
TextGen (formerly text-generation-webui, also known as oobabooga) has released a native desktop app version. The project, in development since December 2022, now offers portable builds — just download, unzip, and double-click textgen. No installation or files outside the extracted folder. All chat histories and settings live in a bundled user_data folder.
Key Features
- Privacy: Zero outbound requests. Unlike LM Studio, it does not phone home with OS, CPU, app version, or backend choices.
- ik_llama.cpp builds: Ships custom quant types like IQ4_KS and IQ5_KS for better accuracy vs vanilla llama.cpp used by LM Studio and Ollama.
- Built-in web search: Uses
ddgsPython library. Two modes: tool-calling with theweb_searchtool (works with Qwen 3.6 and Gemma 4) or a checkbox that fetches search results as text attachments. - Tool-calling: Supports single-file .py tools (easy custom functions), HTTP MCP servers, and stdio MCP servers. Option to require approve/reject confirmation before executing tool calls. Guide here.
- Custom characters: Create characters for casual chats alongside instruction-following conversations.
- API compatibility: OpenAI and Anthropic spec-compliant API. Works with Claude Code:
ANTHROPIC_BASE_URL=http://127.0.0.1:5000 claude. - PDF extraction: Uses PyMuPDF for accurate text extraction.
- Web page fetching: Uses
trafilaturato strip navigation/boilerplate, saving tokens in agentic loops. - Jinja2 templates: Renders chat templates via Python Jinja2, avoiding crashes in llama.cpp's C++ jinja reimplementation.
Available Builds
CUDA, Vulkan, CPU-only, Mac (Apple Silicon and Intel), and ROCm. All portable.
License: AGPLv3. Source: https://github.com/oobabooga/textgen
📖 Read the full source: r/LocalLLaMA
👀 See Also

Developer Builds Scheme Compiler to WASM Using AI in 4 Days
A developer created Puppy Scheme, a Scheme compiler that targets WebAssembly, in about 4 days using AI assistance. The compiler supports 73% of R5RS and R7RS, uses WASM GC, and achieved compilation time improvements from 3½ minutes to 11 seconds overnight.

GSD-Lite: A State Machine for Claude Code That Enforces TDD and Prevents Test Skipping
GSD-Lite is an open-source MCP server that adds a 12-state workflow machine to Claude Code, enforcing test-driven development with specific anti-rationalization prompts and separate agent contexts for execution, review, and debugging.

4-Pane iTerm2 Setup for Claude Code CLI Separates AI Roles
A developer built a four-pane iTerm2 terminal setup specifically for Claude Code CLI to address context drift and self-grading bias. Each pane is locked to a specific role with dedicated models and permissions.

Coasts: Containerized Hosts for Running Multiple Localhost Environments
Coasts is a Docker-in-Docker solution that solves the problem of running multiple localhost environments simultaneously, handling port conflicts, secrets, and volume topologies without requiring complex scripting.