MCP Support Merged into llama.cpp with New WebUI Features

The Model Context Protocol (MCP) pull request for llama.cpp has been merged into the main repository. This integration adds MCP support to the llama-server and WebUI components, enabling several new capabilities for AI coding agents and local LLM workflows.
Key Features Added
According to the source material, this merge unlocks the following functionality:
- MCP support for llama-server/WebUI
- Tool calls capability
- Agentic loop implementation
- Server selector functionality
- Resources management
- Prompt attachments
- File/resource browser
- Backend CORS proxy enabled with
--webui-mcp-proxyflag
Technical Details
The specific pull request that was merged is PR #18655 in the llama.cpp repository. The contributor mentions using OpenWebUI in combination with llama.cpp WebUI and was anticipating this merge.
For developers working with local LLMs and AI coding agents, this integration means you can now use MCP-compatible tools directly through the llama.cpp WebUI interface. The --webui-mcp-proxy flag enables CORS proxy functionality for the backend, which is particularly useful for web-based interfaces.
This update is primarily for developers who use llama.cpp for local LLM inference and want to integrate MCP-compatible tools into their workflow through the WebUI interface.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Symphony workflow automation tool works with Claude Code
A developer got the Symphony spec working with Claude Code to automate ticket-to-PR workflows, using Node/TypeScript initially but noting Elixir might be better. The tool requires separate API key setup and billing beyond Claude subscriptions.
UI and Server for Anthropic's Natural Language Autoencoders on llama.cpp
A custom llama.cpp server and Mikupad UI for Anthropic's open-weight Natural Language Autoencoders, supporting activation extraction, explanation, reconstruction, and steering via explanation editing.

Data Analyst Builds Prompt Calibrator Tool with Claude, No Prior Frontend Experience
A data analyst with no HTML, CSS, or JavaScript experience built Prompt Calibrator, a client-side web tool that structures AI prompts through a form with four fields and four modes. The tool was developed using Claude as a code review partner and is hosted on GitHub Pages.

AutoBe: How Weak Local LLMs Fixed an AI Backend Generator's Architecture
AutoBe is an open-source AI agent that generates complete backend apps using TypeScript, NestJS, and Prisma. The team discovered their initial 100% compilation success produced unmaintainable code, then rebuilt with modular generation—crashing success to 40%—and used weak local LLMs like qwen3-30b-a3b-thinking to debug schema ambiguities.