OpenClaw 2026.4.2 and 2026.3.31 break local LLM connections

Connection issues with local LLM instances
Two recent OpenClaw updates—versions 2026.4.2 and 2026.3.31—are breaking connections to locally hosted Ollama instances. Users report that the agent times out when attempting to connect to locally hosted Ubuntu machines running Ollama.
The error logs show a clear pattern of connection failures:
01:07:29 warn agent/embedded Profile ollama-remote2:default timed out. Trying next account... 01:07:29 warn agent/embedded embedded run failover decision 01:07:29 error diagnostic lane task error: lane=main durationMs=61252 error="FailoverError: LLM request timed out." 01:07:29 error diagnostic lane task error: lane=session:agent:aitrader:main durationMs=61254 error="FailoverError: LLM request timed out." 01:07:29 warn model-fallback/decision model fallback decision
According to the report, reverting to version 2026.3.28 resolves the issue. This suggests the problem was introduced in the 2026.3.31 update and persists in 2026.4.2.
For developers using OpenClaw with local LLM instances, this means either staying on version 2026.3.28 or waiting for a fix in future releases. The timeout errors indicate the agent is attempting to connect but failing to establish or maintain communication with the local Ollama service.
📖 Read the full source: r/openclaw
👀 See Also

Developer Prefers Qwen3.5-27B Over Proprietary Models for Its Failure Mode
A developer on r/LocalLLaMA reports preferring Qwen3.5-27B over Gemini 3.1 Pro and GPT-5.3 Codex because it gives up on problematic tasks rather than generating potentially dangerous code like unrestricted Perl or NodeJS scripts.

Anthropic's DoD Meeting and Chinese AI Labs Distilling Claude
Anthropic's CEO meets with the US Secretary of Defense in what officials describe as a 'shape up or ship out' situation, while the company reports catching three Chinese AI labs conducting massive model distillation of Claude's capabilities.

MLX Inference Performance Update: April 2026 Benchmarks and Features
MLX inference performance has improved significantly, with Qwen3.5-35B-A3B reaching 71.8 tokens/second at 4K context and new features like Multi-Token Prediction and SpecPrefill providing 2.3x-5.5x speedups for large models.

Anthropic Acquires Stainless for $300M+ — Now Owns Dominant MCP Server Generator
Anthropic bought SDK generator Stainless for $300M+. Stainless generates most production MCP servers from OpenAPI specs. The hosted product is winding down; new signups stopped Monday.