Diagnosing Operational Drift and Task Amnesia in OpenClaw with Gemini 2.5 Flash on Proxmox

OpenClaw, when configured with the Gemini 2.5 Flash model on a Proxmox VM, exhibits significant issues with operational drift and task amnesia in maintaining persistent workflows. The agent operates effectively for single tasks, but struggles when tasked with more complex, continuous operations.
Users report core issues with the configuration:
- Automated GitHub Organization Failures: Despite set protocols, the agent fails to autonomously manage GitHub Discussions and Projects without manual prompts.
- Selective Persistence: Though the setup successfully pings Healthchecks every 10 minutes, more complex tasks, such as sending daily Slack reports, are acknowledged but not executed.
- Tool Hallucination and Memory Volatility: The agent reports successful file manipulations that don't occur, forgets stored credentials for GitHub or Drive mid-session, and fails to format Slack messages correctly.
- Language Policy Drift: The agent inconsistently applies a specified prompt enforcing Spanish for communication and English for GitHub documentation, often reverting to Spanish inappropriately.
With the system resources averaging a 75% load, these issues may stem from a potential hardware bottleneck affecting the OpenClaw wrapper or underlying processes. Attempts to utilize commands like cat and ls for verification often result in no output during high resource usage.
📖 Read the full source: r/openclaw
👀 See Also

Qwen KV Cache Quantization Deep Dive: PPL, KL Divergence, and Asymmetric K/V Results
Second round of benchmarks on Qwen 3.6-35B-A3B with KV cache quantization: perplexity, KL divergence, asymmetric K/V combos, and 64K context depth on Apple M5 Max.

Tencent Hosts Free OpenClaw Installation Event in Shenzhen Amid High Demand
Tencent organized 20 employees outside its Shenzhen office building to install OpenClaw for free on March 6, responding to reports of people paying over $70 for house-call installation services. The event used Tencent Cloud's Lighthouse platform, with most attendees being white-collar professionals facing workplace competition and AI adoption pressure.

Gemma 4 vs Qwen 3.5 Blind Evaluation Results with Claude Opus as Judge
A 30-question blind evaluation compared Gemma 4 31B, Gemma 4 26B-A4B, and Qwen 3.5 27B using Claude Opus 4.6 as judge. Qwen 3.5 27B won 46.7% of matchups but had lower average scores due to three zero-scoring responses.
Parameter Golf: OpenAI's AI-Assisted ML Research Experiment
OpenAI ran Parameter Golf, a competition with 1,000+ participants and 2,000+ submissions, testing AI-assisted machine learning, coding agents, quantization, and novel model design under strict constraints.