Gemma 4 Released: Four Model Sizes for Local AI Hosting

Gemma 4 Model Specifications
Gemma 4 is now available as a self-hostable model family in four configurations, each aimed at a different class of hardware. According to the source, it is not meant to compete with Claude, Codex, or Gemini. Instead, it is positioned as a practical option for multi-model routing setups, where a small but capable self-hosted model handles some of the requests and saves tokens.
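The routing idea can be sketched in a few lines. This is a minimal illustration only: the `Route` type, the `choose_route` function, the `needs_frontier_model` flag, and the 2,000-character threshold are all hypothetical and do not come from the source or from any particular router.

```python
from dataclasses import dataclass


@dataclass
class Route:
    backend: str  # "local" (self-hosted model) or "hosted" (paid API)
    reason: str


def choose_route(
    prompt: str,
    needs_frontier_model: bool = False,
    max_local_chars: int = 2000,  # hypothetical cutoff, tune for your workload
) -> Route:
    """Pick a backend for one request using two simple, assumed heuristics."""
    if needs_frontier_model:
        return Route("hosted", "caller flagged the task as hard")
    if len(prompt) > max_local_chars:
        return Route("hosted", "prompt exceeds the local length budget")
    return Route("local", "short routine prompt")


if __name__ == "__main__":
    print(choose_route("Summarize this commit message in one line."))
```

Every request that lands on the `local` route spends no hosted-API tokens, which is the saving the source describes. A real router would likely use better signals than prompt length.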
Model Variants and Hardware Requirements
- E2B (2.3B effective parameters): Built for edge devices like phones and the Raspberry Pi. Requires ~4-8GB RAM and runs well on a CPU. Recommended for hosting on a VPS.
- E4B (4.5B effective parameters): Built for laptops and low-end hardware. Maintains a low memory footprint.
- 26B MoE (25B total, 3.8B active): Built for consumer GPUs. Because only about 3.8B parameters are active per token, it runs at inference speeds similar to a 4B model.
- 31B Dense: Built for mid-range GPUs and workstations. Requires approximately 16-20GB VRAM when using 4-bit quantization.
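The VRAM figure for the 31B Dense model can be sanity-checked with simple arithmetic: weight memory is roughly parameter count times bits per weight, divided by 8. The sketch below is a back-of-the-envelope estimate only. The function name is made up for illustration, and the calculation ignores the KV cache, activations, and runtime overhead, which presumably account for the gap between the raw weight size and the quoted 16-20GB.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: int) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


# 31B dense model at 4-bit quantization: weights only.
dense_31b_4bit = weight_memory_gb(31, 4)

# For the MoE variant, the total parameter count (25B) is used here on the
# assumption that all experts are kept in memory, not just the active 3.8B.
moe_25b_4bit = weight_memory_gb(25, 4)

if __name__ == "__main__":
    print(f"31B dense, 4-bit: ~{dense_31b_4bit:.1f} GB for weights")
    print(f"25B MoE,   4-bit: ~{moe_25b_4bit:.1f} GB for weights")
```

The 31B estimate comes out at about 15.5 GB for weights alone, which sits just under the 16-20GB range given above.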
Capabilities and Availability
All Gemma 4 models are multimodal with both text and vision capabilities. The E2B and E4B edge models specifically support real-time audio. The models are built for advanced reasoning and agentic workflows.
Gemma 4 is available on Google AI Studio, Hugging Face, Kaggle, and Ollama.
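For readers who go the Ollama route, a locally running Ollama server exposes an HTTP API on port 11434 by default. The sketch below builds a request for its `/api/generate` endpoint using only the Python standard library. The model tag `gemma4:e2b` is a placeholder assumption, not a confirmed tag, so check the Ollama library for the actual name before using it.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint
MODEL_TAG = "gemma4:e2b"  # ASSUMPTION: placeholder tag, verify the real one


def build_request(prompt: str, model: str = MODEL_TAG) -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str = MODEL_TAG, timeout: float = 120.0) -> str:
    """Send the prompt and return the generated text (needs a running server)."""
    with urllib.request.urlopen(build_request(prompt, model), timeout=timeout) as resp:
        body = json.loads(resp.read().decode("utf-8"))
    return body["response"]
```

Calling `generate("Explain MoE in one sentence.")` only works once the Ollama server is running and the model has been pulled; `build_request` can be inspected without either.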
📖 Read the full source: r/openclaw