AIsbf 0.9.8 adds caching, routing improvements, and expanded AI service support

AIsbf (AI Should Be Free) 0.9.8 is an API proxy/router that provides an OpenAI-compatible interface to various AI endpoint services, aiming to make LLM usage more cost-effective. It's multiuser and can scale from small setups to large infrastructure.
Key features in version 0.9.8
- Cache support for Redis, SQLite, MySQL, and file-based storage
- Additional context condensation methods
- Native prompt caching and request caching support
- Faster and improved semantic prompt-based routing for automatic service selection
- Full OAuth2 support for Claude.ai subscribers
- Full OAuth2 support for Amazon Kiro-cli subscribers
- Full OAuth2 support for OpenAI Codex subscribers
- Full support for Kilo.ai subscribers using tokens or OAuth2
- Multiple bug fixes and minor feature additions
This type of tool is useful for developers who work with multiple AI services and want a unified interface while optimizing costs through intelligent routing and caching.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Context Routing Layer Reduces Claude Code Token Usage by Tracking Accessed Files
A developer saved approximately $80 per month on Claude Code usage by adding a context routing layer that prevents the AI from re-reading the same repository files on follow-up turns. The tool tracks what files have already been accessed to reduce redundant token consumption.

LLM Circuit Finder: Duplicate 3 layers to boost reasoning without training
A new toolkit finds 'reasoning circuits' in transformer models - contiguous blocks of 3-4 layers that act as indivisible cognitive units. Duplicating these blocks (layers 12-14 in Devstral-24B) improves logical deduction from 0.22 to 0.76 on BBH benchmarks with no weight changes or training.

boxBot: An Open-Source Smart Speaker Powered by Claude and Hailo AI
A developer built a smart speaker named boxBot using Claude for agent-driven hardware control, Raspberry Pi, Hailo AI accelerator, and custom SDK—open-sourced on GitHub.

Open Source Agent Skill for TypeScript, React, and Next.js Patterns
A developer has released a 4,000-line, 17-file structured markdown reference designed for AI agents like Claude Code to follow when generating or reviewing TypeScript, React, and Next.js code. It addresses common issues like improper API response validation and misuse of 'use client' directives.