Offline Voice-to-Text Tool for macOS Using Local Whisper via MLX

A developer has created whisper-dictate, a macOS tool for fully offline voice-to-text transcription with on-the-fly translation. It runs OpenAI's Whisper locally through MLX on Apple Silicon, so no data ever leaves your machine.
How It Works
The workflow is straightforward: hold the fn key, speak, and release. The text is transcribed and pasted directly where you're typing. The tool works in Slack, VS Code, browsers, email, or any other text field. A floating "Listening..." overlay provides visual feedback during recording.
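The hold-speak-release flow can be sketched as a small state machine. This is a minimal illustration, not the tool's actual code: the `PushToTalk` class and its `transcribe` and `paste` callbacks are hypothetical names, and real key events and microphone audio would come from macOS APIs that are not shown here.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class PushToTalk:
    """Buffers audio while the key is held; transcribes and pastes on release."""
    transcribe: Callable[[bytes], str]  # hypothetical: raw audio bytes -> text
    paste: Callable[[str], None]        # hypothetical: insert text at the cursor
    recording: bool = False
    _chunks: List[bytes] = field(default_factory=list)

    def key_down(self) -> None:
        # Start a fresh recording; ignore key-repeat events while already recording.
        if not self.recording:
            self.recording = True
            self._chunks.clear()

    def audio_chunk(self, chunk: bytes) -> None:
        # Only buffer audio while the key is held.
        if self.recording:
            self._chunks.append(chunk)

    def key_up(self) -> None:
        # Stop recording, transcribe the buffered audio, and paste non-empty text.
        if not self.recording:
            return
        self.recording = False
        text = self.transcribe(b"".join(self._chunks)).strip()
        if text:
            self.paste(text)


# Simulated session with stand-in callbacks instead of a real model and clipboard.
pasted: List[str] = []
ptt = PushToTalk(
    transcribe=lambda audio: f" {len(audio)} bytes heard ",
    paste=pasted.append,
)
ptt.audio_chunk(b"ignored")  # key not held yet, so this chunk is dropped
ptt.key_down()
ptt.audio_chunk(b"abc")
ptt.audio_chunk(b"de")
ptt.key_up()
print(pasted)  # ['5 bytes heard']
```

Keeping the key handling separate from the transcription and paste callbacks means the same logic can be exercised without a microphone or a loaded model.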
Technical Details
- Transcription takes about 500 ms after you stop speaking
- Uses Whisper running locally through MLX on Apple Silicon
- The small model is solid for everyday use
- The large-v3-turbo model provides near-perfect accuracy
- 100% offline operation - no accounts, tokens, or data leaving your machine
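To check a latency figure like the one above on your own hardware, a sketch along these lines may help. It assumes the `mlx-whisper` Python package and the `mlx-community/whisper-large-v3-turbo` model repository; the source does not say which package or model files whisper-dictate actually uses, and the `timed` and `mlx_transcribe` helpers are hypothetical names.

```python
import time
from typing import Callable, Tuple

# Assumption: a Hugging Face repo name; the tool's actual model source is not stated.
DEFAULT_MODEL = "mlx-community/whisper-large-v3-turbo"


def mlx_transcribe(audio_path: str, model: str = DEFAULT_MODEL) -> str:
    """Transcribe one audio file with the mlx-whisper package (Apple Silicon only)."""
    import mlx_whisper  # imported lazily so the timing helper runs on any machine

    result = mlx_whisper.transcribe(audio_path, path_or_hf_repo=model)
    return result["text"].strip()


def timed(transcribe: Callable[[str], str], audio_path: str) -> Tuple[str, float]:
    """Run a transcription callable and return (text, elapsed milliseconds)."""
    start = time.perf_counter()
    text = transcribe(audio_path)
    elapsed_ms = (time.perf_counter() - start) * 1000.0
    return text, elapsed_ms
```

On an Apple Silicon Mac with the package installed, `text, ms = timed(mlx_transcribe, "clip.wav")` would return the transcript and the wall-clock time. The first call typically includes model loading, so a second call is a fairer measure of steady-state latency.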
Translation Feature
Whisper can translate on the fly without extra models or translation APIs, because translation is built into its decode step. For example, speaking French produces English text: you set the output language to English, and Whisper handles the translation natively. Whisper's translate task is trained to produce English output, so English is the target language for this mode.
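The choice between transcription and translation comes down to two Whisper decode options, `task` and `language`. The sketch below shows one way to derive them from a spoken and output language pair. The `decode_options` helper is a hypothetical name, and how whisper-dictate maps its settings internally is not described in the source.

```python
from typing import Dict


def decode_options(spoken: str, output: str) -> Dict[str, str]:
    """Map a spoken/output language pair to Whisper's `language` and `task` options."""
    spoken, output = spoken.lower(), output.lower()
    if spoken == output:
        # Same language in and out: plain transcription.
        return {"language": spoken, "task": "transcribe"}
    if output == "en":
        # Whisper's built-in translate task produces English text.
        return {"language": spoken, "task": "translate"}
    raise ValueError("Whisper's translate task only targets English output")


print(decode_options("fr", "en"))  # {'language': 'fr', 'task': 'translate'}
print(decode_options("fr", "fr"))  # {'language': 'fr', 'task': 'transcribe'}
```

The resulting dictionary can be passed as keyword arguments to a Whisper transcribe call that accepts decode options, for example `mlx_whisper.transcribe(path, **decode_options("fr", "en"))` if the `mlx-whisper` package is used, which is an assumption here.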
Setup and Configuration
On launch, the tool asks what language you'll be speaking and what language you want the output in, which makes it easy to switch between pure transcription and translation. Once installed, the tool needs no internet connection.
Availability
The project is open-sourced on GitHub for others to use and adapt to their workflows.
📖 Read the full source: r/LocalLLaMA