TranscriptionSuite v1.1.2 adds WhisperX, NeMo, and VibeVoice models

✍️ OpenClawRadar · 📅 Published: April 17, 2026

TranscriptionSuite v1.1.2 Release

TranscriptionSuite, a fully local, open-source audio transcription application, has released version 1.1.2, which adds several features in response to community feedback.

Key Updates

The developer replaced the previous faster-whisper implementation with WhisperX and added support for multiple new model families:

  • WhisperX - Includes diarization via PyAnnote
  • NeMo models - Support for Parakeet and Canary models with diarization via PyAnnote
  • VibeVoice models - Support for both the main model and its 4-bit quantized version, with built-in diarization

New Features

  • Model Manager - Centralized control for managing different transcription models
  • Parallel processing mode - Simultaneous transcription and diarization
  • Shortcut controls - Keyboard shortcuts for improved workflow
  • Paste at cursor - Direct text insertion functionality
  • 24kHz recording pipeline - Specifically added to take full advantage of VibeVoice models (Whisper and NeMo models require 16kHz)
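
The parallel processing mode listed above can be pictured with a short sketch. This is not TranscriptionSuite's code: `transcribe`, `diarize`, and `transcribe_parallel` are hypothetical stand-ins that return canned data, and the overlap-based speaker assignment is an assumed merge strategy. It only illustrates running the two stages at the same time and combining their results afterwards.

```python
from concurrent.futures import ThreadPoolExecutor


def transcribe(audio_path):
    # Hypothetical stand-in for a speech-to-text model call.
    return [
        {"start": 0.0, "end": 2.0, "text": "hello there"},
        {"start": 2.0, "end": 4.0, "text": "hi"},
    ]


def diarize(audio_path):
    # Hypothetical stand-in for a diarization model call.
    return [
        {"start": 0.0, "end": 2.0, "speaker": "SPEAKER_00"},
        {"start": 2.0, "end": 4.0, "speaker": "SPEAKER_01"},
    ]


def overlap(a, b):
    # Length in seconds of the time range shared by two segments.
    return max(0.0, min(a["end"], b["end"]) - max(a["start"], b["start"]))


def transcribe_parallel(audio_path):
    # Submit both stages at once so neither waits for the other to finish.
    with ThreadPoolExecutor(max_workers=2) as pool:
        text_future = pool.submit(transcribe, audio_path)
        turns_future = pool.submit(diarize, audio_path)
        segments = text_future.result()
        turns = turns_future.result()

    # Assumed merge step: label each text segment with the speaker turn
    # that overlaps it the most.
    labeled = []
    for seg in segments:
        best = max(turns, key=lambda turn: overlap(seg, turn))
        labeled.append({**seg, "speaker": best["speaker"]})
    return labeled
```

Threads are enough for a sketch like this because real model calls spend most of their time in native or GPU code; a real application might use separate processes instead.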

The application now provides three distinct transcription pipelines, each with different diarization approaches depending on the model family selected.
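
As a rough illustration, the three pipelines can be summarized in a small lookup table. The sample rates and diarization backends come from the release notes above; the `Pipeline` class, the `PIPELINES` mapping, the family keys, and `pipeline_for` are hypothetical names chosen for this sketch and are not part of TranscriptionSuite.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pipeline:
    family: str
    sample_rate_hz: int  # recording rate the model family expects
    diarization: str     # where speaker labels come from


# Values taken from the release notes; keys are illustrative only.
PIPELINES = {
    "whisperx": Pipeline("whisperx", 16_000, "pyannote"),
    "nemo": Pipeline("nemo", 16_000, "pyannote"),
    "vibevoice": Pipeline("vibevoice", 24_000, "built-in"),
}


def pipeline_for(family):
    # Normalize the name so "VibeVoice" and " vibevoice " both resolve.
    key = family.strip().lower()
    if key not in PIPELINES:
        raise ValueError(f"unknown model family: {family!r}")
    return PIPELINES[key]
```

Under these assumptions, `pipeline_for("VibeVoice").sample_rate_hz` gives 24000, while the WhisperX and NeMo entries give 16000.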

📖 Read the full source: r/LocalLLaMA
