Self-host an OpenAI-compatible speech API for local transcription, translation, and TTS with Speaches
Use Speaches when an agent stack expects OpenAI-style audio endpoints but you want a self-hosted speech backend for transcription, translation, and text-to-speech instead of a hosted API.
What it does
Speaches serves OpenAI-style audio endpoints for transcription, translation, and text-to-speech from a backend you run yourself, so clients written against a hosted audio API can be pointed at your own deployment instead.
Prerequisites
You need a Docker or Python-based deployment environment, a CPU or GPU runtime, one or more supported speech models, and a client or agent stack that can call OpenAI-compatible audio endpoints.
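As a rough illustration of what "a client that can call OpenAI-compatible audio endpoints" means, the sketch below builds (without sending) a text-to-speech request using only the Python standard library. The route paths follow the OpenAI audio API convention. The base URL, the model name, and the voice name are placeholders assumed for illustration, not values taken from the Speaches documentation, so check your deployment for the real ones.

```python
import json
import urllib.request

# Assumption: a Speaches server is reachable at this address; adjust to your deployment.
BASE_URL = "http://localhost:8000"

# OpenAI-style audio routes, assumed to be served by an OpenAI-compatible backend.
ENDPOINTS = {
    "transcription": "/v1/audio/transcriptions",
    "translation": "/v1/audio/translations",
    "speech": "/v1/audio/speech",
}


def build_speech_request(text, model, voice, base_url=BASE_URL):
    """Build, but do not send, an OpenAI-style text-to-speech request."""
    payload = {"model": model, "input": text, "voice": voice}
    return urllib.request.Request(
        url=base_url.rstrip("/") + ENDPOINTS["speech"],
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # "my-tts-model" and "my-voice" are hypothetical names; use ones your server provides.
    req = build_speech_request("Hello from a local backend.", "my-tts-model", "my-voice")
    print(req.get_method(), req.full_url)
    # To send it against a running server and read back the audio bytes:
    # with urllib.request.urlopen(req) as resp:
    #     audio_bytes = resp.read()
```

Keeping request construction separate from sending makes it easy to point the same client code at a hosted API or a local backend by changing only the base URL.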
Installation
Upstream notes on requirements and getting started:
- See the documentation for installation instructions and usage: speaches.ai
- Extracted from upstream docs: https://raw.githubusercontent.com/speaches-ai/speaches/HEAD/README.md
Quality
Deterministic score 0.45 from registry signals: indexed on GitHub topic:agent-skills · 8 GitHub stars · SKILL.md body (1,186 chars)