Text to Speech and Speech to Text

Decisions

Pending: STT engine choice (Groq Whisper, OpenAI Whisper, or alternatives)
Pending: TTS engine choice (ElevenLabs, OpenAI TTS, or alternatives)

User Tasks

Summary

Voice interface for Opus — speak to it and hear responses.

Problem / Motivation

Typing is not always convenient. Voice interaction enables hands-free use, especially on phone (FR-022) and for briefing delivery.

Proposed Solution

Integrate STT and TTS engines:

STT: Groq Whisper (fast, free tier), OpenAI Whisper, or local Whisper
TTS: ElevenLabs (high quality voices), OpenAI TTS, or alternatives

Consider privacy for voice data.

Open Questions

⇒

No open questions.

Phase Overview

Phase	Description	Status
Phase 1	Speech to Text	—
Phase 2	Text to Speech	—

Phase 1: Speech to Text —

Goal: Enable voice input that gets transcribed and sent to Opus.

File / Feature	Details	Owner	Status
Voice input	Voice input via phone or desktop mic	opus	—
Transcription	Transcribe and send to Opus as text command	opus	—

Phase 2: Text to Speech —

Goal: Enable Opus to read responses and briefings aloud.

File / Feature	Details	Owner	Status
Read briefings aloud	Read daily briefings as audio	opus	—
Voice responses	Voice responses to queries	opus	—

Prerequisites / Gap Analysis

Requirements

Requirement	Description
REQ-1	Can give Opus a voice command and receive a text response
REQ-2	Can listen to the daily briefing as audio

Current State

Component	Status	Details
Phone access	—	FR-022 not yet started
STT integration	—	No integration exists
TTS integration	—	No integration exists

Gap (What’s missing?)

Gap	Effort	Blocker?
Phone access (FR-022)	High	Yes
STT engine integration	Med	No
TTS engine integration	Med	No

Test

Manual tests

Test	Expected	Actual	Last
Voice command input	Transcribed text sent to Opus	pending	-
Briefing audio playback	Daily briefing read aloud	pending	-

AI-verified tests

Scenario	Expected behavior	Verification method
…	…	…

E2E tests

Scenario	Assertion
…	…

Integration tests

Component	Coverage
…	…

Unit tests

Component	Tests	Coverage
…	…	…

History

Date	Event	Details
2026-02-26	Created	Created from brain dump
2026-02-27	Renumbered	From FR-019 to FR-024
2026-02-28	Rewritten	Aligned to feature-request template

References

FR-022 (Phone Access) — depends on phone access for mobile voice use
ClaudeClaw Rebuild Prompt — Groq Whisper STT + ElevenLabs TTS pipeline, OGA→OGG workaround

Opus Vault

Explorer