Decisions
User Tasks
Summary
Voice interface for Opus — speak to it and hear responses.
Problem / Motivation
Typing is not always convenient. Voice interaction enables hands-free use,
especially on phone (FR-022) and for briefing delivery.
Proposed Solution
Integrate STT and TTS engines:
- STT: Groq Whisper (fast, free tier), OpenAI Whisper, or local Whisper
- TTS: ElevenLabs (high quality voices), OpenAI TTS, or alternatives
Consider privacy for voice data.
Open Questions
⇒
No open questions.
Phase Overview
| Phase | Description | Status |
|---|
| Phase 1 | Speech to Text | — |
| Phase 2 | Text to Speech | — |
Phase 1: Speech to Text —
Goal: Enable voice input that gets transcribed and sent to Opus.
| File / Feature | Details | Owner | Status |
|---|
| Voice input | Voice input via phone or desktop mic | opus | — |
| Transcription | Transcribe and send to Opus as text command | opus | — |
Phase 2: Text to Speech —
Goal: Enable Opus to read responses and briefings aloud.
| File / Feature | Details | Owner | Status |
|---|
| Read briefings aloud | Read daily briefings as audio | opus | — |
| Voice responses | Voice responses to queries | opus | — |
Prerequisites / Gap Analysis
Requirements
| Requirement | Description |
|---|
| REQ-1 | Can give Opus a voice command and receive a text response |
| REQ-2 | Can listen to the daily briefing as audio |
Current State
| Component | Status | Details |
|---|
| Phone access | — | FR-022 not yet started |
| STT integration | — | No integration exists |
| TTS integration | — | No integration exists |
Gap (What’s missing?)
| Gap | Effort | Blocker? |
|---|
| Phone access (FR-022) | High | Yes |
| STT engine integration | Med | No |
| TTS engine integration | Med | No |
Test
Manual tests
| Test | Expected | Actual | Last |
|---|
| Voice command input | Transcribed text sent to Opus | pending | - |
| Briefing audio playback | Daily briefing read aloud | pending | - |
AI-verified tests
| Scenario | Expected behavior | Verification method |
|---|
| … | … | … |
E2E tests
Integration tests
Unit tests
| Component | Tests | Coverage |
|---|
| … | … | … |
History
| Date | Event | Details |
|---|
| 2026-02-26 | Created | Created from brain dump |
| 2026-02-27 | Renumbered | From FR-019 to FR-024 |
| 2026-02-28 | Rewritten | Aligned to feature-request template |
References
- FR-022 (Phone Access) — depends on phone access for mobile voice use
- ClaudeClaw Rebuild Prompt — Groq Whisper STT + ElevenLabs TTS pipeline, OGA→OGG workaround