I wanted a voice-to-text app but didn't trust any of the proprietary ones with my privacy.
So I decided to see if I could vibe code one with zero macOS app or Swift experience.
The app uses a local binary of whisper.cpp (a fast C/C++ implementation of OpenAI's Whisper speech-to-text model), so audio never leaves the machine.
GitHub: https://github.com/richardwu/openwhisper
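
For anyone curious what "uses a local binary" can look like in Swift, here is a minimal sketch. It is not taken from the repo: the helper names and all paths are hypothetical, and it assumes a whisper.cpp CLI build that accepts -m (model file), -f (input audio, e.g. a 16 kHz WAV) and -nt (no timestamps).

```swift
import Foundation

// Hypothetical helper (not from the repo): builds the CLI argument list.
// -m selects the model file, -f the input audio, -nt drops timestamps.
func whisperArguments(modelPath: String, audioPath: String) -> [String] {
    return ["-m", modelPath, "-f", audioPath, "-nt"]
}

enum TranscriptionError: Error {
    case nonZeroExit(Int32)
}

// Runs the local binary and returns whatever it printed to stdout.
// binaryPath is an assumption: wherever your whisper.cpp build lives.
func transcribe(binaryPath: String, modelPath: String, audioPath: String) throws -> String {
    let process = Process()
    process.executableURL = URL(fileURLWithPath: binaryPath)
    process.arguments = whisperArguments(modelPath: modelPath, audioPath: audioPath)

    let stdout = Pipe()
    process.standardOutput = stdout
    // Diagnostics on stderr are discarded in this sketch.
    process.standardError = FileHandle.nullDevice

    try process.run()
    // Read before waiting so a full pipe buffer cannot block the child.
    let data = stdout.fileHandleForReading.readDataToEndOfFile()
    process.waitUntilExit()

    guard process.terminationStatus == 0 else {
        throw TranscriptionError.nonZeroExit(process.terminationStatus)
    }
    return String(decoding: data, as: UTF8.self)
        .trimmingCharacters(in: .whitespacesAndNewlines)
}
```

Calling it would look something like `try transcribe(binaryPath: "/path/to/whisper-cli", modelPath: "/path/to/ggml-base.en.bin", audioPath: "/path/to/recording.wav")`, with all three paths being placeholders.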
I also decided to take this as an opportunity to compare 3 agentic coding harnesses:
Cursor w/ Opus 4.6:
- Best one-shot UI by far
- Didn't get permissions right
- Had issues with the "Cancel recording" hotkey staying enabled all the time
Claude Code w/ Opus 4.6:
- Fewest turns to get main functionality right (recording, hotkeys, permissions)
- Was able to get a decent UI with a few more turns
Codex App w/ Codex 5.3 Extra-High:
- Worst one-shot UI
- None of the functionality worked without multiple subsequent prompts