Skip to content

larryxiao/openquack

Repository files navigation

OpenQuack — Speak. Send. Privately. Recording overlay shows 'Listening' with a live level meter; menu-bar popover shows 'Pasted at cursor' with the last transcript and a Settings / Quit footer.

OpenQuack Speak. Send. Privately.

Voice dictation for macOS. Nothing leaves your device — audio, text, nothing.

License Platform Status

English · 简体中文 · 日本語 · 한국어 · Français · Español · Deutsch

🌐 Translations are machine-translated stubs. Native-speaker contributions very welcome — open a PR or see CONTRIBUTING.md.


📢 What's new — v2.0.0-alpha.13: In-app updates are live. Settings → General now has an Updates section with auto-check, a "Check now" button, last-checked timestamp, and an opt-in for pre-release builds. Hope you like it! (past releases)

What it is

OpenQuack is a tiny menu-bar app for macOS. Press a hotkey, speak, press it again — your transcript appears at the cursor. Wherever you can type, you can talk.

Speech recognition happens on your Mac. No cloud, no account, no signup, no telemetry.

Why

Local. Everything runs on your device — recording, transcription, optional polish. Nothing leaves: no audio, no text, no telemetry, no signup. Confidential work stays confidential, by construction. And because there's no API call in the loop, it just keeps working — offline, on a plane, behind a corporate firewall.

Fast, especially on long clips. Whisper streams while you speak, so a 5-minute dictation finishes in about 3 seconds after you stop — the wait doesn't grow with length. ~2.6% word-error rate on real human speech on a baseline M4 / 16 GB, ~6.3% in realistic office noise. Full bench matrix in docs/BENCHMARKS.md.

Quiet on resources. ~120 MB of RAM while idle. ~8 MB app bundle. The Whisper model lives on disk and only loads when you press the hotkey.

Open. MIT-licensed. Every line is auditable; every change happens in public. The version running in your menu bar is the version in this repo.

What you get

  • One-key dictation. Pick a hotkey (default ⌃⇧Space, or bind fn / Globe). Press once to start, press again to stop.
  • Auto-paste at the cursor in any app. Falls back to your clipboard if you'd rather paste yourself.
  • 99 languages. English, Chinese, Japanese, Korean, Spanish, French, German, Italian, and Portuguese are right in Settings; auto-detect on by default.
  • Smart formatting — capitalisation, end-punctuation, "um/uh" cleanup.
  • Custom dictionary — teach it the proper nouns and project names you actually use.
  • Auto-stop after silence. Finish speaking, OpenQuack wraps up on its own.
  • Launch at login — show up in the menu bar after every restart with one toggle.

Privacy, in one screen

  1. Nothing leaves your device — audio, text, nothing. Recording and transcription are fully local. Always.
  2. No analytics, no telemetry, no signup.

The full privacy contract is in docs/VISION.md.

Coming next

  • In-context transcription — OpenQuack reads the surrounding text before transcribing, so domain terms get disambiguated by what you're actually doing.
  • Thinking mode — an opt-in second pass through a small local LLM (Ollama or MLX-LM, your pick) that turns a raw spoken sentence into one you'd press send on.

Both deferred while the adoption foundations land. See docs/ROADMAP.md for what's queued; docs/VISION.md for where this is going overall.

Install

brew tap larryxiao/openquack https://github.com/larryxiao/openquack
brew install --cask openquack

Or download the DMG and drag into Applications. First launch: right-click → OpenOpen (one-time Gatekeeper bypass).

Grant Microphone when macOS asks, pick a hotkey in Settings → Shortcut (default ⌃⇧Space).

Want a guided walkthrough? See docs/TUTORIAL.md — five minutes from install to first dictation.

Or tell your AI agent

Paste this into Claude Code, Codex, opencode, Hermes, or similar:

Install OpenQuack on this Mac:

  brew tap larryxiao/openquack https://github.com/larryxiao/openquack
  brew install --cask openquack

(Or grab the DMG from https://github.com/larryxiao/openquack/releases
and drag it into /Applications; first open right-click → Open → Open.)

Then launch /Applications/OpenQuack.app, grant Microphone, and pick a
hotkey in Settings → Shortcut. Default ⌃⇧Space.

More options (uninstall, build-from-source, what's downloaded on first run): docs/INSTALL.md.

Got stuck? Want a feature?

Drop a comment in Discussions — it's the lowest-friction way to reach me. Bugs, feature ideas, "I'm using it for X" workflow stories, or quick questions about Whisper / model choice / paste behavior in a specific app all welcome. Issues are fine for structured reports too, but no need to format.

Common questions (install, accuracy, languages, offline behaviour, Mac requirements) live in the FAQ on the docs site.

Acknowledgements

OpenQuack stands on the shoulders of generous open-source work. Huge thanks to:

  • OpenAI Whisper — the speech model that makes any of this possible.
  • WhisperKit by Argmax — Whisper, fast and native on Apple Silicon.
  • KeyboardShortcuts by Sindre Sorhus — the hotkey machinery you press every day.
  • voxt — a kindred project we learned a lot from on the technical side.
  • Typeless and Wispr Flow — the closed-source apps that proved how delightful voice-first input can feel; we're aiming for the same feel, locally and openly.

And to everyone filing issues, opening PRs, and telling friends: thank you. The duck quacks because of you.

Contribute

OpenQuack is AI-native open source — every PR cites a SPEC, atomic tasks come from the roadmap, the workflow is friendly to coding agents at scale (and humans on the same path).

Start with AGENTS.md, pick a 🔵 task in docs/ROADMAP.md, open a draft PR.

Under the hood: TUTORIAL · DEVELOPMENT · ARCHITECTURE · BENCHMARKS · DESIGN · INSTALL · BLOG.

License

MIT — see LICENSE.

About

Voice dictation menu-bar app for macOS. Press a hotkey, speak, transcript appears at the cursor. Whisper runs locally via WhisperKit on Apple Silicon — no audio, text, or telemetry leaves your Mac. MIT, Swift.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Packages

 
 
 

Contributors