Launch promo · 50% off · until June 1, 2026

Talk to your computer,
don't type.

Voice to text for Mac with smart cleanup, live translation, meeting summaries, and voice notes. No cloud. No subscription. Your voice never leaves your Mac.

Download Free

100 free dictations · 50 notes · 10 meeting recordings · $49 $24.50 with VEXT50

or install via brew install muvon/tap/vext

macOS 14+ · Apple Silicon · No account required

How it works

Three steps. No setup. No account.

1

Hold the hotkey

Press and hold your configured shortcut. Vext starts listening instantly.

2

Speak naturally

Talk at normal speed in any app. Your voice is transcribed in real time, locally.

3

Release to paste

Let go. Text appears at your cursor — wherever you are. No copy-paste needed.

Three modes. One app.

Dictate, record meetings, or capture quick notes — all processed locally on your Mac.

Dictation

Hold a hotkey, speak, release — text appears at your cursor. Works in any app, any text field.

Meetings

Record with speaker labels, get full transcripts, screenshots, and AI-generated summaries.

Notes

Quick voice remarks — transcribed, cleaned up, and stored locally in the app for later.

Speak messy.
Get clean text.

Vext automatically cleans up your speech — removes filler words, fixes structure, keeps your intent intact. Same language, same meaning, just polished. Works alongside translation in a single pass.

  • Removes filler words (um, uh, like, so)
  • Restructures for clarity without changing meaning
  • Preserves your original language and tone
  • Combines with translation in one step
What you said

"so um I was thinking we should uh probably move the deadline to like next Friday because ah the team needs more time to finish the uh the integration tests"

What gets pasted

"We should move the deadline to next Friday — the team needs more time to finish the integration tests."

Your voice never leaves your Mac.

No audio uploaded. No cloud transcription. No account. This isn't a privacy policy — it's architecture.

100% Local

Whisper runs directly on your Apple Silicon GPU. All processing stays on-device.

No Internet Required

Works offline, on an airplane, behind a firewall. No connection needed, ever.

Zero Data Collection

No analytics. No telemetry. No accounts. We never see your audio or transcriptions.

Try free. Pay once. Use forever.

100 free dictations, 50 notes, and 10 meeting recordings to start. Then one price, unlimited use.

Best Value

Vext

$49 $24.50 once

Launch promo · code VEXT50 auto-applied · until June 1, 2026

  • Runs locally on-device
  • No account required
  • Meeting transcription
  • Lifetime access
  • Free updates within version
Buy for $24.50

Cloud voice tools

$10–30 /month

$120–360/year

  • Cloud-dependent
  • Account required
  • Usage caps
  • Subscription only
  • Privacy trade-offs

Unlock from within the app when you're ready. Free updates included. Major new versions at 50% off for existing owners.

How Vext compares.

Feature-by-feature against the leading voice and meeting tools.

Vext $49 $24.50 onceWispr Flow $12–15/moGranola $14–35/moOtter.ai $8–17/mo
Dictation (paste at cursor)
Meeting transcription
Voice notes
Speaker labels
Cross-meeting voice recognitionN/A
AI text cleanup
Meeting summaries
Live translation
Screenshot capture (any mode)
Screenshots auto-paste to AI
YOLO mode (auto-submit)
100% local / private
Works offline
No bot joins your callN/A
Cost after 2 years$24.50$288–360$336–840$200–408

Competitor data sourced from public websites as of April 2026. Features and pricing may change.

Transcription with zero wait.

Vext transcribes locally in real-time — no upload, no server, no spinner. Your words are already there.

150x realtime

60 seconds of audio transcribed in ~400ms. On-device.

Parakeet local
150x
Apple local
25x
Gemini cloud
23x
OpenAI cloud
22x
AssemblyAI cloud
20x
Alex 00:12

Let's review the Q3 roadmap and figure out priorities.

Sarah 00:28

I think we should focus on the API redesign first. It's blocking three other teams.

Alex 00:45

Agreed. Can we have a draft by end of next week?

Meeting transcription.
And the summary.

Record any meeting — Zoom, Google Meet, FaceTime, or in-person — and get a full transcript with speaker identification. Turn on Summarize to extract key points and action items. Both versions are always saved.

  • Timestamps and per-speaker breakdowns
  • System audio + microphone capture
  • AI-powered key points and action items
  • Raw transcript always preserved

Label speakers once.
Recognized forever.

Vext detects every distinct voice in your meeting automatically. Name them once — and from your next call onward, the same person is identified, labeled, and color-coded without lifting a finger.

  • Automatic speaker detection in every recording
  • Label with custom names — saved to your library
  • Same voice auto-labeled in future meetings
  • Color-coded chips for fast transcript scanning
Meeting #1 Speakers
Them Sarah
Me John
Speaker 1 Jack
Meeting #2 Auto-labeled
Sarah
John
Jack

Voice + vision,
hands-free.

Capture any region of your screen during hands-free dictation. The screenshot pastes alongside your transcribed prompt — straight into Claude Code, Cursor, or any AI tool. Fully hands-free coding.

  • Drag to capture during hands-free dictation or meeting recording
  • Screenshot auto-pastes with your transcript — Claude Code, Cursor, ChatGPT
  • Combine voice + image without touching the keyboard
Vext app screenshot showing voice transcription interface
2 min ago

Check the API rate limits before we push the new integration. Sarah mentioned the sandbox has different thresholds than production.

18 min ago

The onboarding flow needs a skip option on the second screen. Users are dropping off because they think the setup is mandatory.

1 hour ago

Try SwiftUI NavigationSplitView for the sidebar instead of the custom implementation. It handles state restoration automatically.

Capture a thought
before it's gone.

Press a key, say what's on your mind, and move on. Vext transcribes, cleans up, and stores your note locally — ready when you need it.

  • Same Enhance and translation pipeline as dictation
  • All notes stored locally in the app
  • No app switching — works from anywhere on your Mac

Speak one language.
Type another.

Talk in English, get the text in Russian. Or Spanish. Or Japanese. Vext translates your speech in real time as it transcribes — so the text that lands at your cursor is already in the language you need.

  • Real-time speech-to-translation
  • 99+ target languages
  • Works with any source language
  • Same hotkey workflow — just set your target language
English

"Let's schedule a meeting for next Tuesday to discuss the project roadmap and assign tasks to the team."

Russian

"Давайте назначим встречу на следующий вторник, чтобы обсудить план проекта и распределить задачи в команде."

Works everywhere you type.

Vext is a system-level service. It works in any text field, in any app — browsers, editors, terminals, email, chat.

AI Tools

Claude CodeChatGPTClaude.aiCursorCodex

Browsers

SafariChromeFirefoxArc

Editors

VS CodeXcodeSublime TextVim

Terminals

TerminaliTerm2WarpGhostty

Communication

SlackDiscordTelegramMessages

Productivity

NotionObsidianNotesGmail

Go hands-free.

Press a key once to start dictation. Press again to stop. No holding required — just talk as long as you need. Perfect for longer passages or when your hands are busy.

Standard Hold key → speak → release
Hands-free Press key → speak freely → press key

Capture screenshots mid-dictation — they auto-paste with your transcript. See how →

Press once
Speaking freely...
Press again

YOLO Mode.

Turn it on and Vext automatically presses Return after pasting your transcription. Speak, release, and your prompt is already running.

Stop editing. Stop polishing. Just talk. LLMs know what you mean, even when your words aren't perfect.

YOLO Mode
Speak
Review transcript
Edit / fix mistakes
Press Return
Done

Audio ducking.

When you start recording, Vext automatically fades your system audio so your voice comes through clearly. Release the hotkey and volume returns to normal. No manual adjustment needed.

Playing music
Recording
Resumed

Choose your engines.

Pick the speech and AI models that fit your workflow. Run locally by default, or bring your own API key.

Speech-to-Text

ModelTypeSpeedSize
Apple Dictation Built-in macOS speech recognition. Zero download.LocalRealtimeBuilt-in
OpenAI-compatible Any OpenAI-compatible STT endpoint. Bring your own API key.APIVaries

AI Processing Enhance · Translate · Summarize

ModelTypeSize
Gemma 3 1B Ultra-lightweight. Fastest local option, lower accuracy.Local~1 GB
Qwen 3 4B Strong multilingual support. Good for translation tasks.Local~2.5 GB
LLaMA 3.2 3B Meta LLaMA. Strong general-purpose performance.Local~2.4 GB
Phi-3.5 Mini Microsoft Phi-3.5. Compact, strong reasoning.Local~2.8 GB
OpenAI-compatible Any OpenAI-compatible API — GPT, Claude, Gemini, or self-hosted.API
Affiliate program

Earn 40% sharing Vext.

Newsletter writer, YouTuber, indie hacker? We pay 40% on every sale you bring — automatic monthly payouts, 60-day cookie window, no caps.

Learn more

Built by Muvon.

Vext is built by Muvon Un Limited, a small product studio. We build tools we use every day — designed to work locally, respect your privacy, and get out of your way.

Questions & answers.