How-to guide
How to transcribe a meeting locally on Mac in 2026
A founder-built walkthrough for capturing meeting audio on-device, with no notetaker bot sitting in the participant list and no recording shipped off to someone else's cloud.
Install Whisply, grant Microphone and Screen Recording permissions, press Cmd+Return to summon the overlay, and let it transcribe the call on your Mac in real time.
- Whisply runs on-device on macOS 13 Ventura or later, Apple Silicon and Intel, no notetaker bot ever joins the call.
- The menu-bar overlay is summoned with Cmd+Return and stays out of screen sharing and screen recording by default.
- Works with Zoom, Google Meet, Microsoft Teams, Webex, Slack huddles, FaceTime, and any audio that reaches your Mac speakers.
Whisply is system-level hidden from screen capture, so when the other person on Zoom shares your screen back to you for confirmation, the overlay and the live transcript pane do not appear in their view or in the cloud recording.
Why the usual approaches fall short
- Notetaker bots like Otter, Fireflies, and Read AI join the call as a participant. Everyone in the room can see them, the conversation stiffens, and the recording lives in a vendor's cloud you do not control.
- Post-call AI summarizers only act after the meeting ends. You finish the conversation, get a tidy paragraph hours later, and the moment you actually needed the right number or the right phrasing is already gone.
- Generic cloud transcription services upload your audio to a third-party server for processing. The other person's voice ends up in a system they never agreed to, which is a real legal liability in two-party consent states and most of Europe.
- Browser extensions that scrape Zoom or Meet break every time the meeting platform ships a UI change, and they only work in that one tab. Switch to the native Zoom client or join a Teams call and the extension is dead.
- Built-in Zoom or Teams transcription only runs if the host enables it, sends the audio through the vendor's cloud, and is visible to every participant. It is useful for the host, not for you, and it is not actually local.
Step by step
- 1
Install Whisply and open it from Applications
Download the Mac app from whisply.net/download, drag it to the Applications folder, and open it. macOS will ask you to confirm an app downloaded from the internet the first time. The icon lands in your menu bar at the top right of the screen, next to Wi-Fi and Control Center, not in the Dock.
- 2
Grant Microphone and Screen Recording in System Settings
Open System Settings, Privacy and Security. Toggle Whisply on under Microphone, then under Screen Recording. The Screen Recording permission is what lets Whisply capture system audio (the other person's voice through your speakers) using the modern macOS audio APIs. You will be asked to quit and reopen Whisply once after granting Screen Recording so the new permission takes effect.
- 3
Join the meeting in your usual app
Open Zoom, Google Meet in Chrome or Safari, Microsoft Teams, Webex, Slack huddles, or FaceTime, and join the call exactly the way you normally would. Whisply does not replace your meeting client and does not need to be configured per-platform. It captures whatever audio reaches your Mac, so any app that plays through your speakers and uses your microphone works.
- 4
Press Cmd+Return to summon the overlay
The default global hotkey is Cmd+Return. Press it from anywhere, including inside Zoom's full-screen mode, and the Whisply overlay appears. Click Start meeting (or press the hotkey again with the overlay focused) to begin capturing. The transcript pane fills in line by line as people speak, with speaker turns separated automatically when Whisply can tell two voices apart.
- 5
Use the live transcript and ask follow-up questions
Scroll the transcript pane to review what was said in the last few minutes without interrupting the call. Type a question into the same overlay (What number did they just quote? Summarize the last five minutes for a colleague joining now.) and Whisply answers from the running transcript. The overlay stays hidden from screen sharing and screen recording, so the other person sees only your camera and your shared window if you are sharing one.
- 6
Export the transcript or close without saving
When the meeting ends, click End meeting in the overlay. You can export the full transcript as plain text, Markdown, or a shareable note link with optional timestamps, all from the overlay. If you do not export, the session is discarded and nothing is written to disk. The choice is per meeting and there is no background setting that quietly archives every call.
What local transcription on a Mac actually means
Local transcription means the audio never leaves your machine in a form anyone else can replay. The microphone signal and the system audio coming out of your speakers are captured by an app running under your user account, turned into text on-device, and held in memory or a file you control. No bot dials into the meeting. No third-party participant appears in the roster. No vendor stores a .wav of your colleague's voice in a bucket you cannot inspect.
That distinction matters more than it used to. A notetaker bot that joins your Zoom is recording every person on the call, often without explicit consent from any of them, and the resulting transcript lives wherever that vendor decides. Local transcription flips the model. You are recording your own side of the conversation, on your own hardware, for your own use. The other people on the call are not being archived in someone else's database because of a tool you chose to use.
On a Mac specifically, local transcription has a few moving parts: a way to capture microphone input, a way to capture system audio (the voices coming through your speakers), a transcription engine, and a UI that shows you the text in real time. Whisply handles all four inside a single menu-bar app.
Why a menu-bar overlay beats a notetaker bot
A bot in the participant list changes the room. People stop thinking out loud. They watch their words. The candor that makes a meeting worth having quietly drains away, and the recording you wanted captures a more guarded, less honest version of the conversation. The tool meant to remember the meeting is the reason the meeting got worse.
A menu-bar overlay does not change the room because the room never sees it. Whisply lives in your menu bar, summoned by Cmd+Return. The other side of the Zoom call sees you, your camera, your background. They do not see a Notetaker, a Recording Bot, or any indicator that a transcript is being generated on your end. The conversation stays the conversation it would have been without any tooling at all.
The overlay is also private to you in a stronger sense. Because Whisply uses system-level content protection, it stays out of the macOS screen-capture path by default. If the other person asks you to share your screen, the transcript pane does not show up in their feed. If you record the meeting yourself with QuickTime or with the host's built-in recorder, the overlay is not in the file.
Capturing system audio on macOS without a virtual cable
The historical pain of transcribing a Mac meeting was getting at the other person's voice. macOS does not, by default, let arbitrary apps tap the audio coming out of your speakers. The old workaround was a virtual audio device, BlackHole or Loopback, routed through a multi-output device in Audio MIDI Setup, then a careful prayer that the levels stayed sane. It worked, but it was fragile and it broke every time macOS shipped a point release.
Whisply uses the modern macOS audio-capture APIs that arrived with the Screen Recording permission. Once you grant Screen Recording in System Settings, Privacy and Security, Screen Recording, the app can read the audio mix the same way QuickTime does for a screen recording. No virtual driver. No multi-output device to remember to switch back. Your meeting audio and your microphone are merged into the transcript stream automatically.
That is also why the Screen Recording prompt appears the first time you run a meeting through Whisply. It is the macOS gate for system audio, not a sign that anything is being uploaded. The capture stays inside the Whisply process and is turned into text on your Mac.
What the live transcript is good for during the call
A transcript you read tomorrow is a record. A transcript you can glance at during the call is a tool. The two things look the same on disk and feel completely different in use. Whisply puts the live text in a small pane you can resize or hide with a keystroke, so you can scan the last thirty seconds without looking away from the camera.
That changes the kinds of meetings you can show up to. You can join a call cold, miss the first two minutes because your previous meeting ran over, and scroll back through what was said while the conversation continues. You can pull a number out of a sentence the client just spoke and quote it back without asking them to repeat themselves. You can ask Whisply to summarize the last five minutes for a colleague who joined late, in plain language, without leaving the call.
Because the same overlay also handles the AI assist (suggested replies, instant lookups, Computer Use on Pro Undetected), the transcript is not a dead-end artifact. It is the input the assistant is already reading. Ask it a question about what the other person said, and it answers from the transcript it has been building the whole time.
When you want the file, and when you do not
For some meetings, the transcript is throwaway. You wanted real-time help and a clean exit, no file to manage afterward. Whisply respects that. By default, transcripts live in the session and you can close the window without saving anything to disk.
For other meetings, the file is the whole point. A user research interview you need to quote. A board call you need to share with co-founders. A legal conversation where the exact phrasing matters. From the overlay, you can export the full transcript to plain text, Markdown, or a shareable note link, with timestamps if you want them. The export happens on your Mac and the file goes wherever you put it.
The decision stays yours, per meeting. There is no setting buried three menus deep that quietly turns on cloud archival of every call you ever take. If a file exists, you created it on purpose.
Related questions
Does the other person on the Zoom call know I am transcribing?
No. Whisply does not join the meeting as a participant, does not appear in the roster, and does not send any indicator to the meeting platform. The other side sees your camera and your name, exactly the same as a call with no tooling at all. The overlay also stays out of screen sharing by default, so if you share your screen mid-call, the transcript pane does not appear in their feed.
Is the audio actually processed on my Mac, or is it sent somewhere?
The capture happens on your Mac through the standard macOS audio and Screen Recording APIs. Whisply uses on-device models for the live transcript and assistant features so you can get real-time help without round-tripping your meeting audio through a public cloud. Models ship with the app, so you do not need to bring your own API key and you do not need to enable a separate cloud setting to use the transcript.
Will Whisply work if the host has disabled recording in Zoom or Teams?
Yes. Whisply does not use the meeting platform's recording feature. It captures the audio that reaches your Mac speakers and microphone, which is governed by macOS permissions, not by the host's settings inside Zoom or Teams. Whether the host has enabled or disabled their own cloud recording has no effect on your local transcript.
Does it transcribe both my voice and the other person's voice?
Yes. Once you grant Microphone and Screen Recording permissions, Whisply captures both your mic input and the system audio coming out of your speakers, merges them into a single stream, and separates speaker turns when it can tell voices apart. Solo recordings (just your microphone, with no call playing) also work if you want to dictate notes.
Which macOS versions and Macs are supported?
macOS 13 Ventura or later, on both Apple Silicon (M1, M2, M3, M4) and Intel Macs. Apple Silicon machines run the on-device transcription noticeably faster because the models use the Neural Engine, but the experience on a recent Intel MacBook Pro is still real-time for normal meeting pace.
Can I use this in a meeting with a non-disclosure agreement?
That is a question for the NDA and the people who signed it, not for us. What Whisply gives you is a setup where no third-party vendor receives a copy of the conversation, no bot appears in the participant list, and no file is written to disk unless you export one. Many people find that a stronger privacy posture than the bots their own teams routinely allow into NDA calls. Confirm with your counterparty if you are unsure.
What does it cost to transcribe meetings with Whisply?
The Free tier includes core meeting-assist features with a daily message limit, which is enough for occasional calls. Pro is $19.99 per month monthly or $11.99 per month annual and removes the limit. Pro Undetected is $149.99 per month monthly or $44.99 per month annual and adds Computer Use mode plus armed proctor-resistant features. Models are included on every tier, so there is no separate API bill.
Try Whisply free.
Mac only. macOS 13 or later. No bot in your calls.