TrainScription — Local AI Transcription. No Bot. No Cloud. Any App.

By the numbers

0 Cloud calls during transcription

2 Audio channels captured simultaneously

$9.99 Pro unlock — once, forever

5s Audio chunk window — volatile, never written to disk

How it works

Browser Tab or Desktop App Audio & Your Microphone two independent channels · any app making sound

↓

Local Web Audio Graph silence-gated · processed in browser memory

↓

Whisper AI — on your device WebAssembly · no network · no API key

↓

Phonetic Brain Corrections applied before you see the output

↓

Live Transcript · Studio View speaker-labeled · timestamped · stored locally

↓

Export · .txt · .srt your file · your machine · your call

01

Open your meeting — in a browser tab or a desktop app

TrainScription captures audio from your machine's audio pipeline — not from the meeting's participant list. Browser Tab mode works with Google Meet, Teams (web), Zoom (web), Webex, Discord, and any other browser-based call. Full Desktop mode captures all system audio — Teams desktop, Zoom, Discord app, or anything making sound on your machine. Free tier sessions are subject to the 15-minute cap. No third-party participant joins the call in either mode.

02

Click Begin Capture and choose your mode

Select Browser Tab to capture a meeting in Chrome, or Full Desktop to capture any app making sound on your machine. The extension captures two audio streams: your microphone and the meeting audio. Both feed into a local Web Audio graph. A silence gate drops empty chunks before Whisper ever sees them. The AI runs entirely in your browser via WebAssembly — no network requests during transcription, no server, no API key.

03

Watch the transcript appear in real time

Every 5 seconds, a new chunk of speech is transcribed and appended — speaker-labeled with your configured channel names and a precise timestamp. Open the full Studio view to see the live transcript, or just leave the popup open in the background. Close the popup at any time — transcription continues.

04

Highlight misfires to train the Brain

See a word the AI got wrong? Highlight it in the transcript. A popover appears instantly — assign it to the correct spelling in one click. That phonetic mapping is now permanent. Every future transcript corrects that word automatically, silently, before you ever see it. The transcript stays visible after you end a session — no need to dig through Recovery. Finish your corrections, then clear when you're done.

05

Sessions are always recoverable

Long sessions are automatically segmented into sealed chunks. All segments from a single Begin→End session are grouped together in Recovery. Export any segment or the full session as a clean .txt or .srt subtitle file. Review any past segment in the live transcript view to catch and correct misfires after the fact. Rename sessions with custom labels anytime. When you're ready, you bring the AI you already trust — one click from Recovery sends the transcript on your terms.

06

Already have a recording? Skip live capture entirely

Audio & Video File Import (Pro) runs an existing audio or video file you already have — a meeting export, a webinar, anything — through the exact same on-device Whisper pipeline. Your file is read locally in your browser and processed entirely on your device, never uploaded. The Phonetic Brain corrects it automatically, the same as a live session. And whenever you want everything off the machine at once — a new computer, a reinstall, just your own copy — Bulk Export, Backup & Restore packages every session and your full Brain into one file, and brings it all back with one click.

Built for

Legal, Finance & Consulting

Professionals handling sensitive conversations where routing audio through third-party cloud services is simply not an option.

Researchers & Interviewers

Capture browser-based interviews without routing audio through third-party cloud transcription systems. Locally stored, fully exportable.

Heavy Meeting Users

Stop paying monthly transcription subscriptions. One payment covers unlimited sessions — no per-seat, no per-minute billing ever again.

Specialized Vocabulary Workflows

Industries with difficult names, product codes, medication names, case names, or campaign terminology — where generic AI always gets it wrong.

Pre-train vocabulary on your phone with BrainTrainer ↗

The differentiator

The problem every AI transcript has

proper names product names case names medication names campaign terms company jargon character names niche terminology

AI transcription tools consistently fail on specialized vocabulary — in predictable, repeatable patterns. The Phonetic Brain is the permanent fix. Train it once on a word; it corrects that word on every future transcript automatically, before you ever see the output.

The Phonetic Brain

Whisper — the AI model at the core of TrainScription — is remarkable with general language and terrible with your language. Proper names, product names, case names, medication names, character names, niche jargon: all mangled, all the time, in predictable patterns.

The Phonetic Brain is a local correction layer you build yourself. Train it once on a word; it fixes that word on every future transcript, automatically, before you ever see the output. It compounds — the more you use it, the more accurate your transcripts become.

01 · FASTEST

Highlight in the live transcript

Select any misheard word during a live session or while reviewing a past segment. A popover appears — assign it to the correct spelling in under three seconds. No separate screen.

02 · MANUAL

Type it in the Brain panel

Already know how Whisper is mangling a term? Use the Manual Add form — correct spelling plus the variants Whisper produces. Instant.

03 · IMPORT

Restore from a .train.json backup

Share a specialized vocabulary set across installs, or bring your Brain to a new machine. The file is plain JSON — human-readable and editable outside the extension.

04 · REMOTE

Train from your phone — before the meeting starts

Know a tricky word is coming? The free BrainTrainer companion runs on Android and iOS. Speak the word a few times, pick the mishears, export a .train.json, and import it into TrainScription Pro. Your Brain knows the word before the first transcript line appears.

05 · MANAGE

Remove variants or whole terms anytime

Made a wrong assignment? Click the Brain panel, expand any term, and remove individual variants — or delete the whole term — with a single tap. No JSON editing required.

Train before the meeting. Refine during the meeting. Keep improving after the meeting.

Everything included

No Meeting Bot Required

TrainScription works from browser-tab audio or system audio directly. No third-party participant joins the call. No external transcription service relays your audio through cloud infrastructure.

Full Desktop Mode

Capture any app making sound on your machine — Teams desktop, Zoom, Discord, Slack, media players, anything. Not just browser tabs. Select your screen from the OS picker and TrainScription captures the full dual-channel stream. Brain corrections apply the same way as in Browser Tab mode.

Audio & Video File Import

Already have a recording? Import an existing audio or video file — a meeting export, a webinar, anything you already have — and run it through the same on-device Whisper pipeline used for live sessions. Read and processed entirely on your device, never uploaded. The Phonetic Brain applies automatically, exactly as it does live. Pro feature.

Bulk Export, Backup & Restore

Export every stored session at once as a single .zip — available on Free and Pro. Pro adds Backup Everything: a complete backup of every session and your full Phonetic Brain in one file, and Restore from Backup to bring it all back on a fresh install or a new machine.

Dual-Channel, Speaker-Labeled

Your microphone and the meeting audio captured on two independent audio pipelines — not merged into one. Overlapping speech on both channels is preserved, not dropped. Each line labeled with your configured channel names and a precise timestamp.

Phonetic Brain

Permanently fixes the words AI always gets wrong. Train by highlighting in the transcript. Corrections apply to every future session automatically — and every word the Brain corrects is highlighted live in the transcript so you can see the engine working in real time. Made a wrong correction? One-shot Undo appears in the same toast before the fix sticks.

Channel Presets

Six built-in label presets for common use cases: Default, Corporate, Legal, Interview, Medical, RPG. Fully customizable per session from the popup.

Pause Controls

Pause your mic channel, the meeting audio, or everything — without ending the session. Critical for when you mute yourself in the call but don't want your in-room conversation transcribed.

Export to .txt and .srt

Download any session or individual segment as a clean .txt file or an .srt subtitle file. Pro users get one-click assembled export of the full session — all segments merged, formatted, ready to use.

Session Recovery

Sessions are automatically segmented and stored. Rename sessions with custom labels. Review any past segment in the live transcript view to find and correct misfires. Download individual segments as .txt or .srt anytime — or use Pro's one-click assembled session export. Optionally set Recovery to auto-purge sessions past 30, 60, or 90 days — and lock any session you want kept forever, exempt from the sweep regardless of age.

Session Tags

Tag any Recovery session with your own labels and filter the session list with one click. All sessions marked "Client Calls" or "Weekly Sync" surface instantly — no scrolling through timestamps. Bulk export respects the active tag filter, so you can export just the sessions in a category as a single .zip. Ungated — Free and Pro alike.

Silence Auto-Stop

Automatically stops transcription after a configurable period of sustained silence. Never run indefinitely again. A warning fires 1 minute before the threshold with a Keep Transcribing option. Transcript always saved to Recovery on auto-stop.

Brain Management

Made a wrong assignment? Expand any Brain term and remove individual variants with a single click — or delete the whole term. No config files, no JSON editing. Your vocabulary stays exactly right.

AI Summary Handoff

Your transcript stays local. When you're ready, you choose the AI. Configure your preferred platform once in Settings — it's off by default, and nothing is ever sent automatically. Choose from three preset styles (Executive Summary, Action Items, Full Notes), modify any preset as a starting point, or write your own one-off custom prompt. One click from Recovery copies your instructions and transcript to your clipboard and opens the platform ready to receive them. TrainScription gives you the clean local text. You bring the AI you already trust.

Pricing

Free

$0

No credit card. No account required.

15-minute session cap
3 Recovery sessions stored
3 Phonetic Brain slots
Real-time dual-channel transcript (Browser Tab + Full Desktop)
Pause controls
Export to .txt and .srt after each session
Session naming in Recovery
Session Tags — tag and filter Recovery sessions
Silence Auto-Stop (5 & 10 min)
AI Summary Handoff
Bulk Export — all sessions as one .zip

Pro

$9.99 _once

One payment. No subscription. Works across devices.

Unlimited session length
Unlimited Recovery sessions
Unlimited Brain slots
Real-time dual-channel transcript (Browser Tab + Full Desktop)
One-click assembled session export
Import full Brain vocabulary files
Silence Auto-Stop — all options incl. Indefinite
AI Summary Handoff — full multi-segment session
Audio & Video File Import
Backup Everything & Restore from Backup
Cross-device via email login
Everything in Free, forever

How it compares

	TrainScription	Otter.ai	Fireflies	Rev
No bot joins the call	✓ Never	✗	✗	✗
Audio stays on your device	✓ Always	✗	✗	✗
Works fully offline	✓	✗	✗	✗
Fixes jargon / names	✓ Phonetic Brain	Paid only	✗	✗
Pre-session vocabulary training	✓ BrainTrainer	✗	✗	✗
Corrections visible in transcript	✓ Live highlights	✗	✗	✗
Works with desktop apps (Teams, Zoom, Discord)	✓ Full Desktop mode	✗	✗	✗
Import an existing recording	✓ Same on-device pipeline	Cloud upload required	Cloud upload required	Cloud upload required
Back up & restore your full history	✓ One file, any machine	✗	✗	✗
Price	Free / $9.99 once	$16.99/mo	$10/mo	$9.99/mo
Works in any browser tab	✓	Partial	Partial	✗

Why it exists

Most transcription tools route sensitive conversations through external cloud infrastructure, inject third-party participants into your calls, and charge you monthly for the privilege. The costs compound — financially, architecturally, and in trust.

TrainScription was built around a different idea: transcription should happen locally, under your control, on your own machine — regardless of how long the session runs or how sensitive the content is. The constraint of doing it entirely on-device isn't a limitation. It's the whole point.

Under the hood

TrainScription is a Manifest V3 Chrome Extension running OpenAI Whisper via WebAssembly locally in your browser. The AI model is downloaded once from the model provider and cached — transcription itself occurs entirely on-device after that initial download. TrainScription does not operate server-side infrastructure and audio is processed locally on your device and is not uploaded for cloud transcription.

Audio is processed in 5-second volatile chunks that exist only in RAM. They are never written to disk. Audio is not uploaded for cloud processing. The extension's data policy is architecture, not a promise — the extension contains no code to route your audio externally.

The only external connections in this extension: a one-time AI model download from the model provider, and ExtPay license verification for Pro users. Both are unrelated to your content — your audio, transcripts, and Brain vocabulary are never involved in either connection. TrainScription does not execute remote code or load external scripts that alter extension functionality after installation.

The Phonetic Brain correction function runs synchronously on every transcript line before it's written to storage. Your vocabulary corrections apply before you see the output.

Audio & Video File Import runs Whisper inference inside a dedicated background Worker, separate from the page you're looking at — so processing a long file never freezes or slows down the browser tab. Bulk Export, Backup, and Restore are pure local file operations: reading what's already stored on your device, packaging it, or unpacking it back — no inference, no model, no network connection of any kind, in either direction.

Privacy & data

✓ Audio is processed locally using WebAssembly-based Whisper inference running in your browser. Audio chunks exist temporarily in RAM during transcription and are discarded immediately after processing. They are never written to disk.

✓ Audio is not transmitted to external servers. TrainScription does not operate cloud infrastructure and does not upload your audio for remote processing. Audio is processed on your device and is not routed externally.

✓ Transcripts are stored locally in your browser's extension storage on your machine. They are never synchronized, indexed, or accessible outside your device unless you export them yourself.

✓ Brain vocabulary corrections are stored locally in browser extension storage. Exports are user-controlled .train.json files that live wherever you save them.

✓ No account is required for the free version. No email, no registration, no profile.

✓ No analytics, no tracking, no behavioral data collection. TrainScription does not use tracking pixels, session recorders, or any analytics tooling.

✓ The extension works fully offline after the AI model has been downloaded. No internet connection is required for transcription.

✓ Imported audio and video files are read locally in your browser and processed entirely on your device, exactly like a live session. Never uploaded anywhere, at any point.

✓ Bulk export, backup, and restore are entirely local operations. Building an export file, packaging a backup, and restoring from one all happen in your browser, on your device. No network request is made in either direction.

External connections, fully disclosed: The AI model is downloaded once from the model provider (not from TrainScription) and cached locally. Pro license verification is handled by ExtPay — payment processing only. AI Summary Handoff — an optional, user-initiated feature that is off by default. When you explicitly request a summary, your transcript is sent directly from your browser to the AI platform you have configured and trust. TrainScription does not see or relay this content. Your audio and Brain data are not involved in any external connection.

Why TrainScription requests permissions

Microphone Used to capture your local microphone channel during transcription sessions. Only active when you initiate a capture session.

Tab Audio Used to capture the meeting audio from the active browser tab during transcription (Browser Tab mode). Only active when you initiate a capture session.

Desktop Audio Used to capture system audio in Full Desktop mode. Requires you to actively select your screen source from an OS picker and enable system audio sharing. Only active when a Full Desktop session is started.

Storage Used to save transcripts, Brain vocabulary corrections, and your settings locally on your device. Nothing is stored outside your browser's local extension storage.

Unlimited Storage Lets your local extension storage grow beyond Chrome's default quota, since saved transcripts, Recovery sessions, and Brain data can accumulate over time. Affects only how much can be stored on your own device — does not enable any remote or cloud storage.

Offscreen Document Runs the on-device Whisper transcription engine and Browser Tab audio capture in a hidden, persistent document with no visible interface — never shown to you, displays no content. Exists so transcription keeps running while you do other things in Chrome.

Downloads Used only when you explicitly export a transcript or Brain backup as a file. Not used at any other time.

Notifications Shows a single desktop notification when Silence Auto-Stop ends a session, so you know without needing the Studio tab open. Never used for marketing or promotional alerts.

Alarms Schedules the free-tier 15-minute session cap and Silence Auto-Stop reliably — ordinary browser timers don't survive a service worker restart. Not used to track or schedule anything else.

Tabs Opens a new browser tab to your chosen AI platform only when you explicitly request a summary via AI Summary Handoff. Not used at any other time.

TrainScription does not use permissions for advertising, tracking, behavioral profiling, or any purpose beyond the transcription workflow described above.

Say Something

Using the beta? Got a use case I haven't thought of? Found something broken? Leave a note — Terrance reads every one.