Pro + Media · £19/month

This tool requires the Media tier. The engine is already loaded — sign in and upgrade to run it on your audio.


Audio Transcription limits by plan

Free is enough for most one-off jobs. Pro raises the file size, duration, and batch caps; Pro + Media unlocks GB-scale streaming and unlimited duration.


Free

No signup needed

File size: 50 MB
Duration: 30 min
Files per batch: 1

Pro

£7/mo · 4× larger files

File size: 200 MB
Duration: 120 min
Files per batch: 10

Pro + Media

Stream multi-GB files

File size: 100 GB
Duration: Unlimited
Files per batch: 100

Larger files supported on Developer (5 GB CSV) and Enterprise (unlimited). All processing happens in your browser — files never reach a server.
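
As a rough illustration of how the caps above combine, the sketch below picks the smallest of the three listed plans that covers a given job. The names `PlanLimits`, `PLANS`, and `smallest_plan` are hypothetical, and decimal units (1 MB = 10^6 bytes) are an assumption; this is not JAD's actual enforcement logic.

```python
from dataclasses import dataclass
from typing import Optional

MB = 1000 ** 2  # assumption: decimal megabytes
GB = 1000 ** 3  # assumption: decimal gigabytes


@dataclass(frozen=True)
class PlanLimits:
    max_bytes: int
    max_minutes: Optional[float]  # None means unlimited duration
    max_batch: int


# Caps copied from the plan list above, ordered from cheapest to most expensive.
PLANS = {
    "Free": PlanLimits(50 * MB, 30, 1),
    "Pro": PlanLimits(200 * MB, 120, 10),
    "Pro + Media": PlanLimits(100 * GB, None, 100),
}


def smallest_plan(size_bytes: int, minutes: float, batch_size: int) -> Optional[str]:
    """Return the first plan whose caps cover the job, or None if none does."""
    for name, limits in PLANS.items():
        if size_bytes > limits.max_bytes:
            continue
        if limits.max_minutes is not None and minutes > limits.max_minutes:
            continue
        if batch_size > limits.max_batch:
            continue
        return name
    return None
```

For example, a single 150 MB, 90-minute recording exceeds the Free caps on both size and duration, so `smallest_plan(150 * MB, 90, 1)` resolves to Pro.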

How it Works

1. Drop your audio file (any common format)
2. Pick language and output format (plain text, SRT, VTT, or JSON segments)
3. Download the transcript
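
To make the output formats in step 2 concrete, here is a minimal sketch of how timed segments map to SRT text. The segment shape (`start`, `end`, `text`, with times in seconds) is an assumption for illustration, not the tool's documented JSON schema. SRT separates milliseconds with a comma; WebVTT uses a period, so the separator is a parameter.

```python
def format_timestamp(seconds: float, sep: str = ",") -> str:
    """Format seconds as HH:MM:SS<sep>mmm (sep="," for SRT, "." for VTT)."""
    ms_total = round(seconds * 1000)
    hours, rem = divmod(ms_total, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}{sep}{ms:03d}"


def to_srt(segments) -> str:
    """Render a list of {"start", "end", "text"} dicts (hypothetical shape) as SRT."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        # Each cue: sequence number, time range, text; cues are separated by a blank line.
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    return "\n".join(blocks)
```

A VTT writer would follow the same pattern with `sep="."` and a leading `WEBVTT` header line.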

Privacy Audit

0 bytes uploaded. Audio Transcription runs entirely in your browser using FFmpeg.wasm 8.1, RNNoise, and the Web Audio API. Your audio stays on your device at all times. No data is sent to any server — critical for sensitive interviews and confidential calls.

Frequently Asked Questions

Does my audio leave my computer?

No. Transcription runs on the paired runner using a local Whisper-class model. Audio bytes never reach JAD servers.

Which languages are supported?

100+ languages including English, Spanish, French, German, Mandarin, Japanese, Hindi, Portuguese, and Arabic. Set the language hint for best accuracy or leave on auto-detect.

Can I get word-level timestamps?

Yes — pick the JSON output format. Each segment includes start/end timestamps, and the model can emit per-word offsets for fine-grained alignment.

Related Audio Tools

Audio Trimmer / Cutter

Cut start/end timestamps from any audio file. JAD re-encodes once at the source quality — your file never leaves the browser.


Silence Stripper

Detect and cut silences below a threshold (default -40 dB, 0.5 s minimum) from any recording. Tighten interviews and solo podcasts in seconds.
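
The defaults above translate to simple arithmetic: -40 dB relative to full scale is an amplitude of 10^(-40/20) = 0.01, and a quiet stretch only counts as silence once it lasts at least 0.5 s. The sketch below shows one common way to detect such stretches with frame-wise RMS; the function names and the 10 ms frame size are assumptions, and this is not necessarily how the Silence Stripper is implemented.

```python
import math


def db_to_amplitude(db: float) -> float:
    """Convert a dBFS level to linear amplitude (full scale = 1.0)."""
    return 10 ** (db / 20)


def find_silences(samples, sample_rate, threshold_db=-40.0,
                  min_silence_s=0.5, frame_s=0.01):
    """Return (start, end) times in seconds of runs quieter than the threshold.

    samples: sequence of floats in [-1.0, 1.0]; frame_s is an assumed window size.
    """
    frame_len = max(1, round(sample_rate * frame_s))
    limit = db_to_amplitude(threshold_db)
    silences = []
    run_start = None
    n_frames = math.ceil(len(samples) / frame_len)
    for f in range(n_frames):
        frame = samples[f * frame_len:(f + 1) * frame_len]
        rms = math.sqrt(sum(x * x for x in frame) / len(frame))
        t = f * frame_len / sample_rate
        if rms < limit:
            if run_start is None:
                run_start = t  # a quiet run begins at this frame
        elif run_start is not None:
            if t - run_start >= min_silence_s:
                silences.append((run_start, t))
            run_start = None
    end = len(samples) / sample_rate
    # Close a quiet run that extends to the end of the recording.
    if run_start is not None and end - run_start >= min_silence_s:
        silences.append((run_start, end))
    return silences
```

Runs shorter than the minimum are ignored, which keeps natural pauses between words intact.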


Loudness Normalizer (EBU R128)

Two-pass EBU R128 loudness normalisation in your browser. Hit -16 LUFS for podcasts, -14 for Spotify/YouTube — true-peak limited, FFmpeg 8.1 engine, no upload.
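
For readers curious what a two-pass normalisation looks like in FFmpeg terms, the sketch below builds filter strings for FFmpeg's `loudnorm` filter: the first pass measures the file and prints JSON statistics, and the second pass feeds those measurements back in. Whether JAD uses `loudnorm` specifically is an assumption, as are the helper names and the -1.5 dBTP and LRA 11 defaults; only the LUFS targets come from the description above.

```python
def first_pass_filter(target_lufs: float, true_peak_db: float = -1.5,
                      lra: float = 11.0) -> str:
    """Analysis pass: measure loudness and print the stats as JSON."""
    return (f"loudnorm=I={target_lufs}:TP={true_peak_db}:LRA={lra}"
            ":print_format=json")


def second_pass_filter(target_lufs: float, measured: dict,
                       true_peak_db: float = -1.5, lra: float = 11.0) -> str:
    """Normalisation pass, using the JSON values reported by the first pass."""
    return (
        f"loudnorm=I={target_lufs}:TP={true_peak_db}:LRA={lra}"
        f":measured_I={measured['input_i']}"
        f":measured_TP={measured['input_tp']}"
        f":measured_LRA={measured['input_lra']}"
        f":measured_thresh={measured['input_thresh']}"
        f":offset={measured['target_offset']}"
        ":linear=true"
    )
```

Each string would be passed to FFmpeg as an audio filter (`-af`), with -16 as the target for podcasts or -14 for Spotify/YouTube.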

