Speech-to-text API that fits your workflow.

Handle single-file transcription, async jobs, batch processing, polling, and webhooks with one API built for reliable transcription at scale.

curl -X POST https://api.transcribeapi.com/v1/transcribe \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -F "file=@audio.mp3" 
curl -X POST https://api.transcribeapi.com/v1/transcribe \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -F "file=@audio.mp3" \
  -F "language=en" \
  -F "webhook_url=https://app.com/webhook" 
$0.15
Per audio hour
10K
Files per batch
60 Mins
Free transcription
99
Languages

Built for developers who ship fast

Everything you need to add transcription to your product.

Simple REST API

One endpoint handles single files, async jobs, and batch processing. POST multipart or JSON—it just works.

Fast Batches

Batch jobs complete in minutes, not 24 hours. Up to 10,000 files and 10 GB per batch req.

SDKs for Every Stack

SDKs for Node.js, Python, and Cloudflare Workers. Handles uploads, polling, and retries for you.

Webhooks & Polling

Get results delivered to your server with signed webhooks, or poll job status. Works great with serverless.

Privacy First

Audio files are deleted immediately after transcription.

Detailed Docs

Every endpoint, error code, and option is documented with copy-paste examples in 7 languages.

Simple, predictable pricing

Pay only for the audio you transcribe. No hidden fees

$0.15/hour

Billed per second, rounded up to the nearest second.

10 minutes $0.025
1 hour $0.15
10 hours $1.50

Every new account gets 60 mins of free transcription just save a card to prevent abuse. No charge.

Fast transcription at any scale

These are maximum expected completion times.

Audio Duration Max Completion
Under 10 seconds 1–4 sec
Under 10 minutes Up to 25 sec
Single file under 1 hour Up to 5 min
Single file 1–5 hours Up to 15 min
Single file 5–10 hours Up to 30 min

Batch jobs add ~100–300 ms per file on top of the duration-based latency.

Powered by open-source Whisper

State-of-the-art speech recognition from OpenAI's Whisper family.

Whisper Large v3 Turbo

Optimized for speed. Delivers near real-time transcription with minimal latency—ideal for interactive applications and quick turnarounds.

Whisper Large v3

Maximum accuracy for challenging audio. Handles accents, background noise, and 99 languages with exceptional precision.

Clear limits, no surprises

Transcription Endpoints

Free 10 RPM
Pay as You Go 300 RPM

File & Batch Limits

Single file (sync) 30 MB / 10 min
Single file (async) 10 GB / 10 hrs
Batch files 10,000
Total batch size 10 GB

Ready to transcribe?

Get 60 mins of free transcription.

Start Transcribing Free