Transcribe meetings, podcasts, interviews, and customer calls at scale using the Runcrate Whisper API. This guide covers single-file transcription, batch processing across a folder, and output format options.Documentation Index
Fetch the complete documentation index at: https://runcrate.ai/docs/llms.txt
Use this file to discover all available pages before exploring further.
What you’ll build
A pipeline that transcribes multiple audio files using Whisper large-v3 through the Runcrate API. The pipeline reads a folder of recordings, transcribes each one, and writes the results as plain text, SRT subtitles, or structured JSON — your choice.Single file transcription
Batch processing
Transcribe every audio file in a folder and write the results to disk. This example uses the Runcrate Python SDK and outputs SRT subtitle files.Batch processing with the OpenAI SDK
The same pattern works with the OpenAI Python SDK pointed at the Runcrate API:Output formats
| Format | Extension | Use case |
|---|---|---|
text | .txt | Plain transcript — search, summarization, RAG ingestion |
json | .json | Structured output with word-level timestamps |
srt | .srt | Subtitle file for video editors (Premiere, DaVinci, Final Cut) |
vtt | .vtt | Web video subtitles (HTML5 <track> element) |
response_format parameter:
Language hints
Whisper auto-detects the spoken language, but you can improve accuracy on non-English audio by passing a language hint:Use cases
- Meeting recordings — transcribe and feed into an LLM for searchable notes and action items.
- Podcast episodes — generate full transcripts for show notes, blog posts, and SEO.
- Customer support calls — bulk-transcribe for quality analysis and compliance review.
- Lecture recordings — produce study materials and make content accessible.
- Video content — generate SRT/VTT subtitle files for automatic captioning.
Tips
- Supported formats. MP3, WAV, M4A, FLAC, OGG, and WebM.
- Large files. For recordings longer than ~2 hours, split into chunks before uploading. Tools like
ffmpegmake this easy:ffmpeg -i long-meeting.mp3 -f segment -segment_time 1800 -c copy chunk_%03d.mp3. - SRT for video editing. SRT is the most widely supported subtitle format across video editors and media players. Use VTT only if you need web-native
<track>elements. - Combine with chat models. Pipe transcripts into a Runcrate chat model for summarization, action-item extraction, or translation — see the RAG Pipeline guide for the pattern.