Overview
Salad Transcription API helps teams convert large volumes of media into accurate transcripts, captions, or subtitles.Salad Transcription API lets you generate accurate transcripts, captions, or subtitles for audio and video content using ASR models like Whisper.
That means you’ll be able to make your content globally accessible with captions and subtitles that ensure ADA compliance for audiences with disabilities.
Even better, Salad Transcription API lets you differentiate between speakers and most accents to transcribe content with many speakers.
Once you connect Salad Transcription API, you can send instructions from your platform via JSON and receive transcripts in the same format as well.
Using speech-to-text models, Salad Transcription API makes it easy to accurately transcribe your audio and video content for any use case.
Transcribe content without breaking the bank.
Get lifetime access to Salad Transcription API today!
Plans & features
- Lifetime access to Salad Transcription API
- All future Small Business Plan updates
- If Plan name changes, deal will be mapped to the new Plan name with all accompanying updates
- You must redeem your code(s) within 60 days of purchase
- Stack unlimited codes
- Each additional code beyond 10 increases transcription hours per month by 200 hours per month
- Reselling transcription hours is prohibited
- Built with Whisper-large-V3 as the core model
- Transcribe up to 2 hour audio/video files
- 97 languages (all current and future languages)
- Diarization (speaker recognition)
- Sentence/Word-level time codes
- Hallucination removal typical from Whisper-large-V3
- Transcripts, captions, and subtitles
- Supports audio (MP3, WAV, M4A, etc.)
- Supports video (MP4, AVI, WMV, MKV, WEBM, etc.)
- Automatic speech recognition
- JSON, SRT outputs