Function
Speech to Text With Speakers icon

Speech to Text With Speakers

Available ActionsEach successful request consumes credits as outlined below.

transcribe_quick100crtranscribe_standard150crtranscribe_extended200cr

Description

Turn any audio recording into clean, searchable text in seconds. Transcribe voice memos, meetings, interviews, podcasts, and webinars with accurate speech recognition that handles accents and background noise. Get plain text for quick reference, SRT or WebVTT subtitles for video captioning, or rich JSON output with word-level timestamps and speaker identification. Choose from three tiers based on recording length — up to 15, 30, or 60 minutes — and optionally enable speaker diarization to label who said what, profanity filtering, and alternative transcripts for maximum accuracy.