Run
Examples
API
Loading...
54 runs
Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.
Documents
🔗Chichewa ASR via MMS-Large Google Translate.mp3.wav
Transcription
45 runs
🔗audio_2024-10-25_16-19-54.ogg.wav
23 runs
Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.
🔗3_audio_from_918247443697_to_112808001793652.wav
18 runs
43 runs
8 runs
48 runs
155 runs
6 runs
🔗https://www.youtube.com/watch?v=pQx4f2u9R6E
20 runs
🔗Kikuyu via MMS Large and GhanaNLP for Speech Reco Trans.ogg.wav
41 runs
🔗WHATSAPP_audio_from_918764022384_to_113275925092502.wav
22 runs
🔗https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link
🔗https://www.youtube.com/watch?v=7ZrxTFxeyzY
88 runs
21 runs
61 runs
207 runs
This is one of the few Bhojpuri ASR models available based on: https://huggingface.co/Harveenchadha/vakyansh-wav2vec2-bhojpuri-bhom-60Transcribe mp3, WhatsApp audio + wavs. Optionally translate to any language too.
🔗https://www.youtube.com/watch?v=WK7rEMCVsGE
7 runs
🔗https://www.youtube.com/watch?v=PdNZ9ip0qjg
9 runs
🔗4dc062a8-b080-45c5-b538-cb30c854c83c.wav
2 runs
🔗https://www.youtube.com/watch?v=L-yHhIq3sE0&list=PL4aOhrbpcqyYGrLHMyZIgVNej2pa53uac&index=14
🔗https://www.youtube.com/watch?v=Cgd_Cjxyme4&list=PL3LYCEMgJ1urYQcVZEj1D9VlupHW3hlcT&index=2