Examples: Speech

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.


Dev Aggrawal

Whisper Large v3 - Kannada

Loading...

4 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

🔗3_audio_from_918247443697_to_112808001793652.wav

Transcription

Dev Aggrawal

Seamless M4T Kannada -> EN

Loading...

3 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

🔗3_audio_from_918247443697_to_112808001793652.wav

Transcription

Dev Aggrawal

Seamless M4T - Swahili -> EN

Loading...

16 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

🔗WHATSAPP_audio_from_918764022384_to_113275925092502.wav

Transcription

Dev Aggrawal

Whisper Large v2 - Swahili -> EN

Loading...

1 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

🔗WHATSAPP_audio_from_918764022384_to_113275925092502.wav

Transcription

Sean Blagsvedt

Whisper v3 - Swahili -> EN

Loading...

13 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

🔗https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Transcription

Sean Blagsvedt

Azure ASR (Swahili)

Loading...

1 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

🔗https://www.youtube.com/watch?v=7ZrxTFxeyzY

Transcription

Ambika Joshi

Whisper v3 (with english translation)

Loading...

52 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

🔗https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Transcription

Ambika Joshi

Whisper v2 (auto detect)

Loading...

0 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

🔗https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Transcription

Ambika Joshi

Conformer Hindi (ai4bharat)

Loading...

32 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

🔗https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Ambika Joshi

Chirp/USM (Google)

Loading...

47 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

🔗https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Transcription

Ambika Joshi

Deepgram

Loading...

43 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

🔗https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Transcription

Ambika Joshi

Whisper Hindi Large v2 (Bhashini)

Loading...

35 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

🔗https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Transcription

Ambika Joshi

Azure - Hindi (Microsoft)

Loading...

28 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

🔗https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Transcription

Ambika Joshi

Seamless4MT (facebook)

Loading...

43 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

🔗https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Transcription

Whisper Large v2 - Kannada

Loading...

39 runs

Documents

🔗3_audio_from_918247443697_to_112808001793652.wav

Transcription

Chirp - Kannada

Loading...

17 runs

Documents

🔗3_audio_from_918247443697_to_112808001793652.wav

Transcription

Azure - Kannada

Loading...

17 runs

Documents

🔗3_audio_from_918247443697_to_112808001793652.wav

Transcription

Swahili Speech Recognition & Translation via Google Chirp & Translate

Loading...

21 runs

Documents

🔗https://www.youtube.com/watch?v=7ZrxTFxeyzY

Transcription

Bhojpuri Speech Recognition (using Gates/Ekstep)

Loading...

118 runs

This is one of the few Bhojpuri ASR models available based on: https://huggingface.co/Harveenchadha/vakyansh-wav2vec2-bhojpuri-bhom-60 Transcribe mp3, WhatsApp audio + wavs. Optionally translate to any language too.

Documents

🔗https://www.youtube.com/watch?v=WK7rEMCVsGE

Transcription

Speech Recognition and Translation

Loading...

6 runs

Documents

🔗https://www.youtube.com/watch?v=PdNZ9ip0qjg

Transcription

Speech Recognition and Translation

Loading...

8 runs

Documents

🔗4dc062a8-b080-45c5-b538-cb30c854c83c.wav

Transcription

Speech Recognition and Translation

Loading...

1 runs

Documents

🔗https://www.youtube.com/watch?v=L-yHhIq3sE0&list=PL4aOhrbpcqyYGrLHMyZIgVNej2pa53uac&index=14

Transcription

Speech Recognition and Translation

Loading...

1 runs

Documents

🔗https://www.youtube.com/watch?v=L-yHhIq3sE0&list=PL4aOhrbpcqyYGrLHMyZIgVNej2pa53uac&index=14

Transcription

Speech Recognition and Translation

Loading...

1 runs

Documents

🔗https://www.youtube.com/watch?v=Cgd_Cjxyme4&list=PL3LYCEMgJ1urYQcVZEj1D9VlupHW3hlcT&index=2

Transcription