Examples: Speech


Chichewa ASR via MMS-Large + Google Translate

Loading...

4 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

šŸ”—https://drive.google.com/file/d/1BhLCiVGYXTHYEe_lfxdr00R1aX0FRh8t/view?usp=drive_link

Transcription

Whisper Large v3 - Kannada

Loading...

5 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

šŸ”—3_audio_from_918247443697_to_112808001793652.wav

Transcription

Seamless M4T Kannada -> EN

Loading...

3 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

šŸ”—3_audio_from_918247443697_to_112808001793652.wav

Transcription

Seamless M4T - Swahili -> EN

Loading...

17 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

šŸ”—WHATSAPP_audio_from_918764022384_to_113275925092502.wav

Transcription

Whisper Large v2 - Swahili -> EN

Loading...

2 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

šŸ”—WHATSAPP_audio_from_918764022384_to_113275925092502.wav

Transcription

Whisper v3 - Swahili -> EN

Loading...

15 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

šŸ”—https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Transcription

Azure ASR (Swahili)

Loading...

2 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

šŸ”—https://www.youtube.com/watch?v=7ZrxTFxeyzY

Transcription

Whisper v3 (with english translation)

Loading...

53 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

šŸ”—https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Transcription

Whisper v2 (auto detect)

Loading...

0 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

šŸ”—https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Transcription

Conformer Hindi (ai4bharat)

Loading...

32 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

šŸ”—https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Chirp/USM (Google)

Loading...

47 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

šŸ”—https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Transcription

Deepgram

Loading...

43 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

šŸ”—https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Transcription

Whisper Hindi Large v2 (Bhashini)

Loading...

59 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

šŸ”—https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Transcription

Azure - Hindi (Microsoft)

Loading...

28 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

šŸ”—https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Transcription

Seamless4MT (facebook)

Loading...

45 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

šŸ”—https://drive.google.com/file/d/1DQpKCg2J_Osq7u-2hsfvc6R9dFndaTqk/view?usp=drive_link

Transcription

Whisper Large v2 - Kannada

Loading...

39 runs

Documents

šŸ”—3_audio_from_918247443697_to_112808001793652.wav

Transcription

Chirp - Kannada

Loading...

17 runs

Documents

šŸ”—3_audio_from_918247443697_to_112808001793652.wav

Transcription

Azure - Kannada

Loading...

17 runs

Documents

šŸ”—3_audio_from_918247443697_to_112808001793652.wav

Transcription

Swahili Speech Recognition & Translation via Google Chirp & Translate

Loading...

21 runs

Documents

šŸ”—https://www.youtube.com/watch?v=7ZrxTFxeyzY

Transcription

Bhojpuri Speech Recognition (using Gates/Ekstep)

Loading...

119 runs

This is one of the few Bhojpuri ASR models available based on: https://huggingface.co/Harveenchadha/vakyansh-wav2vec2-bhojpuri-bhom-60
Transcribe mp3, WhatsApp audio + wavs. Optionally translate to any language too.

Documents

šŸ”—https://www.youtube.com/watch?v=WK7rEMCVsGE

Transcription

Speech Recognition and Translation

Loading...

6 runs

Documents

šŸ”—https://www.youtube.com/watch?v=PdNZ9ip0qjg

Transcription

Speech Recognition and Translation

Loading...

8 runs

Documents

šŸ”—4dc062a8-b080-45c5-b538-cb30c854c83c.wav

Transcription

Speech Recognition and Translation

Loading...

1 runs

Documents

šŸ”—https://www.youtube.com/watch?v=L-yHhIq3sE0&list=PL4aOhrbpcqyYGrLHMyZIgVNej2pa53uac&index=14

Transcription

Speech Recognition and Translation

Loading...

1 runs

Documents

šŸ”—https://www.youtube.com/watch?v=L-yHhIq3sE0&list=PL4aOhrbpcqyYGrLHMyZIgVNej2pa53uac&index=14

Transcription

Speech Recognition and Translation

Loading...

1 runs

Documents

šŸ”—https://www.youtube.com/watch?v=Cgd_Cjxyme4&list=PL3LYCEMgJ1urYQcVZEj1D9VlupHW3hlcT&index=2

Transcription