Examples: Speech


Chichewa ASR (MMS+Google Translate)

Loading...

59 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

Transcription

Azure - Hindi (Microsoft)

Loading...

48 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

Transcription

Gooey.AI (Dara.network Inc)

Azure - Telugu

Loading...

23 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

Transcription

Gooey.AI (Dara.network Inc)

Chirp - Telugu

Loading...

18 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

Transcription

Gooey.AI (Dara.network Inc)

Whisper Large v2 - Telugu

Loading...

43 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

Transcription

Seamless M4T Telugu-> EN

Loading...

8 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

Transcription

Whisper Large v3 - Telegu

Loading...

48 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

Transcription

Whisper Hindi Large v2 (Bhashini)

Loading...

162 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

Transcription

Speaker diarization via Deepgram

Loading...

6 runs

Documents

Transcription

Seamless M4T v2 - Swahili -> EN

Loading...

41 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

Transcription

Whisper Large v2 - Swahili -> EN

Loading...

22 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

Transcription

Whisper v3 - Swahili -> EN

Loading...

43 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

Transcription

Azure ASR (Swahili)

Loading...

20 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

Transcription

Whisper v3 (with english translation)

Loading...

91 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

Transcription

Whisper v2 (auto detect)

Loading...

21 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

Transcription

Chirp/USM (Google)

Loading...

61 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

Transcription

Gooey.AI (Dara.network Inc)

Swahili Speech Recognition & Translation via Google Chirp & Translate

Loading...

23 runs

Documents

Transcription

Gooey.AI (Dara.network Inc)

Bhojpuri Speech Recognition (using Gates/Ekstep)

Loading...

214 runs

This is one of the few Bhojpuri ASR models available based on: https://huggingface.co/Harveenchadha/vakyansh-wav2vec2-bhojpuri-bhom-60
Transcribe mp3, WhatsApp audio + wavs. Optionally translate to any language too.

Documents

Transcription

Gooey.AI (Dara.network Inc)

Speech Recognition and Translation

Loading...

7 runs

Documents

Transcription