Examples: Speech


Gooey.AI

Chichewa ASR (Seamless M4T v2 + Google Translate)

Loading...

1 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

Transcription

Gooey.AI

Generic Whisper v2 + Google Translate

Loading...

2 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper v2

Documents

Transcription

Azure - Hindi (Microsoft)

Loading...

50 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

Transcription

Gooey.AI

Azure - Telugu

Loading...

25 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

Transcription

Gooey.AI

Chirp - Telugu

Loading...

19 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

Transcription

Gooey.AI

Whisper Large v2 - Telugu

Loading...

43 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

Transcription

Seamless M4T Telugu-> EN

Loading...

12 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

Transcription

Whisper Large v3 - Telegu

Loading...

50 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

Transcription

Whisper Hindi Large v2 (Bhashini)

Loading...

174 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

Transcription

Speaker diarization via Deepgram

Loading...

6 runs

Documents

Transcription

Seamless M4T v2 - Swahili -> EN

Loading...

41 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

Transcription

Whisper Large v2 - Swahili -> EN

Loading...

22 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

Transcription

Whisper v3 - Swahili -> EN

Loading...

44 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

Transcription

Azure ASR (Swahili)

Loading...

20 runs

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Documents

Transcription

Whisper v3 (with english translation)

Loading...

92 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

Transcription

Whisper v2 (auto detect)

Loading...

21 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

Transcription

Chirp/USM (Google)

Loading...

61 runs

Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.

Documents

Transcription

Gooey.AI

Swahili Speech Recognition & Translation via Google Chirp & Translate

Loading...

23 runs

Documents

Transcription

Gooey.AI

Bhojpuri Speech Recognition (using Gates/Ekstep)

Loading...

223 runs

This is one of the few Bhojpuri ASR models available based on: https://huggingface.co/Harveenchadha/vakyansh-wav2vec2-bhojpuri-bhom-60
Transcribe mp3, WhatsApp audio + wavs. Optionally translate to any language too.

Documents

Transcription