• Dev Aggarwal
  • Examples: Speech


    Chichewa ASR (MMS+Google Translate)

    59 runs

    Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta SeamlessM4T, AI4Bharat, Bhashini, etc. Optionally translate to any language too.

    Documents

    Transcription

    Azure - Hindi (Microsoft)

    48 runs

    Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta SeamlessM4T, AI4Bharat, Bhashini, etc. Optionally translate to any language too.

    Documents

    Transcription

    Gooey.AI (Dara.network Inc)

    Azure - Telugu

    23 runs

    Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

    Documents

    Transcription

    Gooey.AI (Dara.network Inc)

    Chirp - Telugu

    18 runs

    Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

    Documents

    Transcription

    Gooey.AI (Dara.network Inc)

    Whisper Large v2 - Telugu

    43 runs

    Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

    Documents

    Transcription

    Seamless M4T Telugu -> EN

    8 runs

    Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

    Documents

    Transcription

    Whisper Large v3 - Telugu

    48 runs

    Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

    Documents

    Transcription

    Whisper Hindi Large v2 (Bhashini)

    156 runs

    Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta SeamlessM4T, AI4Bharat, Bhashini, etc. Optionally translate to any language too.

    Documents

    Transcription

    Speaker diarization via Deepgram

    6 runs

    Documents

    Transcription

    Seamless M4T v2 - Swahili -> EN

    41 runs

    Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

    Documents

    Transcription

    Whisper Large v2 - Swahili -> EN

    22 runs

    Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

    Documents

    Transcription

    Whisper v3 - Swahili -> EN

    43 runs

    Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta SeamlessM4T, AI4Bharat, Bhashini, etc. Optionally translate to any language too.

    Documents

    Transcription

    Azure ASR (Swahili)

    20 runs

    Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

    Documents

    Transcription

    Whisper v3 (with English translation)

    89 runs

    Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta SeamlessM4T, AI4Bharat, Bhashini, etc. Optionally translate to any language too.

    Documents

    Transcription

    Whisper v2 (auto detect)

    21 runs

    Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta SeamlessM4T, AI4Bharat, Bhashini, etc. Optionally translate to any language too.

    Documents

    Transcription

    Chirp/USM (Google)

    61 runs

    Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta SeamlessM4T, AI4Bharat, Bhashini, etc. Optionally translate to any language too.

    Documents

    Transcription

    Gooey.AI (Dara.network Inc)

    Swahili Speech Recognition & Translation via Google Chirp & Translate

    23 runs

    Documents

    Transcription

    Gooey.AI (Dara.network Inc)

    Bhojpuri Speech Recognition (using Gates/Ekstep)

    211 runs

    This is one of the few Bhojpuri ASR models available. It is based on: https://huggingface.co/Harveenchadha/vakyansh-wav2vec2-bhojpuri-bhom-60
    Transcribe mp3, WhatsApp audio + wavs. Optionally translate to any language too.

    Documents

    Transcription

    Gooey.AI (Dara.network Inc)

    Speech Recognition and Translation

    7 runs

    Documents

    Transcription

    Gooey.AI (Dara.network Inc)

    Speech Recognition and Translation

    9 runs

    Documents

    Transcription

    Gooey.AI (Dara.network Inc)

    Speech Recognition and Translation

    2 runs

    Documents

    Transcription

    Gooey.AI (Dara.network Inc)

    Speech Recognition and Translation

    2 runs

    Documents

    Transcription

    Gooey.AI (Dara.network Inc)

    Speech Recognition and Translation

    2 runs

    Documents

    Transcription