Speech Recognition & Translation

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Loading...

Swahili Speech Recognition & Translation via Chirp (+ Google Translate)

Documents

🔗https://www.youtube.com/watch?v=7ZrxTFxeyzY

Transcription

Loading...

Bhojpuri Speech Recognition (using Gates/Ekstep)

This is one of the few Bhojpuri ASR models available based on: https://huggingface.co/Harveenchadha/vakyansh-wav2vec2-bhojpuri-bhom-60 Transcribe mp3, WhatsApp audio + wavs. Optionally translate to any language too.

Documents

🔗https://www.youtube.com/watch?v=WK7rEMCVsGE

Transcription

Loading...

Documents

🔗https://www.youtube.com/watch?v=PdNZ9ip0qjg

Transcription

Loading...

Documents

🔗4dc062a8-b080-45c5-b538-cb30c854c83c.wav

Transcription

Loading...

Documents

🔗https://www.youtube.com/watch?v=L-yHhIq3sE0&list=PL4aOhrbpcqyYGrLHMyZIgVNej2pa53uac&index=14

Transcription

Loading...

Documents

🔗https://www.youtube.com/watch?v=L-yHhIq3sE0&list=PL4aOhrbpcqyYGrLHMyZIgVNej2pa53uac&index=14

Transcription

Loading...

Documents

🔗https://www.youtube.com/watch?v=Cgd_Cjxyme4&list=PL3LYCEMgJ1urYQcVZEj1D9VlupHW3hlcT&index=2

Transcription