Speech Recognition and Translation

Transcribe MP3s, WhatsApp audio, and WAVs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.


Audio Files


Whisper Large v2 (openai)

English | en


Run cost = 2 credits (1 credit for 12.5 words ≈ 0.08 per word)

By submitting, you agree to Gooey.AI's terms & privacy policy.

Transcription

Generated in 3.4s on

...

Related Workflows

Copilot Builder

Create customized chatbots from your own docs/PDF/webpages. Craft your own bot prompts using the creative GPT3, fast GPT 3.5-turbo or powerful GPT4 & optionally prevent hallucinations by constraining all answers to just your citations. Available as Facebook, Instagram, WhatsApp bots or via API. Add multi-lingual speech recognition and text-to-speech in 100+ languages and even video responses. Collect 👍🏾 👎🏽 feedback + see usage & retention graphs too! This is the workflow that powers https://Farmer.CHAT and it's yours to tweak.

Lip Sync with TTS

Add your text prompt, pick a voice & upload a sample video to quickly create realistic lipsync videos. Discover the ease of text-to-video AI.

Compare AI Voice Generators

Input your text, pick a voice & a Text-to-Speech AI engine to create audio. Compare the best voice generators from Google, UberDuck.ai & more to add automated voices to your podcast, YouTube videos, website, or app.

Compare LLMs: GPT4, Claude3, Gemini 1.5, LLaMA2 vs Mixtral

Which language model works best for your prompt? Compare your text generations across multiple large language models (LLMs) like OpenAI's evolving and latest ChatGPT engines and others like Curie, Ada, Babbage.