Save as New
Transcribe any YouTube, mp3, WhatsApp audio or wavs with the best of transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta Seemless4MT, AI4Bharat, Bhashini, etc Optionally translate to any language too.
Run
Examples
API
Submit Links in Bulk
Filter by Language
All Languages
Chirp / USM (Google V2)
Auto Detect
🔠 Translate
Choose a model and language to translate recognized audio
Google Translate
English | en
⚙️ Settings
This is usually inferred from the spoken language, but in case that is set to Auto detect, you can specify one explicitly.
Provide a glossary to customize translation and improve accuracy of domain-specific terms.If not specified or invalid, no glossary will be used. Read about the expected format here.
Text
JSON
SRT
VTT
Run cost = 1 credits (1 credit for 12.5 words ≈ 0.08 per word)
🏃 Run
By submitting, you agree to Gooey.AI's terms & privacy policy.
Transcription
Generated in 4.8s on
...
ℹ️ Details
🙋🏽♀️ Need more help? Join our Discord
Gooey.AI's Copilot is the best chatbot builder anywhere, combining your choice of LLMs (GPT4o, GPT4o-mini, Gemini, Claude3.5, Mixtral or LLaMA3), knowledge docs from any link or doc/PDF (with table …
Create realistic lipsync videos with custom voices. Just upload a video or image, choose a voice from Google, OpenAI or bring your own voice from Eleven Labs to generate amazing videos with the Gooey.AI …
Input your text, pick a voice & a Text-to-Speech AI engine to create audio. Compare the best voice generators from Bark/Suno, …
Which language model works best for your prompt? What are the biases inherent in each? Compare LLaMA3, Gemini, Mistral, OpenAI GPT-4o engines with more LLMs being added each month.
If you are looking for …