Speech Recognition and Translation

Transcribe MP3s, WhatsApp voice messages, and YouTube videos in 1000+ languages with Meta's MMS / Seamless M4T, OpenAI's GPT4o Audio LLM, Whisper v2/v3, Azure, Google, GhanaNLP, AI4Bharat & Bhashini ASR models. Optionally translate to any language too.

11d ago

🎙️ Audio Files

Speech-to-Text Provider

Spoken Language

🔠 Translate

Choose a model and language to translate recognized audio

Translation Model

Target Translation Language
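
The form fields above can be read as a single request: audio files, a speech-to-text provider, a spoken language, and an optional translation step. The sketch below is only an illustration in Python, not Gooey.AI's API: the `TranscriptionRequest` class, its field names, and the example values are all hypothetical and simply mirror the labels on this page.

```python
from dataclasses import asdict, dataclass
from typing import List, Optional


@dataclass
class TranscriptionRequest:
    # Hypothetical structure mirroring the page's form labels; not a real API schema.
    audio_files: List[str]                   # "Audio Files": URLs or paths to audio
    speech_to_text_provider: str             # "Speech-to-Text Provider"
    spoken_language: Optional[str] = None    # "Spoken Language"
    translation_model: Optional[str] = None  # "Translation Model"
    target_language: Optional[str] = None    # "Target Translation Language"

    def validate(self) -> None:
        """Check the constraints implied by the form layout."""
        if not self.audio_files:
            raise ValueError("at least one audio file is required")
        # Translation is optional, but a model and a target language go together.
        if (self.translation_model is None) != (self.target_language is None):
            raise ValueError("set both translation_model and target_language, or neither")


request = TranscriptionRequest(
    audio_files=["https://example.com/voice-note.mp3"],  # placeholder URL
    speech_to_text_provider="whisper-v3",                # placeholder value
    spoken_language="hi",                                # placeholder value
    translation_model="example-translator",              # placeholder value
    target_language="en",
)
request.validate()
payload = asdict(request)  # plain dict, e.g. for serializing to JSON
```

Leaving both translation fields unset models a transcription-only run, which matches the page's description of translation as optional.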

Run cost = 2 credits (1 credit per 12.5 words = 0.08 credits per word)
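
The per-word rate quoted above can be checked with a few lines of arithmetic. This is a minimal sketch, assuming cost scales linearly with word count at 1 credit per 12.5 words; the function name `estimate_credits` is hypothetical, and any rounding or minimum charge that Gooey.AI applies is not stated on this page and is not modeled.

```python
WORDS_PER_CREDIT = 12.5  # rate quoted on the page: 1 credit for 12.5 words


def estimate_credits(word_count: int) -> float:
    """Estimate the credits for a transcript of `word_count` words.

    Assumption: cost is purely linear in word count (no rounding, no minimum).
    """
    if word_count < 0:
        raise ValueError("word_count must be non-negative")
    return word_count / WORDS_PER_CREDIT


per_word = estimate_credits(1)       # credits for a single word
for_25_words = estimate_credits(25)  # credits for a 25-word transcript
print(per_word, for_25_words)
```

Under this linear assumption, a 25-word transcript works out to 2 credits, the run cost shown above.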

Transcription

Generated in 15.6s on

...

ℹ️ Details

Related Workflows

Copilot Builder

Gooey.AI's Copilot is the best chatbot builder anywhere, combining your choice of LLMs (GPT4o, GPT4o-mini, Gemini, Claude3.5, Mixtral or LLaMA3), knowledge docs from any link or doc/PDF (with table …

Lipsync with Text-to-Speech

Create realistic lipsync videos with custom voices. Just upload a video or image, choose a voice from Google, OpenAI or bring your own voice from Eleven Labs to generate amazing videos with the Gooey.AI …

Compare AI Voice Generators

Input your text, pick a voice & a Text-to-Speech AI engine to create audio. Compare the best voice generators from Bark/Suno, …

Compare LLMs: GPT4o, Claude3.5 Sonnet, Gemini 1.5 Pro, LLaMA3 vs Mixtral

Which language model works best for your prompt? What are the biases inherent in each? Compare LLaMA3, Gemini, Mistral, OpenAI GPT-4o engines with more LLMs being added each month. If you are looking for local …

Output Format

🧩 Developer Tools and Functions
