Chirp/USM (Google)

Transcribe any YouTube video, MP3, WhatsApp audio, or WAV file with the best transcription and translation AI models from OpenAI (Whisper v2 & v3), Microsoft Azure, Google USM, Meta SeamlessM4T, AI4Bharat, Bhashini, etc. Optionally, translate the transcript into any language.

🎙️ Audio Files

Show as Links

Filter by Language

Speech-to-Text Provider

Spoken Language

🔠 Translate

Choose a model and language to translate recognized audio

Translation Model

Target Translation Language

⚙️ Settings

Source Translation Language

This is usually inferred from the spoken language, but if that is set to Auto detect, you can specify one explicitly.

Translation Glossary

Provide a glossary to customize translation and improve accuracy of domain-specific terms.
If not specified or invalid, no glossary will be used. Read about the expected format here.

Output Format

Text

JSON

SRT

VTT

🧩 Developer Tools and Functions

Run cost = 1 credit (1 credit per 12.5 words ≈ 0.08 credits per word)
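
The pricing arithmetic can be sketched as follows. Only the 12.5 words-per-credit rate comes from the line above; the function name `estimated_credits` is hypothetical, and the page does not say how fractional credits are rounded or whether a minimum charge applies, so the sketch returns the unrounded value.

```python
WORDS_PER_CREDIT = 12.5  # rate stated on the page


def estimated_credits(word_count: int, words_per_credit: float = WORDS_PER_CREDIT) -> float:
    """Return the unrounded credit estimate for a transcript of word_count words.

    Rounding and minimum-charge rules are not stated on the page,
    so none are applied here.
    """
    if word_count < 0:
        raise ValueError("word_count must be non-negative")
    return word_count / words_per_credit


if __name__ == "__main__":
    # Per-word cost implied by the stated rate.
    print(estimated_credits(1))
    # Estimate for a 250-word transcript.
    print(estimated_credits(250))
```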
