Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.
πββοΈRun
π Examples
π API
Enter Custom URLs
βοΈ Settings
Text
JSON
SRT
VTT
Title
Notes
Run cost = 15 credits (1 credit for 12.5 words β 0.08 per word)
π Submit
By submitting, you agree to Gooey.AI's terms & privacy policy.
Transcription
βΉοΈ Details
This workflow let's you compare the latest and finest speech recognition models from OpenAI, AI4Bharat and Bhashini and Google's USM coming soon.
Just upload an audio file (mp3, wav, ogg or aac file) setting its language and then choose a speech recognition engine. You can also translate the output to any language too (using Google's Translation APIs).
ππ½ββοΈ Need more help? Join our Discord
Create customized chatbots from your own docs/PDF/webpages. Craft your own bot prompts using the creative GPT3, fast GPT 3.5-turbo or powerful GPT4 & optionally prevent hallucinations by constraining all answers to just your citations. Available as Facebook, Instagram, WhatsApp bots or via API. Add multi-lingual speech recognition and text-to-speech in 100+ languages and even video responses. Collect ππΎ ππ½ feedback + see usage & retention graphs too! This is the workflow that powers https://Farmer.CHAT and it's yours to tweak.
Add your text prompt, pick a voice & upload a sample video to quickly create realistic lipsync videos. Discover the ease of text-to-video AI.
Input your text, pick a voice & a Text-to-Speech AI engine to create audio. Compare the best voice generators from Google, UberDuck.ai & more to add automated voices to your podcast, YouTube videos, website, or app.
Which language model works best your prompt? Compare your text generations across multiple large language models (LLMs) like OpenAI's evolving and latest ChatGPT engines and others like Curie, Ada, Babbage.