Gates Foundation

gatesfoundation

A workspace for the Gates Foundation DPI, FairFoward and Gooey teams focused on evals for low-resource languages plus the home of our Agriculture advisory work e.g. https://gooey.ai/ageval

332 Public Workflows
8 Members

Here we take an input sample of Kikuyu phrases (and their expert transcription and English translations) and compare the top Speech Reco and Translation models. As of Mar 25, 2025, we've found that Meta's MMS-Large works best for Kikuyu with Ghana NLP's Kikuyu to English translation services.

. Which AI model actually works best for your needs?
Upload your own data and evaluate any Gooey.AI workflow, LLM or AI model against any other.
Great for large data sets, AI model evaluation, task automation, parallel processing and automated testing.
To get started, paste in a Gooey.AI workflow, upload a CSV of your test data (with header names!), check the mapping of headers to workflow inputs and tap Submit.
More tips in the Details below.

Compare Kinyarwanda Speech Recognition and Translation to English with this run.

Compare Swahili Speech Recognition and Translation to English with this run. We have 10 variations in this. Including the latest Seamless M4T v2, MMS, Azure, Whisper v3 and more.

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

šŸ‘‚šŸ¼

Here we take an input sample of Kikuyu phrases (and their expert transcription and English translations) and compare top LLMs as transalors. Currently Gemini 2.5 is winning...

Here we take an input sample of Kikuyu phrases (and their expert transcription and English translations) and compare top LLMs as transalors. Currently Gemini 2.5 is winning...

Here we take an input sample of Kikuyu phrases (and their expert transcription and English translations) and compare top LLMs as transalors. Currently Gemini 2.5 is winning...

Which language model works best for your prompt? What are the biases inherent in each? Compare the latest models from Google, Deepseek, OpenAI, Mistral, Meta, OpenAI and Anthropic with more LLMs being added each month. If you are looking for local language models we now also host Sarvam and SEA-LION!

🧠

9mo ago

248 runs

Public

Which language model works best for your prompt? What are the biases inherent in each? Compare the latest models from Google, Deepseek, OpenAI, Mistral, Meta, OpenAI and Anthropic with more LLMs being added each month. If you are looking for local language models we now also host Sarvam and SEA-LION!

🧠

9mo ago

249 runs

Public

Which language model works best for your prompt? What are the biases inherent in each? Compare the latest models from Google, Deepseek, OpenAI, Mistral, Meta, OpenAI and Anthropic with more LLMs being added each month. If you are looking for local language models we now also host Sarvam and SEA-LION!

🧠

added {{ language }} variable

9mo ago

266 runs

Public

Which language model works best for your prompt? What are the biases inherent in each? Compare the latest models from Google, Deepseek, OpenAI, Mistral, Meta, OpenAI and Anthropic with more LLMs being added each month. If you are looking for local language models we now also host Sarvam and SEA-LION!

🧠

added {{ language }} variable

9mo ago

264 runs

Public

Which language model works best for your prompt? What are the biases inherent in each? Compare the latest models from Google, Deepseek, OpenAI, Mistral, Meta, OpenAI and Anthropic with more LLMs being added each month. If you are looking for local language models we now also host Sarvam and SEA-LION!

🧠

add {{ language }} variable

9mo ago

265 runs

Public

added a variable for language

🧠

9mo ago

262 runs

Public

Which language model works best for your prompt? What are the biases inherent in each? Compare the latest models from Google, Deepseek, OpenAI, Mistral, Meta, OpenAI and Anthropic with more LLMs being added each month. If you are looking for local language models we now also host Sarvam and SEA-LION!

🧠

9mo ago

249 runs

Public

Compare translation models like Google Translate, GhanaNLP to understand which model provides the best translation, compare latency and accuracy and test for production with our Eval tool!

šŸ—£ļø

Compare translation models like Google Translate, GhanaNLP to understand which model provides the best translation, compare latency and accuracy and test for production with our Eval tool!

šŸ—£ļø