Sarvam AgBot

Here we compare the top speech recognition, LLM and machine translation models for Kinyarwanda. In short, unfortunately the realtime OpenAI models (GPT4o-Audio and GPT-realtime) perform badly while leveraging dedicated ASR models (MMS, Sunbird, MBaza) in conjunction with GPT5 and Google 2.5 Pro appears to work reasonably well.

Computational Mama aka Ambika

9mo ago

Public

Kinyarwanda ChatGPT (GPT4o-A+GPT5) (A2T)

💬 Copilot

Web

A version that attempts to use the same models as ChatGPT for Kinyarwanda as of Sept 15, 2025. GPT-4o Audio (the latest published audio models) + GPT5.

Sean Blagsvedt

Added TTS

9mo ago

Public

REALTIME: Kinyarwanda Audio2Text Prompt Compare - 3Qs

🦾 Bulk

Here we compare the top speech recognition, LLM and machine translation models for Kinyarwanda. In short, unfortunately the realtime OpenAI models (GPT4o-Audio and GPT-realtime) perform badly while leveraging dedicated ASR models (MMS, Sunbird, MBaza) in conjunction with GPT5 and Google 2.5 Pro appears to work reasonably well.

Computational Mama aka Ambika

9mo ago

Public

GPT-Realtime, Kinyarwanda on Twilio (Medium Prompt)

💬 Copilot

Pushed Kinyarwanda on Twilio

💬

Computational Mama aka Ambika

9mo ago

Public

GPT-Realtime, Kinyarwanda on Twilio (Long Prompt)

💬 Copilot

Pushed Kinyarwanda on Twilio

💬

Computational Mama aka Ambika

9mo ago

Public

Top4: Swahili Audio2Text Comparison (11 sept - 30Qs)

🦾 Bulk

Which of the latest models best understand Swahili questions (as WhatsApp audio notes) and can provide an English? GPT-realtime, GPT4o realtime, Jacaranda(ASR) + GPT5 (LLM as MT) and Jacarandra + GPT5 + Google Translate are compared.

Sean Blagsvedt

Removed extra rows and columns

9mo ago

Public

Top5: Kinyarwanda Audio2Text Compare - 30Qs

🦾 Bulk

Here we compare the top speech recognition, LLM and machine translation models for Kinyarwanda. In short, unfortunately the realtime OpenAI models (GPT4o-Audio and GPT-realtime) perform badly while leveraging dedicated ASR models (MMS, Sunbird, MBaza) in conjunction with GPT5 and Google 2.5 Pro appears to work reasonably well.

Sean Blagsvedt

9mo ago

Public

Kikuyu (A2A) Latency Eval

🦾 Bulk

This workflow is designed specifically to measure end-to-end latency for voice-based AI interactions, focusing on benchmarking system response times rather than providing full conversational answers. Incoming audio samples (in Kikuyu or other selected languages) are transcribed, processed by an AI assistant with a maximum output of 10 tokens, and then synthesized back to audio. The workflow compares two different Kikuyu audio-to-audio (A2A) translation pipelines by processing input samples from a Google Sheet and logging runtime, price, and output URLs for each. This setup enables reliable benchmarking and optimization of transcription, AI processing, and text-to-speech components, helping teams evaluate latency and cost across different ASR models for the Kikuyu language.

🦾

Sean Blagsvedt

9mo ago

Public

Kinyarwanda (A2A) Latency Eval

🦾 Bulk

Here we compare the top speech recognition, LLM and machine translation models for Kinyarwanda. In short, unfortunately the realtime OpenAI models (GPT4o-Audio and GPT-realtime) perform badly while leveraging dedicated ASR models (MMS, Sunbird, MBaza) in conjunction with GPT5 and Google 2.5 Pro appears to work reasonably well.

Computational Mama aka Ambika

9mo ago

Public