(25 Qs) Kinyarwanda Audio to Text Benchmark | Gemini 3 Pro, GPT‑4o, Mbaza, Sunbird & Omnilingual

(25 Qs, updated 10 Jan 2025)
This page benchmarks ten Kinyarwanda speech‑to‑text workflows, covering both single models and multi‑step pipelines.
For each system, we play the same Kinyarwanda audio and capture its text output.
We then compare that text to a trusted reference answer and give a score between 0 and 1.

A higher score means the system output is closer to the reference text and usually more accurate.
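The page does not say how the 0 to 1 score is computed. As a rough illustration only, the sketch below scores an output against a reference with a word-level similarity ratio from Python's standard `difflib` module. The function names `normalize` and `similarity_score` are hypothetical, and this is not necessarily the metric used for the results on this page.

```python
from difflib import SequenceMatcher


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial formatting differences are ignored."""
    return " ".join(text.lower().split())


def similarity_score(output: str, reference: str) -> float:
    """Return a word-level similarity between 0 and 1.

    ASSUMPTION: this ratio is a stand-in metric for illustration;
    the benchmark's actual scoring method is not stated on the page.
    """
    out_words = normalize(output).split()
    ref_words = normalize(reference).split()
    if not out_words and not ref_words:
        return 1.0  # two empty transcripts are treated as identical
    # ratio() = 2 * matching_words / (len(ref_words) + len(out_words))
    return SequenceMatcher(None, ref_words, out_words).ratio()
```

With this sketch, an output that matches the reference word for word scores 1.0, and an empty output against a non-empty reference scores 0.0.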

   Workflow                      Accuracy (mean)   Latency (median)
0  GPT‑Realtime                  0.04              5.58
1  Mbaza+GPT‑5.1                 0.92              3.46
2  Mbaza+Gemini 3 Pro            0.95              8.98
3  Sunbird+GPT‑5.1               0.63              4.28
4  Mbaza+GPT‑5.1+GMT             0.87              3.25
5  Omnilingual+GPT‑5.1           0.84              4.61
6  Omnilingual+Gemini 3 Pro      0.95              9.97
7  Gemini 3 Pro                  0.90              9.54
8  Omnilingual+Gemini 3 Flash    0.92              6.06
9  Mbaza+Gemini 3 Flash          0.91              5.04
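The column headers indicate that accuracy is reported as a mean and latency as a median. A minimal sketch of that aggregation for one workflow follows. The per-question numbers are made up for illustration, and the `summarize` helper is hypothetical. The median is a sensible summary for latency because a single slow run does not distort it.

```python
from statistics import mean, median

# HYPOTHETICAL per-question results for one workflow: (score, latency).
# These values are illustrative and are not taken from the benchmark.
runs = [(1.0, 3.1), (0.5, 30.0), (0.75, 3.4)]


def summarize(runs):
    """Aggregate per-question results the way the table headers describe."""
    scores = [score for score, _ in runs]
    latencies = [latency for _, latency in runs]
    return {
        "accuracy_mean": mean(scores),         # average score across questions
        "latency_median": median(latencies),   # middle latency, robust to outliers
    }


summary = summarize(runs)
```

Here the 30.0 outlier would pull a mean latency above 12, while the median stays at 3.4.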

You can use these scores to:

  • See which system gets the best score
  • Compare different models and pipelines side by side
  • Choose the best system for your app, research, or product
  • Download all results for deeper analysis
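As one way to act on the list above, the sketch below loads the accuracy and latency values from the results table and picks a workflow under two simple policies. The helper names and the 0.90 accuracy threshold are assumptions chosen for illustration. The page does not state the latency unit, so the values are only compared with each other.

```python
# (mean accuracy, median latency) per workflow, transcribed from the results table.
results = {
    "GPT-Realtime": (0.04, 5.58),
    "Mbaza+GPT-5.1": (0.92, 3.46),
    "Mbaza+Gemini 3 Pro": (0.95, 8.98),
    "Sunbird+GPT-5.1": (0.63, 4.28),
    "Mbaza+GPT-5.1+GMT": (0.87, 3.25),
    "Omnilingual+GPT-5.1": (0.84, 4.61),
    "Omnilingual+Gemini 3 Pro": (0.95, 9.97),
    "Gemini 3 Pro": (0.90, 9.54),
    "Omnilingual+Gemini 3 Flash": (0.92, 6.06),
    "Mbaza+Gemini 3 Flash": (0.91, 5.04),
}


def best_by_accuracy(results):
    """Highest mean accuracy; ties are broken by lower median latency."""
    return min(results, key=lambda name: (-results[name][0], results[name][1]))


def fastest_above(results, min_accuracy):
    """Lowest-latency workflow whose accuracy meets the threshold, or None."""
    eligible = {n: v for n, v in results.items() if v[0] >= min_accuracy}
    if not eligible:
        return None
    return min(eligible, key=lambda name: eligible[name][1])


most_accurate = best_by_accuracy(results)
fast_and_accurate = fastest_above(results, 0.90)  # 0.90 is an illustrative threshold
```

Two workflows tie at 0.95 accuracy, so the tie-break on latency selects Mbaza+Gemini 3 Pro. With a 0.90 accuracy floor, the fastest eligible workflow is Mbaza+GPT-5.1.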