Swahili Audio to Text Benchmark | Gemini 3 Pro, GPT‑4o, Jacaranda & Omnilingual

(Updated Dec 2025)

This workflow benchmarks multiple Swahili (Kiswahili) speech-to-text pipelines on the same audio dataset. It compares GPT‑4o audio, Jacaranda GPT‑5.1, Gemini 3 Pro, Omni and Swahili → English pipelines with Google Machine Translation.

Workflows covered

  • GPT‑4o Audio
  • GPT Realtime
  • Jacaranda + GPT‑5.1
  • Jacaranda + Gemini 3 Pro
  • Jacaranda GPT‑5.1 + Google MT
  • Omni GPT‑5.1 + Google MT
  • Omni + Gemini 3 Pro
  • Omni + Gemini + Google MT

Data & Evaluation

Swahili audio & reference transcripts from Google Sheets
Automated evaluator: compare-output-text-from-input_audio for text similarity and quality scoring.

Ideal for teams comparing Swahili speech recognition, Swahili audio transcription, and Swahili → English translation for call centers, media, education, and African-language AI applications.

Keywords: Swahili speech-to-text, Kiswahili ASR, Swahili audio transcription, Swahili to English translation, Jacaranda AI Swahili, GPT‑4o audio Swahili, Gemini Pro Swahili transcription, Google MT Swahili.

Gooey Workflows
Input Data Spreadsheet
Loading...
Input Columns

Loading...



Evaluation Workflows


Run cost = 1 credits

With each run, you agree to Gooey.AI's terms & privacy policy.

Run: Compare Output Text (from input_audio) Download

Loading...


Aggregate:Mean

Loading...

Loading...