(Updated Dec 2025)
This workflow benchmarks multiple Swahili (Kiswahili) speech-to-text pipelines on the same audio dataset. It compares GPT‑4o audio, Jacaranda GPT‑5.1, Gemini 3 Pro, Omni and Swahili → English pipelines with Google Machine Translation.
Workflows covered
- GPT‑4o Audio
- GPT Realtime
- Jacaranda + GPT‑5.1
- Jacaranda + Gemini 3 Pro
- Jacaranda GPT‑5.1 + Google MT
- Omni GPT‑5.1 + Google MT
- Omni + Gemini 3 Pro
- Omni + Gemini + Google MT
Data & Evaluation
Swahili audio & reference transcripts from Google Sheets
Automated evaluator: compare-output-text-from-input_audio for text similarity and quality scoring.
Ideal for teams comparing Swahili speech recognition, Swahili audio transcription, and Swahili → English translation for call centers, media, education, and African-language AI applications.
Keywords: Swahili speech-to-text, Kiswahili ASR, Swahili audio transcription, Swahili to English translation, Jacaranda AI Swahili, GPT‑4o audio Swahili, Gemini Pro Swahili transcription, Google MT Swahili.