Save as New
This recipe is used with https://gooey.ai/bulk to evaluate the latest private & open source speech recognition models (from Google, Meta, OpenAI and others). It takes a CSV file of golden (aka human provided) translations and compares those against a set of AI created translations to generate scores from 0 to 1. It then takes the mean of the scores to determine which model performed best.
Run
Examples
API
Upload or link to a CSV or google sheet that contains your sample input data.For example, for Copilot, this would sample questions or for Art QR Code, would would be pairs of image descriptions and URLs.Remember to includes header names in your CSV too.
Submit Links in Bulk
Here's what you uploaded:
Loading...
GPT-4o (openai)
Specify custom LLM prompts to calculate metrics that evaluate each row of the input data. The output should be a JSON object mapping the metric names to values.The columns dictionary can be used to reference the spreadsheet columns.
columns
Add a Prompt
Aggregate using one or more operations. Uses pandas.
mean
Add an Aggregation
⚙️ Settings
Run cost = 190 credits
By submitting, you agree to Gooey.AI's terms & privacy policy.
https://storage.googleapis.com/dara-c1b52.appspot.com/daras_ai/media/7e713680-955f-11ef-af61-02420a000104/evaluator-19.csv
Generated in 31.2s on
...
ℹ️ Details
🙋🏽♀️ Need more help? Join our Discord
Which AI model actually works best for your needs? Upload your own data and evaluate any Gooey.AI workflow, LLM or AI model against any other. Great for large data sets, AI model evaluation, task automation, …
Gooey.AI's Copilot is the best chatbot builder anywhere, combining your choice of LLMs (GPT4o, GPT4o-mini, Gemini, Claude3.5, Mixtral or LLaMA3), knowledge docs from any link or doc/PDF (with table …
Transcribe mp3, WhatsApp audio, YouTube videos + wavs with OpenAI's GPT4o Audio LLM, Whisper, Azure or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.
We've built the best Retrieval Augmented Generation (RAG) as-a-Service anywhere - now with page-level citations! Absorb tables, PDFs, docs, links, videos or audio clips and use our synthetic data maker to …