Evaluator


Input Data Spreadsheet

Upload or link to a CSV or Google Sheet that contains your sample input data.
For example, for Copilot, this would be sample questions; for Art QR Code, it would be pairs of image descriptions and URLs.
Remember to include header names in your CSV too.
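
As a rough illustration of why the header row matters, here is a minimal sketch that parses a small CSV into one dictionary per row, keyed by header name. The column names "question" and "expected_answer" and the sample rows are hypothetical, chosen only to resemble Copilot-style sample questions; they are not required by the Evaluator.

```python
import csv
import io

# Hypothetical sample data: the column names "question" and "expected_answer"
# are illustrative only. The first line is the header row.
SAMPLE_CSV = """question,expected_answer
How often should I water tomato seedlings?,Keep the soil moist but not waterlogged.
When is the best time to plant maize?,At the start of the rainy season.
"""


def load_rows(csv_text):
    """Parse CSV text whose first line is the header row into a list of dicts."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return list(reader)


rows = load_rows(SAMPLE_CSV)

# Each row can now be addressed by header name instead of column position.
print(list(rows[0].keys()))
print(rows[1]["question"])
```

Without the header line, the first data row would be consumed as column names, so every later lookup by name would be wrong.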


Input Data Preview

Here's what you uploaded:



Evaluation Prompts

Specify custom LLM prompts to calculate metrics that evaluate each row of the input data. The output should be a JSON object mapping the metric names to values.
The columns dictionary can be used to reference the spreadsheet columns.
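
The idea can be sketched as follows, under stated assumptions: a prompt template that reads values from a `columns` dictionary, and a parser that checks the model's reply is a JSON object of metric names and values. The template syntax here is Python's `str.format`, which may differ from the syntax the Evaluator itself accepts; the column names, metric names, and the stubbed reply are all hypothetical, and no real LLM is called.

```python
import json

# Hypothetical prompt template. The placeholder syntax is Python's str.format,
# not necessarily the templating syntax the Evaluator uses.
PROMPT_TEMPLATE = (
    "Rate the answer to the question on a scale of 1 to 5.\n"
    "Question: {columns[question]}\n"
    "Answer: {columns[answer]}\n"
    'Respond with a JSON object such as {{"accuracy": 4, "clarity": 5}}.'
)


def build_prompt(columns):
    """Fill the template with the values of one spreadsheet row."""
    return PROMPT_TEMPLATE.format(columns=columns)


def parse_metrics(llm_output):
    """Parse the model's reply and check it is a JSON object of metrics."""
    metrics = json.loads(llm_output)
    if not isinstance(metrics, dict):
        raise ValueError("expected a JSON object mapping metric names to values")
    return metrics


# One row of input data, keyed by (hypothetical) header names.
row = {"question": "What is 2 + 2?", "answer": "4"}
prompt = build_prompt(row)

# Stand-in for a real model reply; no LLM is called in this sketch.
stub_response = '{"accuracy": 5, "clarity": 4}'
metrics = parse_metrics(stub_response)
print(prompt)
print(metrics)
```

Asking for a JSON object explicitly in the prompt, with an example of the expected keys, makes each row's result easy to parse and aggregate later.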


Aggregations

Aggregate the per-row metrics using one or more operations. Aggregation uses pandas.
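
A minimal sketch of what such an aggregation could look like in pandas, assuming each evaluated row produced a dictionary of numeric metrics. The metric names and values below are made up for illustration, and the Evaluator's internal code may differ.

```python
import pandas as pd

# Hypothetical per-row metrics, one dict per evaluated input row.
per_row_metrics = [
    {"accuracy": 5, "clarity": 4},
    {"accuracy": 3, "clarity": 4},
    {"accuracy": 4, "clarity": 1},
]

df = pd.DataFrame(per_row_metrics)

# Apply one or more aggregation operations to every metric column.
# The result has one row per operation and one column per metric.
summary = df.agg(["mean", "median"])
print(summary)
```

Comparing `mean` with `median` is a quick way to spot a metric that is being dragged down by a few outlier rows.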

mean


Run cost = 10 credits


Related Workflows

Bulk Runner and Evaluator

Which AI model actually works best for your needs?
Upload your own data and evaluate any Gooey.AI workflow, LLM or AI model against any other.
Great for large data sets, AI model evaluation, task automation, parallel processing and automated testing.
To get started, paste in a Gooey.AI workflow, upload a CSV of your test data (with header names!), check the mapping of headers to workflow inputs and tap Submit.
More tips in the Details below.

Copilot Builder

Create customized chatbots from your own docs/PDF/webpages. Craft your own bot prompts using the creative GPT3, fast GPT 3.5-turbo or powerful GPT4 & optionally prevent hallucinations by constraining all answers to just your citations. Available as Facebook, Instagram, WhatsApp bots or via API. Add multi-lingual speech recognition and text-to-speech in 100+ languages and even video responses. Collect 👍🏾 👎🏽 feedback + see usage & retention graphs too! This is the workflow that powers https://Farmer.CHAT and it's yours to tweak.

Speech Recognition and Translation

Transcribe mp3, WhatsApp audio + wavs with OpenAI's Whisper or AI4Bharat / Bhashini ASR models. Optionally translate to any language too.

Search any document with GPT4

Add your PDF, Word, HTML or Text docs, train our AI on them with OpenAI embeddings & vector search and then process results with a GPT3 script. This workflow is perfect for anything NOT in ChatGPT: 250-page compliance PDFs, training manuals, your diary, etc.