Which AI model actually works best for your needs? Upload your own data and evaluate any Gooey.AI workflow, LLM or AI model against any other. Great for large data sets, AI model evaluation, task automation, parallel processing and automated testing. To get started, paste in a Gooey.AI workflow, upload a CSV of your test data (with header names!), check the mapping of headers to workflow inputs and tap Submit. More tips in the Details below.
🏃♀️Run
🔖 Examples
🚀 API
Paste in one or more Gooey.AI workflow links (on separate lines). You can add multiple URLs runs from the same recipe (e.g. two versions of your copilot) and we'll run the inputs over both of them.
Upload or link to a CSV or google sheet that contains your sample input data. For example, for Copilot, this would sample questions or for Art QR Code, would would be pairs of image descriptions and URLs. Remember to includes header names in your CSV too.
Enter Custom URLs
Here's what you uploaded:
Loading...
Please select which CSV column corresponds to your workflow's input fields. For the outputs, please fill in what the column name should be that corresponds to each output too. To understand what each field represents, check out our API docs.
Input Prompt
Output Text
Run URL
🤲 Show All Columns
Bot Script
Input Images
Messages.Role
Messages.Content
Messages.Display Name
Tts Provider
Uberduck Voice Name
Uberduck Speaking Rate
Google Voice Name
Google Speaking Rate
Google Pitch
Bark History Prompt
Elevenlabs Voice Name
Elevenlabs Api Key
Elevenlabs Voice Id
Elevenlabs Model
Elevenlabs Stability
Elevenlabs Similarity Boost
Selected Model
🩻 Photo / Document Intelligence
Avoid Repetition
Num Outputs
Quality
Max Tokens
Sampling Temperature
Input Face
Face Padding Top
Face Padding Bottom
Face Padding Left
Face Padding Right
Task Instructions
Query Instructions
Keyword Instructions
Documents
Max References
Max Context Words
Scroll Jump
Dense Embeddings Weightage
Citation Style
Use Url Shortener
User Language
Input Glossary
Output Glossary
Variables
Price
Run Time
Error Msg
Final Prompt
Output Audio
Output Video
Raw Input Text
Raw Tts Text
Raw Output Text
References
Final Search Query
Final Keyword Query
⚙️ Settings
Title
Notes
Run cost = 5 credits
🏃 Submit
By submitting, you agree to Gooey.AI's terms & privacy policy.
✅
Success! Run Time: 19.16 seconds.
19.16
https://storage.googleapis.com/dara-c1b52.appspot.com/daras_ai/media/fc73dbe0-678f-11ee-8863-02420a000141/bulk-runner-0-0-2.csv
ℹ️ Details
Building complex AI workflows like copilot) and then evaluating each iteration is complex. Workflows are affected by the particular LLM used (GPT4 vs PalM2), their vector DB knowledge sets (e.g. your google docs), how synthetic data creation happened (e.g. how you transformed your video transcript or PDF into structured data), which translation or speech engine you used and your LLM prompts. Every change can affect the quality of your outputs.
To get started:
🙋🏽♀️ Need more help? Join our Discord