• Gooey.AI
  • Examples: Bulk


    Farmer.CHAT Bulk Evaluator (GPT-4o, Mixtral, Claude vs Gemini Pro 1.5)

    7 runs

    Which AI model actually works best for your needs? Upload your own data and evaluate any Gooey.AI workflow, LLM or AI model against any other. Great for large data sets, AI model evaluation, task automation, parallel processing and automated testing. To get started, paste in a Gooey.AI workflow, upload a CSV of your test data (with header names!), check the mapping of headers to workflow inputs and tap Submit. More tips in the Details below.
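    As a rough illustration of the "header names" and "mapping" steps described above, the sketch below checks a CSV locally before upload. The function `check_header_mapping`, the input names `question` and `language`, and the sample row are all hypothetical. They are not taken from Gooey.AI, which does the actual mapping in its own UI.

```python
import csv
import io


def check_header_mapping(csv_text, workflow_inputs):
    """Return (mapped, missing) for a CSV's header row.

    mapped  : workflow input name -> CSV header with the same name
    missing : workflow inputs that have no matching header
    """
    reader = csv.reader(io.StringIO(csv_text))
    headers = [h.strip() for h in next(reader, [])]
    # The Bulk Runner description asks for header names on every column.
    if not headers or any(h == "" for h in headers):
        raise ValueError("CSV needs a header row with a name for every column")
    mapped = {name: name for name in workflow_inputs if name in headers}
    missing = [name for name in workflow_inputs if name not in headers]
    return mapped, missing


# Made-up test data; the column and input names are assumptions.
SAMPLE_CSV = (
    "question,expected_answer\n"
    "When should I plant maize?,At the start of the rains\n"
)

mapped, missing = check_header_mapping(SAMPLE_CSV, ["question", "language"])
print("mapped:", mapped)
print("missing:", missing)
```

    Matching by exact name is only the simplest possible rule. Any input reported as missing would need to be mapped by hand.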

    Documents

    Caching test: Gooey Bot Bulk Run

    2 runs

    This run is useful if you are only interested in regression testing, monitoring and evaluation, observability, and bugs.

    Documents

    Gooey.AI

    SimpleBench Bulk Eval

    12 runs

    Here we run the questions from the public dataset of the SimpleBench benchmark against the top models and compare how well they perform. The questions are fairly obvious to most humans but tend to trip up LLMs quite badly. If you want to try your hand at the Jan 2025 prompt engineering contest, duplicate any of the workflows below, save it, and then add your saved workflow as another workflow to compare against (it's OK to delete the others too).

    Documents

    Gooey.AI

    Compare Chichewa Speech Recognition

    21 runs

    In this bulk run example, we compare Gooey.AI speech recognition + translations workflows (https://gooey.ai/speech), each of which uses a different AI speech model:

    1) MMS Large + Google Translation
    2) Seamless M4T - WINNER!
    3) Google Chirp + Google Translate

    We then evaluate them at https://gooey.ai/eval/ (where you can see what works best....)

    Documents

    Compare KIKUYU Speech Recognition

    31 runs

    Here we take an input sample of Kikuyu phrases (with their expert transcriptions and English translations) and compare the top speech recognition and translation models. As of April 3, 2024, we've found that Meta's MMS-Large works best for Kikuyu when paired with Ghana NLP's Kikuyu to English translation services.

    Documents

    Compare Swahili Speech Recognition

    26 runs

    Compare Swahili speech recognition and translation to English with this run. It includes 10 variations, among them the latest Seamless M4T v2, MMS, Azure, Whisper v3 and more.

    Documents

    Bulk Runner - Bangla

    1 run

    Which AI model actually works best for your needs? Upload your own data and evaluate any Gooey.AI workflow, LLM or AI model against any other. Great for large data sets, AI model evaluation, task automation, parallel processing and automated testing. To get started, paste in a Gooey.AI workflow, upload a CSV of your test data (with header names!), check the mapping of headers to workflow inputs and tap Submit. More tips in the Details below.

    Documents

    Compare Hindi Speech Recognition

    15 runs

    Which AI model actually works best for your needs? Upload your own data and evaluate any Gooey.AI workflow, LLM or AI model against any other. Great for large data sets, AI model evaluation, task automation, parallel processing and automated testing. To get started, paste in a Gooey.AI workflow, upload a CSV of your test data (with header names!), check the mapping of headers to workflow inputs and tap Submit. More tips in the Details below.

    Documents

    Farmer.CHAT - Bulk Runner and Evaluator (GPT4, Mixtral Comparison)

    3 runs

    Which AI model actually works best for your needs? Upload your own data and evaluate any Gooey.AI workflow, LLM or AI model against any other. Great for large data sets, AI model evaluation, task automation, parallel processing and automated testing. To get started, paste in a Gooey.AI workflow, upload a CSV of your test data (with header names!), check the mapping of headers to workflow inputs and tap Submit. More tips in the Details below.

    Documents

    Gooey.AI

    Compare Kannada Speech Recognition

    6 runs

    In this bulk run example, we compare 3 different Gooey.AI speech recognition + translations workflows (https://gooey.ai/speech), each of which uses a different AI speech model:

    Whisper v2
    Google Chirp
    Azure

    We then evaluate them at https://gooey.ai/eval/ (where you can see what works best....)

    Documents

    Gooey.AI

    Karpathy Pod answers

    18 runs

    In this bulk run example, we compare 6 different Gooey.AI speech recognition + translations workflows (https://gooey.ai/speech), each of which uses a different AI speech model:
    Whisper Telugu Bhashini
    Whisper v2
    Google Chirp
    Azure
    Meta Seamless M4T
    AI4Bharat.org Conformer Hindi

    We then evaluate them at https://gooey.ai/eval/ (where you can see what works best....)

    Documents

    Gooey.AI

    Compare Telugu Speech Recognition

    207 runs

    In this bulk run example, we compare 5 different Gooey.AI speech recognition + translations workflows (https://gooey.ai/speech), each of which uses a different AI speech model:
    Whisper Telugu Bhashini
    Whisper v2
    Google Chirp
    Azure
    Meta Seamless M4T

    We then evaluate them at https://gooey.ai/eval/?example_id=lc1f4ka1 (where you can see what works best....)

    Documents

    Gooey.AI

    Ulangizi Copilot Eval

    2 runs

    Testing AI bots and copilots (https://gooey.ai/copilot) in practice is complex because the answer a bot will give is affected not just by the user's last question but also by the conversation history that preceded that question. Hence, this bulk example shows you how to format your CSV to include a bot/user conversation history before each question is asked. The output columns also show which knowledge documents were returned as results that the bot used in trying to formulate its answer.
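    To make the idea of a per-question conversation history concrete, here is a minimal sketch of one way such a CSV could be built. The column names `conversation_history` and `question`, the JSON encoding of the history, and the sample dialogue are assumptions made for illustration. The layout used by the actual example is defined by its own CSV, which is not reproduced here.

```python
import csv
import io
import json


def build_rows(test_cases):
    """Turn (history, question) pairs into CSV-ready dicts.

    history is a list of (role, text) tuples that precede the question.
    """
    rows = []
    for history, question in test_cases:
        rows.append({
            # Hypothetical encoding: one JSON list of role/content objects.
            "conversation_history": json.dumps(
                [{"role": role, "content": text} for role, text in history]
            ),
            "question": question,
        })
    return rows


def to_csv(rows):
    """Serialize rows to CSV text with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["conversation_history", "question"])
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


# Made-up test cases: one with prior turns, one with no history.
cases = [
    (
        [("user", "My maize leaves are turning yellow."),
         ("assistant", "How long ago did you plant?")],
        "About three weeks ago. What should I do?",
    ),
    ([], "What fertilizer suits sandy soil?"),
]

csv_text = to_csv(build_rows(cases))
print(csv_text)
```

    Keeping the whole history in one cell means each CSV row stays a single self-contained test case, and the `csv` module handles quoting of the embedded JSON.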

    Documents

    Gooey.AI

    Copilot Message History Eval

    3 runs

    Testing AI bots and copilots (https://gooey.ai/copilot) in practice is complex because the answer a bot will give is affected not just by the user's last question but also by the conversation history that preceded that question. Hence, this bulk example shows you how to format your CSV to include a bot/user conversation history before each question is asked. The output columns also show which knowledge documents were returned as results that the bot used in trying to formulate its answer.

    Documents

    Gooey.AI

    Bulk Runner for different dense embeddings weightages

    7 runs

    This example takes a CSV with URLs and image descriptions and then runs our artistic QR Code maker over each of them, generating two QR codes for each URL + description pair. This is great if you want to create 100s of art QR Codes in one huge batch.

    Documents

    Gooey.AI

    Compare Anime vs Pixar Profile Pix style

    2 runs

    This bulk run compares two AI image editor workflows that take profile pictures and render them in different styles: one anime, one Pixar. Swap out the CSV with your own list of profile photo URLs to test even more. Swap out the workflow URLs with different image prompts to test out new styles.

    Documents

    Gooey.AI

    Bulk Create Art QR codes

    11 runs

    This example takes a CSV with URLs and image descriptions and then runs our artistic QR Code maker over each of them, generating two QR codes for each URL + description pair. This is great if you want to create 100s of art QR Codes in one huge batch.
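    A small sketch of the input side of this batch, under stated assumptions: the column names `url` and `image_description`, the helper `valid_rows`, and the sample rows are hypothetical. Only the figure of two QR codes per row comes from the description above.

```python
import csv
import io
from urllib.parse import urlparse

QR_CODES_PER_ROW = 2  # "two QR codes for each URL + description pair"


def valid_rows(csv_text):
    """Keep rows that have an http(s) URL and a non-empty description."""
    rows = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        url = (row.get("url") or "").strip()
        description = (row.get("image_description") or "").strip()
        parsed = urlparse(url)
        if parsed.scheme in ("http", "https") and parsed.netloc and description:
            rows.append(row)
    return rows


# Made-up input; the last row is deliberately malformed.
SAMPLE_CSV = (
    "url,image_description\n"
    "https://example.com/menu,a bowl of ramen in watercolor\n"
    "https://example.org/about,a lighthouse at sunset\n"
    "not-a-url,a forest path\n"
)

rows = valid_rows(SAMPLE_CSV)
expected_qr_codes = len(rows) * QR_CODES_PER_ROW
print(f"{len(rows)} usable rows, expecting {expected_qr_codes} QR codes")
```

    Filtering bad rows before upload gives a quick estimate of how many images a batch should produce, which is useful when the CSV holds hundreds of entries.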

    Documents