Quickstart: Datasets & Experiments

This guide helps you run experiments to test and validate changes in your LLM applications against a curated dataset. For end-to-end walkthroughs, check out our cookbooks.

1. Upload a CSV as a dataset

Download this sample CSV and upload it into the UI.

2. Test a prompt in playground

Load the dataset you created into prompt playground, and run it to see your results. Once you've finished the run, you can save it as an experiment to track your changes.

3. Run an evaluator on your playground experiments

Create a task to run evaluations on your experiment results. Arize will run the evaluator task in the background as soon as you create the task.

Compare experiments

With Diff Mode enabled, you can compare experiments side-by-side to easily spot improvements and regressions. Learn more

Last updated

Was this helpful?