Quickstart: Datasets & Experiments
This guide helps you run experiments to test and validate changes in your LLM applications against a curated dataset. For end-to-end walkthroughs, check out our cookbooks.
1. Upload a CSV as a dataset
Download this sample CSV and upload it into the UI.
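If you would rather build your own file than use the sample, the sketch below shows one way to produce a small dataset CSV with Python's standard library. The column names `input` and `expected_output` are assumptions chosen for illustration; this guide does not specify the sample CSV's schema, so adjust them to match your data.

```python
import csv
import io

# Hypothetical rows: the column names are illustrative, not a required schema.
ROWS = [
    {"input": "What is the capital of France?", "expected_output": "Paris"},
    {"input": "What is 2 + 2?", "expected_output": "4"},
]


def build_dataset_csv(rows):
    """Serialize a non-empty list of dicts to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


csv_text = build_dataset_csv(ROWS)
print(csv_text)
```

Write `csv_text` to a `.csv` file (opened with `newline=""`) and upload that file in the UI.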
2. Test a prompt in the playground
Load the dataset you created into the prompt playground and run your prompt against it to see the results. Once the run finishes, you can save it as an experiment to track your changes.
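Conceptually, running a prompt against a dataset means rendering the prompt once per row. The sketch below illustrates that idea in plain Python; it assumes a template with an `{input}` placeholder and rows with an `input` column, both hypothetical, and it is not how the playground is implemented.

```python
def render_prompts(template, rows):
    """Fill the template once per dataset row; unused columns are ignored."""
    return [template.format(**row) for row in rows]


# Hypothetical rows and template, for illustration only.
rows = [
    {"input": "What is the capital of France?", "expected_output": "Paris"},
    {"input": "What is 2 + 2?", "expected_output": "4"},
]
template = "Answer briefly: {input}"

for prompt in render_prompts(template, rows):
    print(prompt)
```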
3. Run an evaluator on your playground experiments
Create a task to run evaluations on your experiment results. Arize runs the evaluator task in the background as soon as you create it.
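To make the idea of an evaluator concrete, here is a minimal sketch of a code-based check that scores each experiment output against its expected value. The function names, the `output` and `expected_output` keys, and the label and score format are all assumptions for illustration; they are not Arize's evaluator API.

```python
def exact_match_evaluator(output, expected):
    """Score 1.0 when the strings match after trimming and lowercasing."""
    matched = output.strip().lower() == expected.strip().lower()
    return {
        "label": "correct" if matched else "incorrect",
        "score": 1.0 if matched else 0.0,
    }


def evaluate_run(run):
    """Apply the evaluator to every result row of a (hypothetical) run."""
    return [
        exact_match_evaluator(row["output"], row["expected_output"])
        for row in run
    ]


# Hypothetical experiment results, for illustration only.
run = [
    {"output": " Paris ", "expected_output": "Paris"},
    {"output": "5", "expected_output": "4"},
]
print(evaluate_run(run))
```

An exact-match check is only one option; evaluators for open-ended answers typically need a more tolerant comparison.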
Compare experiments
With Diff Mode enabled, you can compare experiments side-by-side to easily spot improvements and regressions.
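The comparison itself can be pictured as a per-example diff of evaluation scores. The sketch below classifies each example as improved, regressed, or unchanged between two runs. The dictionaries keyed by example ID and the `tolerance` parameter are hypothetical; this illustrates the concept, not how Diff Mode works internally.

```python
def diff_experiments(baseline, candidate, tolerance=0.0):
    """Classify example IDs present in both runs by their score change."""
    result = {"improved": [], "regressed": [], "unchanged": []}
    for example_id in sorted(baseline.keys() & candidate.keys()):
        delta = candidate[example_id] - baseline[example_id]
        if delta > tolerance:
            result["improved"].append(example_id)
        elif delta < -tolerance:
            result["regressed"].append(example_id)
        else:
            result["unchanged"].append(example_id)
    return result


# Hypothetical per-example scores from two experiments.
baseline = {"ex1": 0.0, "ex2": 1.0, "ex3": 1.0}
candidate = {"ex1": 1.0, "ex2": 0.0, "ex3": 1.0}
print(diff_experiments(baseline, candidate))
```

Examples that appear in only one of the two runs are skipped, since there is nothing to compare them against.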