Create versioned datasets for experimentation, evaluation, and fine-tuning. Supports Python dicts and pandas DataFrames.
The datasets client methods are currently in BETA. The API may change without notice. A one-time warning is emitted on first use.
Create versioned datasets for experimentation, evaluation, and fine-tuning. Datasets are version-controlled collections of examples. Updates modify the current version in-place.
Delete a dataset by name or ID. This operation is irreversible. There is no response from this call.
client.datasets.delete( dataset="dataset-name-or-id", space="your-space-name-or-id", # required when using a name)print("Dataset deleted successfully")
Retrieve examples from a dataset with pagination support. Pass all=True to fetch all examples via Flight (ignores limit).
resp = client.datasets.list_examples( dataset="dataset-name-or-id", space="your-space-name-or-id", # required when using a name limit=100,)for example in resp.examples: print(example)
For details on pagination, field introspection, and data conversion (to dict/JSON/DataFrame), see Response Objects.
Add new examples to an existing dataset. Examples are appended in-place to the latest dataset version by default — this does not create a new version. You can target a specific version by passing dataset_version_id.
new_examples = [ { "query": "What is machine learning?", "expected_output": "A subset of AI focused on learning from data", "eval.Correctness.label": "correct", }, { "query": "Who invented Python?", "expected_output": "Guido van Rossum", "eval.Correctness.label": "correct", },]updated_dataset = client.datasets.append_examples( dataset="dataset-name-or-id", space="your-space-name-or-id", # required when using a name examples=new_examples,)
Note: Do not include system-managed fields (id, created_at, updated_at) in your examples. These are automatically generated by the server.Learn more:Datasets Documentation