
Upload Minimal Dataset

If you don’t have production traces but can provide a small set of labeled examples for your task, you can prepare a minimal structured dataset. You only need a few dozen high-quality examples that capture the essence of your task.

Your data directory needs the following files:

| File | Format | Required | Description |
| --- | --- | --- | --- |
| job_description.json | JSON | Yes | Task description defining what the model should do |
| train.csv | CSV or JSONL | Yes | 20+ labeled (question, answer) pairs |
| test.csv | CSV or JSONL | Yes | Held-out evaluation set |
| config.yaml | YAML | Yes | Training hyperparameters |
| unstructured.csv | CSV or JSONL | No | Domain-relevant text for synthetic data generation |

distil model upload-data <model-id> --data ./data
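
Before uploading, it can help to check the data directory locally against the file list above. The following is a minimal sketch and not part of the distil tooling: `check_data_dir`, `find_split`, and `count_rows` are hypothetical helper names, and the script assumes that the JSONL variants use a `.jsonl` extension and that CSV files have a header row. It only checks that the required files exist, that `job_description.json` parses as JSON, and that the training split has at least 20 rows; it does not validate the schema of any file.

```python
import csv
import json
from pathlib import Path
from typing import List, Optional

# Required non-split files, taken from the table above.
REQUIRED = ["job_description.json", "config.yaml"]
# The table asks for 20+ labeled pairs in the training split.
MIN_TRAIN_ROWS = 20


def find_split(data_dir: Path, stem: str) -> Optional[Path]:
    """Return <stem>.csv or <stem>.jsonl if present (extension is an assumption)."""
    for ext in (".csv", ".jsonl"):
        candidate = data_dir / f"{stem}{ext}"
        if candidate.is_file():
            return candidate
    return None


def count_rows(path: Path) -> int:
    """Count data rows: CSV rows after the header, or non-empty JSONL lines."""
    with path.open(newline="", encoding="utf-8") as fh:
        if path.suffix == ".csv":
            return sum(1 for _ in csv.DictReader(fh))
        return sum(1 for line in fh if line.strip())


def check_data_dir(data_dir) -> List[str]:
    """Return a list of human-readable problems; an empty list means none found."""
    data_dir = Path(data_dir)
    problems = []

    for name in REQUIRED:
        if not (data_dir / name).is_file():
            problems.append(f"missing {name}")

    job = data_dir / "job_description.json"
    if job.is_file():
        try:
            json.loads(job.read_text(encoding="utf-8"))
        except json.JSONDecodeError as exc:
            problems.append(f"job_description.json is not valid JSON: {exc}")

    for stem in ("train", "test"):
        split = find_split(data_dir, stem)
        if split is None:
            problems.append(f"missing {stem}.csv (or {stem}.jsonl)")
        elif stem == "train":
            rows = count_rows(split)
            if rows < MIN_TRAIN_ROWS:
                problems.append(
                    f"{split.name} has {rows} rows; expected at least {MIN_TRAIN_ROWS}"
                )

    return problems
```

Calling `check_data_dir("./data")` returns an empty list when none of these checks fail, so the result can be printed or used to stop a script before the upload step.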

For detailed formatting and structure requirements per task type, refer to: