← Dataset
Download

Download the dataset.

Five training-ready formats. Each file is generated live when you click. Drop into trl.DPOTrainer, openai.fine_tuning, Axolotl, or LLaMA-Factory.

SFT · chat format

OpenAI · HuggingFace · TRL

Public human responses scoring ≥ 75/100, as { messages: [...] } per line.

26 rows

Download

SFT · ShareGPT

Axolotl · LLaMA-Factory

Same threshold, ShareGPT conventions.

26 rows

Download

SFT · Alpaca

Stanford Alpaca · LoRA

{ instruction, input, output } per line.

26 rows

Download

DPO · preference pairs

TRL DPOTrainer

{ prompt, chosen, rejected }; chosen beats rejected by ≥ 5 points.

340 rows

Download

Raw

Everything

Every scenario, response, per-criterion judgment with rationales.

214 rows

Download

Licensing

Released for research. Contributors consented to anonymous public release. Please do not use the corpus to train systems that manipulate emotionally vulnerable users.