nvidia
/

Llama-3_1-Nemotron-Ultra-253B-v1

Text Generation

Model card Files Files and versions

jiaqiz commited on Apr 9

Commit

70378c3

·

verified ·

1 Parent(s): 2f63d7e

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -208,6 +208,8 @@ The data for the multi-stage post-training phases is a compilation of SFT and RL
 Prompts have been sourced from either public and open corpus or synthetically generated. Responses were synthetically generated by a variety of models, with some prompts containing responses for both reasoning on and off modes, to train the model to distinguish between two modes. This model was improved with Qwen.
 **Data Collection for Training Datasets:**
 - Hybrid: Automated, Human, Synthetic

 Prompts have been sourced from either public and open corpus or synthetically generated. Responses were synthetically generated by a variety of models, with some prompts containing responses for both reasoning on and off modes, to train the model to distinguish between two modes. This model was improved with Qwen.
+We have released our [Llama-Nemotron-Post-Training-Dataset](https://huggingface.co/datasets/nvidia/Llama-Nemotron-Post-Training-Dataset) to promote openness and transparency in model development and improvement.
 **Data Collection for Training Datasets:**
 - Hybrid: Automated, Human, Synthetic