jiaqiz committed on
Commit 70378c3 · verified · 1 Parent(s): 2f63d7e

Update README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -208,6 +208,8 @@ The data for the multi-stage post-training phases is a compilation of SFT and RL
 
 Prompts have been sourced from either public and open corpus or synthetically generated. Responses were synthetically generated by a variety of models, with some prompts containing responses for both reasoning on and off modes, to train the model to distinguish between two modes. This model was improved with Qwen.
 
+We have released our [Llama-Nemotron-Post-Training-Dataset](https://huggingface.co/datasets/nvidia/Llama-Nemotron-Post-Training-Dataset) to promote openness and transparency in model development and improvement.
+
 **Data Collection for Training Datasets:**
 
 - Hybrid: Automated, Human, Synthetic