Update README.md
README.md
CHANGED
@@ -208,6 +208,8 @@ The data for the multi-stage post-training phases is a compilation of SFT and RL
 
 Prompts have been sourced from either public and open corpus or synthetically generated. Responses were synthetically generated by a variety of models, with some prompts containing responses for both reasoning on and off modes, to train the model to distinguish between two modes. This model was improved with Qwen.
 
+We have released our [Llama-Nemotron-Post-Training-Dataset](https://huggingface.co/datasets/nvidia/Llama-Nemotron-Post-Training-Dataset) to promote openness and transparency in model development and improvement.
+
 **Data Collection for Training Datasets:**
 
 - Hybrid: Automated, Human, Synthetic