Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
princeton-nlp
/
warm-start__grpo__think__Llama-3.1-8B-Instruct
like
0
Safetensors
llama
Model card
Files
Files and versions
xet
Community
main
warm-start__grpo__think__Llama-3.1-8B-Instruct
/
longcot_config.json
princeton-nlp
Uploading the models
d6d5150
verified
about 1 month ago
raw
Copy download link
history
blame
contribute
delete
124 Bytes
{
"longcot"
:
true
,
"longcot_delimiter"
:
"</think>"
,
"end_delimiter"
:
null
,
"start_think_marker"
:
"<think>"
}