Adding `safetensors` variant of this model
#4 opened over 1 year ago by SFconvertbot
The output of the reward model is a two-dimensional vector. What does each dimension mean?
#3 opened almost 2 years ago by Lily912
More details on training data for reward model
#2 opened about 2 years ago by reign12
Where is the input file of augment_oasst?
#1 opened over 2 years ago by LetsJumP