davidkim205
/

Rhea-72b-v0.5

Text Generation

text-generation-inference

Model card Files Files and versions

davidkim205 commited on Apr 3, 2024

Commit

40bd979

·

verified ·

1 Parent(s): fe4f3ca

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -214,7 +214,7 @@ This method proposes a novel method for generating datasets for DPO (Self-superv
 * If the data set cannot be found, it is internal company data and cannot be made public.
 ## dpo dataset info : datasets_encomp_151k
-Randomly selecting data from each category within the training dataset, we constructed a DPO (Data Perturbation Object) dataset using sentences with logits lower than the mean within the model-generated sentences.
 * I'm sorry I can't reveal it.
 ## Evaluation

 * If the data set cannot be found, it is internal company data and cannot be made public.
 ## dpo dataset info : datasets_encomp_151k
+Randomly selecting data from each category within the training dataset, we constructed a DPO (Direct Preference Optimization) dataset using sentences with logits lower than the mean within the model-generated sentences.
 * I'm sorry I can't reveal it.
 ## Evaluation