pkshatech/GLuCoSE-base-ja-v2

Sentence Similarity

sentence-transformers

feature-extraction

yano0 committed on Sep 2, 2024

Commit 69c2c00 · verified · 1 Parent(s): 83027ed

Update README.md

Files changed (1)

README.md +9 -3

@@ -34,12 +34,18 @@ This is a [sentence-transformers](https://www.SBERT.net) model trained. It maps
 ## Model Details
 The model is based on GLuCoSE and additional fine-tuned.
 Fine-tuning consists of the following steps.
-Step 1: Ensemble distillation
 - The embedded representation was distilled using E5-mistral, gte-Qwen2 and mE5-large as teacher models.
-Step 2: Contrast learning
 -  Triples were created from JSNLI, MNLI, PAWS-X, JSeM and Mr.TyDi and used for training.
 - This training aimed to improve the overall performance as a sentence embedding model.
-Step 3: Search-specific contrastive learning.
 - In order to make the model more robust to the retrieval task, additional two-stage training with QA and question-answer data was conducted.
 - In the first stage, the synthetic dataset auto-wiki was used for training, while in the second stage, Japanese Wikipedia Human Retrieval, Mr.TyDi, MIRACL, JQaRA, MQA, Quiz Works and Quiz No Mori were used.

 ## Model Details
 The model is based on GLuCoSE and additional fine-tuned.
 Fine-tuning consists of the following steps.
+**Step 1: Ensemble distillation**
 - The embedded representation was distilled using E5-mistral, gte-Qwen2 and mE5-large as teacher models.
+**Step 2: Contrast learning**
 -  Triples were created from JSNLI, MNLI, PAWS-X, JSeM and Mr.TyDi and used for training.
 - This training aimed to improve the overall performance as a sentence embedding model.
+**Step 3: Search-specific contrastive learning.**
 - In order to make the model more robust to the retrieval task, additional two-stage training with QA and question-answer data was conducted.
 - In the first stage, the synthetic dataset auto-wiki was used for training, while in the second stage, Japanese Wikipedia Human Retrieval, Mr.TyDi, MIRACL, JQaRA, MQA, Quiz Works and Quiz No Mori were used.
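
Note for readers of this diff: Step 2 trains on triples, so a rough illustration of a triplet-based contrastive objective may help. The following is a minimal sketch in plain Python, not the authors' training code. The README does not state the loss, so the softmax-over-similarities form, the `temperature` value, the function names and the toy vectors are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def triplet_contrastive_loss(anchor, positive, negative, temperature=0.05):
    """Cross-entropy over the (positive, negative) similarities of one triple.

    Assumed form: -log( exp(s_pos / t) / (exp(s_pos / t) + exp(s_neg / t)) ).
    The temperature of 0.05 is a hypothetical value, not taken from the README.
    """
    pos = cosine(anchor, positive) / temperature
    neg = cosine(anchor, negative) / temperature
    # Log-sum-exp with the maximum subtracted for numerical stability.
    m = max(pos, neg)
    log_denominator = m + math.log(math.exp(pos - m) + math.exp(neg - m))
    return log_denominator - pos


# Toy embeddings standing in for encoder outputs (hypothetical values).
anchor = [1.0, 0.0]
positive = [0.9, 0.1]
negative = [0.0, 1.0]

# Loss is small when the anchor is closer to the positive than to the negative,
# and large when the roles of positive and negative are swapped.
good = triplet_contrastive_loss(anchor, positive, negative)
bad = triplet_contrastive_loss(anchor, negative, positive)
```

In an actual run the three vectors would come from the model being fine-tuned, and the loss would be averaged over a batch of triples before backpropagation.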