Update README.md
Browse files
README.md
CHANGED
|
@@ -39,6 +39,39 @@ Prometheus is trained with two different sizes (7B and 13B).
|
|
| 39 |
You could check the 7B sized LM on [this page](https://huggingface.co/kaist-ai/prometheus-7b-v1.0).
|
| 40 |
Also, check out our dataset as well on [this page](https://huggingface.co/datasets/kaist-ai/Feedback-Collection).
|
| 41 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 42 |
## License
|
| 43 |
Feedback Collection and Prometheus is subject to OpenAI's Terms of Use for the generated data. If you suspect any violations, please reach out to us.
|
| 44 |
|
|
|
|
| 39 |
You could check the 7B sized LM on [this page](https://huggingface.co/kaist-ai/prometheus-7b-v1.0).
|
| 40 |
Also, check out our dataset as well on [this page](https://huggingface.co/datasets/kaist-ai/Feedback-Collection).
|
| 41 |
|
| 42 |
+
## Prompt Format
|
| 43 |
+
Prometheus requires 4 components in the input: An instruction, a response to evaluate, a score rubric, and a reference answer. You could refer to the prompt format below.
|
| 44 |
+
You should fill in the instruction, response, reference answer, criteria description, and score description for score in range of 1 to 5.
|
| 45 |
+
```
|
| 46 |
+
###Task Description:
|
| 47 |
+
An instruction (might include an Input inside it), a response to evaluate, a reference answer that gets a score of 5, and a score rubric representing a evaluation criteria are given.
|
| 48 |
+
1. Write a detailed feedback that assess the quality of the response strictly based on the given score rubric, not evaluating in general.
|
| 49 |
+
2. After writing a feedback, write a score that is an integer between 1 and 5. You should refer to the score rubric.
|
| 50 |
+
3. The output format should look as follows: \"Feedback: (write a feedback for criteria) [RESULT] (an integer number between 1 and 5)\"
|
| 51 |
+
4. Please do not generate any other opening, closing, and explanations.
|
| 52 |
+
|
| 53 |
+
###The instruction to evaluate:
|
| 54 |
+
{instruction}
|
| 55 |
+
|
| 56 |
+
###Response to evaluate:
|
| 57 |
+
{response}
|
| 58 |
+
|
| 59 |
+
###Reference Answer (Score 5):
|
| 60 |
+
{reference_answer}
|
| 61 |
+
|
| 62 |
+
###Score Rubrics:
|
| 63 |
+
[{criteria_description}]
|
| 64 |
+
Score 1: {score1_description}
|
| 65 |
+
Score 2: {score2_description}
|
| 66 |
+
Score 3: {score3_description}
|
| 67 |
+
Score 4: {score4_description}
|
| 68 |
+
Score 5: {score5_description}
|
| 69 |
+
|
| 70 |
+
###Feedback:
|
| 71 |
+
```
|
| 72 |
+
|
| 73 |
+
As a result, a feedback and score decision will be generated, divided by a separating phrase ```[RESULT]```
|
| 74 |
+
|
| 75 |
## License
|
| 76 |
Feedback Collection and Prometheus is subject to OpenAI's Terms of Use for the generated data. If you suspect any violations, please reach out to us.
|
| 77 |
|