kevinpro committed
Commit b220808 · 1 Parent(s): 562362e
commit message
- __pycache__/content.cpython-38.pyc +0 -0
- content.py +10 -11
__pycache__/content.cpython-38.pyc CHANGED
Binary files a/__pycache__/content.cpython-38.pyc and b/__pycache__/content.cpython-38.pyc differ

content.py CHANGED
@@ -3,25 +3,24 @@ TITLE = '<h1 align="center" id="space-title">Open Multilingual Reasoning Leaderb
 INTRO_TEXT = f"""
 ## About
 
-This leaderboard tracks
-emphasizing on non-English languages to democratize benefits of LLMs to broader society.
-Our current leaderboard provides evaluation data for 10 languages.
+This leaderboard tracks and ranks the reasoning performance of the leading, most advanced multilingual reasoning LLMs on three multilingual mathematical reasoning benchmarks. Each benchmark contains 10 languages: Bengali, Swedish, Thailand, Chinese, Japan, Russian, French, Spanish, German and English.
 Both multilingual and language-specific LLMs are welcome in this leaderboard.
-
+
+## Benchmarks
 
 - <a href="https://huggingface.co/datasets/Mathoctopus/MSVAMP" target="_blank"> MSVAMP </a>
 - <a href="https://huggingface.co/datasets/juletxara/mgsm" target="_blank"> MGSM </a>
-- <a href="https://
+- <a href="https://huggingface.co/datasets/kevinpro/MNumGLUESub" target="_blank"> MNumGLUESub </a>
 
-
+"""
 
-
-
+HOW_TO = f"""
+## How to list your model performance on this leaderboard:
 
-
+Run the evaluation of your model using this repo: <a href="https://github.com/NJUNLP/MAPO" target="_blank">https://github.com/NJUNLP/MAPO</a>.
 
-
-
+And then, push the evaluation log and make a pull request.
+"""
 
 # CREDIT = f"""
 # ## Credit
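
The new `INTRO_TEXT` says the leaderboard "tracks and ranks" models across the languages of each benchmark, but the diff does not show how a ranking is computed. The following is only a minimal sketch, assuming models are ordered by their unweighted mean accuracy over languages. The function `rank_models`, the model names, and every score below are hypothetical placeholders, not taken from the repository or from any real evaluation.

```python
from statistics import mean


def rank_models(scores):
    """Rank models by mean per-language accuracy, best first.

    `scores` maps a model name to a {language: accuracy} dict.
    Assumption: every language counts equally (unweighted mean).
    """
    averaged = {
        model: mean(per_language.values())
        for model, per_language in scores.items()
    }
    return sorted(averaged.items(), key=lambda item: item[1], reverse=True)


# Placeholder numbers for two hypothetical models; these are NOT real results.
example_scores = {
    "model-a": {"English": 0.50, "French": 0.40},
    "model-b": {"English": 0.70, "French": 0.60},
}

ranking = rank_models(example_scores)
for position, (model, average) in enumerate(ranking, start=1):
    print(f"{position}. {model}: {average:.2f}")
```

An unweighted mean treats all languages as equally important. A leaderboard that weights languages differently, or that reports each benchmark separately, would need a different aggregation step.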