Seems lagging behind Pangu-1B

by hankaixyz - opened 17 days ago

17 days ago

Benchmark	Metric	MobileLLM-Pro-1B	openPangu-1B-V1.1
General
MMLU	Acc	44.8	65.08
IF-Eval	Prompt Strict	62.0	55.51
Math & Reasoning
GSM8K	Acc	54.0	82.76
MATH-500	Acc	21.5	81.83
Coding
MBPP	Pass@1	46.8	59.31
HumanEval	Pass@1	59.8	66.66

17 days ago

Pangu-1B's outperformance in reasoning and MMLU is indeed quite striking.

15 days ago

Where did you get the GSM8K and MATH-500 score for MobileLLM-Pro? It's not in this repo.

14 days ago

Where did you get the GSM8K and MATH-500 score for MobileLLM-Pro? It's not in this repo.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment