kemuxu/wikipedia-edit-reward-model
Text Classification • 8B • Updated • 1
kemuxu/mixed-ultra-wiki-reward-model
Updated
kemuxu/mixed-hh-wiki-reward-model
Updated
kemuxu/mixed-all-reward-model
Updated
kemuxu/mixed-hh-ultra-reward-model
Updated
kemuxu/triple-mixed-deepspeed-reward-model
Updated
kemuxu/mixed-rlhf-ultrafeedback-quad-a100-reward-model
Updated
kemuxu/ultrafeedback-reward-model
Text Classification • 8B • Updated • 4
kemuxu/hh-rlhf-lora-quad-a100-reward-model
Updated
kemuxu/hh-rlhf-lora-dual-h200-reward-model
Updated
kemuxu/hh-rlhf-reward-model
Text Classification • 8B • Updated • 1
kemuxu/ultrafeedback-reward-model-dual-h200
Updated
kemuxu/ultrafeedback-reward-model-h200
Updated
kemuxu/qwen3-4b-lora-hh-rlhf-test-reward
Updated
kemuxu/qwen3-4b-lora-hh-rlhf-reward-dual-A100
Updated