lky

NJU-RLer

AI & ML interests

LLM RL

Recent Activity

liked a model 14 days ago

Skywork/Skywork-Reward-V2-Llama-3.1-8B

liked a model 18 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

liked a model about 1 month ago

casperhansen/llama-3-70b-instruct-awq


Organizations

None yet

liked a model 14 days ago

Skywork/Skywork-Reward-V2-Llama-3.1-8B

Text Classification • 8B • Updated Jul 6, 2025 • 50k • 35

liked a model 18 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • 2B • Updated Feb 24, 2025 • 808k • 1.44k

liked a model about 1 month ago

casperhansen/llama-3-70b-instruct-awq

Text Generation • 71B • Updated Apr 19, 2024 • 7.03k • 70

updated a dataset about 1 month ago

NJU-RLer/ImagineBench

Viewer • Updated Dec 21, 2025 • 747k • 23 • 2

liked a dataset about 2 months ago

DigitalLearningGmbH/MATH-lighteval

Viewer • Updated Jan 15, 2025 • 25k • 13.9k • 61

liked a dataset 5 months ago

trl-lib/ultrafeedback-prompt

Viewer • Updated Jan 8, 2025 • 39.8k • 348 • 9

liked 2 models 5 months ago

RLHFlow/ArmoRM-Llama3-8B-v0.1

Text Classification • 8B • Updated Sep 23, 2024 • 10.4k • 183

Qwen/Qwen2.5-32B-Instruct

Text Generation • 33B • Updated Sep 25, 2024 • 4.1M • 322

liked a model 6 months ago

meta-llama/Meta-Llama-3-70B-Instruct

Text Generation • 71B • Updated Jun 18, 2025 • 50.7k • 1.5k

liked a dataset 6 months ago

tatsu-lab/alpaca_farm

Viewer • Updated May 29, 2023 • 91.7k • 205 • 34

liked a model 7 months ago

Qwen/Qwen3-8B

Text Generation • 8B • Updated Jul 26, 2025 • 4.35M • 876

liked a model 8 months ago

Qwen/Qwen2.5-Coder-3B-Instruct

Text Generation • 3B • Updated Jan 12, 2025 • 187k • 93

upvoted a paper 8 months ago

ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts

Paper • 2505.10010 • Published May 15, 2025 • 2

liked a model 8 months ago

Qwen/Qwen2.5-1.5B-Instruct

Text Generation • 2B • Updated Sep 25, 2024 • 6.57M • 596

liked a dataset 9 months ago

NJU-RLer/ImagineBench

Viewer • Updated Dec 21, 2025 • 747k • 23 • 2

published a dataset 9 months ago

NJU-RLer/ImagineBench

Viewer • Updated Dec 21, 2025 • 747k • 23 • 2

liked a model 11 months ago

meta-llama/Llama-3.2-3B-Instruct

Text Generation • 3B • Updated Oct 24, 2024 • 1.55M • 1.95k

liked 3 datasets about 1 year ago
