Yibo Li

liushiliushi

https://liushiliushi.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper about 7 hours ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

upvoted a paper 15 days ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Organizations

None yet

Papers 3

arxiv:2508.18847

arxiv:2505.19955

arxiv:2310.11829

Models 7

liushiliushi/ConfTuner-Ministral

Text Generation • 8B • Updated Sep 20, 2025 • 3 • 3

liushiliushi/ConfTuner-LLaMA

8B • Updated Sep 19, 2025 • 61

liushiliushi/ConfTuner-Qwen

8B • Updated Sep 19, 2025 • 5 • 2

liushiliushi/Qwen2.5-7B-Instruct_gpt

8B • Updated Jun 18, 2025 • 1

liushiliushi/Llama-3.1-8B-Instruct_gpt

8B • Updated Jun 18, 2025

liushiliushi/llama-uncertainty

8B • Updated Jun 18, 2025

liushiliushi/llama-7b-uncertainty-brier

8B • Updated May 9, 2025 • 1

Datasets 0

None public yet