2 14 1

brucewan666

https://github.com/SUSTechBruce

SUSTechBruce

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

upvoted a paper about 1 month ago

DoPE: Denoising Rotary Position Embedding

upvoted a paper 3 months ago

LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction

View all activity

Organizations

None yet

upvoted 2 papers about 1 month ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20 • 108

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12 • 93

upvoted a paper 3 months ago

LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction

Paper • 2509.07403 • Published Sep 9 • 34

upvoted a paper 4 months ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19 • 118

upvoted a paper 5 months ago

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Paper • 2506.01713 • Published Jun 2 • 48

commented a paper 7 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 263 •

published a model 10 months ago

brucewan666/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Mar 1

upvoted a paper 10 months ago

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published Feb 26 • 63

published a model 10 months ago

brucewan666/Qwen2.5-1.5B-Open-R1-Distill

Updated Feb 27

upvoted a paper about 1 year ago

Autoregressive Models in Vision: A Survey

Paper • 2411.05902 • Published Nov 8, 2024 • 19

upvoted 2 papers over 1 year ago

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18, 2024 • 56

D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models

Paper • 2406.13035 • Published Jun 18, 2024 • 3

New activity in huggingface/HuggingDiscussions over 1 year ago

[FEEDBACK] Daily Papers

🔥 ❤️ 21

167

#32 opened over 1 year ago by

kramp

upvoted 5 papers over 1 year ago

A Survey on Model Compression for Large Language Models

Paper • 2308.07633 • Published Aug 15, 2023 • 3

SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression

Paper • 2403.07378 • Published Mar 12, 2024 • 4

liked a model almost 2 years ago

EleutherAI/gpt-j-6b

Text Generation • Updated Jun 21, 2023 • 39.3k • 1.52k

brucewan666

AI & ML interests

Recent Activity

Organizations

brucewan666's activity

[FEEDBACK] Daily Papers