beiqing

zhangBeiQing

ZhangBeiQing

AI & ML interests

None yet

Recent Activity

liked a Space 11 days ago

Apollo-LMMs/TimeScope

commented on a paper 13 days ago

StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding

upvoted a paper 13 days ago

StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding

Organizations

None yet

commented on a paper 13 days ago

StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding

Paper • 2508.15717 • Published Aug 21 • 1

commented on a paper 21 days ago

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Paper • 2404.05726 • Published Apr 8, 2024 • 23

commented on a paper 24 days ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262

commented on 2 papers about 2 months ago

Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions

Paper • 2505.00675 • Published May 1 • 3

Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models

Paper • 2508.09874 • Published Aug 13 • 7

commented on 3 papers 3 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7 • 177

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published Jun 5 • 131

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26 • 156

commented on 2 papers 4 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 274

commented on 2 papers 5 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 185

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 185

commented on a paper 6 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21 • 88
