wangrui

varuy322

AI & ML interests

None yet

Recent Activity

liked a dataset 6 days ago

open-r1/codeforces-cots

upvoted a paper 14 days ago

Robot Learning: A Tutorial

liked a dataset 14 days ago

HuggingFaceFW/finepdfs

Organizations

None yet

upvoted a paper 14 days ago

Robot Learning: A Tutorial

Paper • 2510.12403 • Published 20 days ago • 98

upvoted a collection 18 days ago

Ferret

A framework for training LLM agents via RL with advanced search capability: https://github.com/Tree-Shu-Zhao/ferret • 7 items • Updated 13 days ago • 1

upvoted 2 papers about 1 month ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16 • 49

ARE: Scaling Up Agent Environments and Evaluations

Paper • 2509.17158 • Published Sep 21 • 34

upvoted a collection about 1 month ago

ZeroSearch_Policy_Google_V2

6 items • Updated Sep 7 • 5

upvoted a paper about 2 months ago

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28 • 89

upvoted an article about 2 months ago

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Article • May 21 • 225

upvoted a paper about 2 months ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2 • 123

upvoted a collection about 2 months ago

InternVL3.5

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated Sep 28 • 101

upvoted a paper 2 months ago

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published May 8 • 185

upvoted a collection 2 months ago

Seed-X

A powerful open-source multilingual translation language model series, including instruction and reasoning models. • 8 items • Updated Aug 22 • 65

upvoted a paper 2 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 255

upvoted a collection 2 months ago

Intern-S1

7 items • Updated Aug 22 • 25

upvoted a collection 3 months ago

agent

208 items • Updated 10 days ago • 14

upvoted 5 papers 3 months ago

Large Language Model Agent: A Survey on Methodology, Applications and Challenges

Paper • 2503.21460 • Published Mar 27 • 83

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published Jul 8 • 75

Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training

Paper • 2508.00414 • Published Aug 1 • 91

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 257

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26 • 156

upvoted a paper 4 months ago

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published Jul 7 • 39