Shuo Wang's picture

1 18 5

Shuo Wang

shuo-hf

·

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

openbmb/MiniCPM4.1-8B

upvoted a paper 2 months ago

Limitations of Normalization in Attention Mechanism

upvoted a paper 2 months ago

Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling

View all activity

Organizations

upvoted 13 papers 2 months ago

Limitations of Normalization in Attention Mechanism

Paper • 2508.17821 • Published Aug 25 • 7

Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling

Paper • 2508.16745 • Published Aug 22 • 28

Understanding Tool-Integrated Reasoning

Paper • 2508.19201 • Published Aug 26 • 32

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

Paper • 2508.18672 • Published Aug 26 • 10

ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models

Paper • 2508.18773 • Published Aug 26 • 15

UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning

Paper • 2508.18756 • Published Aug 26 • 36

DeepScholar-Bench: A Live Benchmark and Automated Evaluation for Generative Research Synthesis

Paper • 2508.20033 • Published Aug 27 • 10

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Paper • 2508.20088 • Published Aug 27 • 20

Predicting the Order of Upcoming Tokens Improves Language Modeling

Paper • 2508.19228 • Published Aug 26 • 22

MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers

Paper • 2508.20453 • Published Aug 28 • 63

Mixture of Contexts for Long Video Generation

Paper • 2508.21058 • Published Aug 28 • 34

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28 • 113

PaperRegister: Boosting Flexible-grained Paper Search via Hierarchical Register Indexing

Paper • 2508.11116 • Published Aug 14 • 22

upvoted a paper 4 months ago

RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published Jun 23 • 31

upvoted a paper 5 months ago

MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9 • 92

upvoted a paper 9 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 61

upvoted a paper 10 months ago

Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

Paper • 2501.05767 • Published Jan 10 • 29

upvoted a paper about 1 year ago

LLMtimesMapReduce: Simplified Long-Sequence Processing using Large Language Models

Paper • 2410.09342 • Published Oct 12, 2024 • 39