Zhenwen Liang's picture

2 14

Zhenwen Liang

invokerliang

·

http://zhenwen-nlp.github.io/

LZhenwen

AI & ML interests

Mathematical Reasoning.

Recent Activity

upvoted a paper 12 days ago

The Art of Scaling Reinforcement Learning Compute for LLMs

upvoted a paper 12 days ago

The Role of Computing Resources in Publishing Foundation Model Research

upvoted a paper 14 days ago

Building a Foundational Guardrail for General Agentic Systems via Synthetic Data

View all activity

Organizations

upvoted 2 papers 12 days ago

The Art of Scaling Reinforcement Learning Compute for LLMs

Paper • 2510.13786 • Published 13 days ago • 30

The Role of Computing Resources in Publishing Foundation Model Research

Paper • 2510.13621 • Published 13 days ago • 14

upvoted a paper 14 days ago

Building a Foundational Guardrail for General Agentic Systems via Synthetic Data

Paper • 2510.09781 • Published 18 days ago • 26

upvoted a collection 19 days ago

EVOL-RL

The models trained with EVOL-RL • 7 items • Updated 25 days ago • 1

upvoted a paper 26 days ago

CLUE: Non-parametric Verification from Experience via Hidden-State Clustering

Paper • 2510.01591 • Published 26 days ago • 26

upvoted a paper about 1 month ago

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

Paper • 2509.15194 • Published Sep 18 • 33

upvoted a paper 2 months ago

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27 • 84

upvoted a paper 4 months ago

Towards Solving More Challenging IMO Problems via Decoupled Reasoning and Proving

Paper • 2507.06804 • Published Jul 7 • 15

upvoted 3 papers 5 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 138

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

Paper • 2505.23754 • Published May 29 • 15

MPS-Prover: Advancing Stepwise Theorem Proving by Multi-Perspective Search and Data Curation

Paper • 2505.10962 • Published May 16 • 8

upvoted 3 papers over 1 year ago

Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning

Paper • 2406.12050 • Published Jun 17, 2024 • 19

SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark

Paper • 2402.05138 • Published Feb 6, 2024 • 2

What indeed can GPT models do in chemistry? A comprehensive benchmark on eight tasks

Paper • 2305.18365 • Published May 27, 2023 • 4