Hai Ye

oceanpty

AI & ML interests

None yet

Recent Activity

upvoted a paper about 21 hours ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

upvoted a paper 24 days ago

Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding

authored a paper about 2 months ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

View all activity

Organizations

None yet

upvoted a paper about 21 hours ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Paper • 2601.09688 • Published 1 day ago • 102

upvoted a paper 24 days ago

Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding

Paper • 2512.17532 • Published 28 days ago • 65

authored a paper about 2 months ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 183

upvoted a paper about 2 months ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 183

updated a model 4 months ago

oceanpty/self-j-vicuna-13b-v1.5-student

8B • Updated Sep 3, 2025 • 1

published a model 4 months ago

oceanpty/self-j-vicuna-13b-v1.5-student

8B • Updated Sep 3, 2025 • 1

updated a model 5 months ago

oceanpty/self-j-vicuna-13b-v1.5-kd

Updated Aug 25, 2025

published a model 5 months ago

oceanpty/self-j-vicuna-13b-v1.5-kd

Updated Aug 25, 2025

upvoted a paper 9 months ago

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published May 1, 2025 • 36

upvoted a paper 11 months ago

Finding the Sweet Spot: Preference Data Construction for Scaling Preference Optimization

Paper • 2502.16825 • Published Feb 24, 2025 • 7

commented a paper 11 months ago

Finding the Sweet Spot: Preference Data Construction for Scaling Preference Optimization

Paper • 2502.16825 • Published Feb 24, 2025 • 7 •

authored a paper about 1 year ago

Test-time Computing: from System-1 Thinking to System-2 Thinking

Paper • 2501.02497 • Published Jan 5, 2025 • 45

upvoted a paper about 1 year ago

Test-time Computing: from System-1 Thinking to System-2 Thinking

Paper • 2501.02497 • Published Jan 5, 2025 • 45

updated 2 models about 1 year ago

oceanpty/Self-J-lla31-8b-inst-base-yi-1-5-16k-chat-threshold-1

8B • Updated Dec 31, 2024 • 7

oceanpty/Self-J-lla31-8b-inst-ref-lla31-70b-base-yi-1-5-16k-chat-threshold-1

8B • Updated Dec 31, 2024 • 1

authored 3 papers about 1 year ago

Preference-Guided Reflective Sampling for Aligning Language Models

Paper • 2408.12163 • Published Aug 22, 2024

Self-Judge: Selective Instruction Following with Alignment Self-Evaluation

Paper • 2409.00935 • Published Sep 2, 2024

Multi-Agent Sampling: Scaling Inference Compute for Data Synthesis with Tree Search-Based Agentic Collaboration

Paper • 2412.17061 • Published Dec 22, 2024 • 1

updated 2 models about 1 year ago

oceanpty/Self-J-lla31-8b-inst-base-mis-7b-v02-threshold-1

8B • Updated Dec 30, 2024 • 4

oceanpty/Self-J-lla31-8b-inst-base-mis-7b-v02-threshold-1

8B • Updated Dec 30, 2024 • 4

Hai Ye

AI & ML interests

Recent Activity

Organizations

oceanpty's activity