Ye Zhiling

yzlnew

https://yzlnew.com

AI & ML interests

Data → Pre-train → Post-train

Recent Activity

liked a model 18 days ago

0xSero/GLM-4.7-REAP-50-W4A16

Text Generation • 2B • Updated 17 days ago • 6.24k • 60

updated a model about 1 month ago

AQ-MedAI/Kimi-K2-Instruct-eagle3

1B • Updated Dec 22, 2025 • 112 • 9

liked a Space 2 months ago

The Distill Template (lvwerra/distill-blog-template)

🌌

Craft Beautiful Blogs

authored a paper 3 months ago

Self-Rewarding Rubric-Based Reinforcement Learning for Open-Ended Reasoning

Paper • 2509.25534 • Published Sep 19, 2025 • 3

upvoted a paper 3 months ago

Self-Rewarding Rubric-Based Reinforcement Learning for Open-Ended Reasoning

Paper • 2509.25534 • Published Sep 19, 2025 • 3

liked a dataset 3 months ago

nvidia/ProfBench

Viewer • Updated Oct 30, 2025 • 40 • 523 • 19

liked a Space 3 months ago

BigCodeArena

🚀

Compare two AI models by sending them code and seeing their responses

authored a paper 4 months ago

MedResearcher-R1: Expert-Level Medical Deep Researcher via A Knowledge-Informed Trajectory Synthesis Framework

Paper • 2508.14880 • Published Aug 20, 2025 • 15

authored a paper 5 months ago

Learning to Align, Aligning to Learn: A Unified Approach for Self-Optimized Alignment

Paper • 2508.07750 • Published Aug 11, 2025 • 21

upvoted a paper 5 months ago

Learning to Align, Aligning to Learn: A Unified Approach for Self-Optimized Alignment

Paper • 2508.07750 • Published Aug 11, 2025 • 21

liked a Space 6 months ago

Megatron Memory Estimator

👁

Estimate GPU memory usage for Megatron models

liked a model 7 months ago

Menlo/Jan-nano

Text Generation • 4B • Updated Jul 4, 2025 • 2.43k • 496

New activity in deepseek-ai/DeepSeek-R1-0528-Qwen3-8B 8 months ago

DeepSeek-R1-Lite

🔥 ❤️ 20

#6 opened 8 months ago by Dampfinchen

liked a Space 8 months ago

DeepSite v3

🐳

16.3k

Generate any application by Vibe Coding

New activity in nanotron/ultrascale-playbook 8 months ago

Typo on ZeRO-1

#112 opened 8 months ago by yzlnew

liked a dataset 8 months ago

nvidia/OpenCodeReasoning

Viewer • Updated May 4, 2025 • 753k • 3.1k • 523

liked a Space 11 months ago

The Ultra-Scale Playbook

🌌

3.66k

The ultimate guide to training LLM on large GPU Clusters

liked a model 11 months ago

Qwen/QwQ-32B

Text Generation • 33B • Updated Mar 11, 2025 • 85k • 2.88k

liked a dataset 11 months ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k

Viewer • Updated Feb 21, 2025 • 110k • 427 • 720