shangzeyu's picture

8 11

shangzeyu

shangzy

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 19 days ago

Multi-Docker-Eval: A `Shovel of the Gold Rush' Benchmark on Automatic Environment Building for Software Engineering

liked a model about 2 months ago

moonshotai/Kimi-K2-Thinking

upvoted a paper 3 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

View all activity

Organizations

upvoted a paper 19 days ago

Multi-Docker-Eval: A `Shovel of the Gold Rush' Benchmark on Automatic Environment Building for Software Engineering

Paper • 2512.06915 • Published 20 days ago • 12

liked a model about 2 months ago

moonshotai/Kimi-K2-Thinking

Text Generation • Updated Nov 8 • 396k • • 1.57k

upvoted a paper 3 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 227

liked a model 4 months ago

moonshotai/Kimi-K2-Instruct-0905

Text Generation • 1T • Updated Nov 7 • 31.5k • • 644

liked a model 6 months ago

moonshotai/Kimi-K2-Base

Text Generation • 1T • Updated Jul 13 • 1.94k • 282

upvoted a collection 6 months ago

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 5 items • Updated Nov 14 • 162

liked a model 6 months ago

moonshotai/Kimi-K2-Instruct

Text Generation • 1T • Updated Nov 7 • 59.2k • • 2.28k

upvoted a collection 7 months ago

NextCoder

NextCoder family of code-editing LMs developed with Selective Knowledge Transfer and its training data. • 6 items • Updated Jul 9 • 74

upvoted a paper 8 months ago

Kimi-Audio Technical Report

Paper • 2504.18425 • Published Apr 25 • 20

authored a paper 8 months ago

Kimi-Audio Technical Report

Paper • 2504.18425 • Published Apr 25 • 20

liked a model 8 months ago

moonshotai/Kimi-Audio-7B-Instruct

Text-to-Speech • 10B • Updated May 29 • 1.08k • 377

upvoted 2 collections 9 months ago

Kimina Prover Preview

State-of-the-Art Models for Formal Mathematical Reasoning • 5 items • Updated Apr 28 • 33

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated Oct 30 • 77

liked a model 9 months ago

moonshotai/Kimi-VL-A3B-Thinking

Image-Text-to-Text • 16B • Updated Aug 18 • 13.9k • 442

liked 2 models 10 months ago

qihoo360/TinyR1-32B-Preview

Text Generation • 33B • Updated Sep 24 • 114 • • 329

internlm/internlm3-8b-instruct

Text Generation • 9B • Updated Feb 11 • 14.3k • 228

liked a dataset 10 months ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k

Viewer • Updated Feb 21 • 110k • 506 • 711

upvoted a paper over 1 year ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15, 2024 • 60

liked a model over 1 year ago

meta-llama/Meta-Llama-3-70B-Instruct

Text Generation • 71B • Updated Jun 18 • 56.9k • • 1.5k

liked a Space about 2 years ago

HierSpeech++ (Zero-shot TTS)

Generate high-quality speech from text using a prompt audio