✨ 1T total / 50B active params per token
✨ 20T+ reasoning-dense tokens (Evo-CoT)
✨ 128K context via YaRN
✨ FP8 training: 15%+ faster, same precision as BF16
✨ Hybrid Syntax-Function-Aesthetics reward for front-end & visual generation
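For context, YaRN-style long-context extension is usually switched on through a `rope_scaling` entry in a Hugging Face-style `config.json`. A minimal sketch, with illustrative values only (the actual scaling factor and original context length for this model are not stated above):

```json
{
  "max_position_embeddings": 131072,
  "rope_scaling": {
    "rope_type": "yarn",
    "factor": 4.0,
    "original_max_position_embeddings": 32768
  }
}
```

Here `factor` is the ratio between the extended and original context windows; 4.0 and 32768 are assumptions for illustration.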
Qwen 3 Coder is a personal attack on K2, and I love it. It achieves near-SOTA on LiveCodeBench without reasoning. Finally people are understanding that reasoning isn't necessary for high benchmarks...