Run DeepSeek-V3.1 locally on 170GB RAM with Dynamic 1-bit GGUFs!
GGUFs: unsloth/DeepSeek-V3.1-GGUF
The 715GB model is reduced to 170GB (-80% in size) by selectively quantizing layers. The 1-bit GGUF passes all our code tests, and we fixed the chat template for llama.cpp-supported backends.
Guide: https://docs.unsloth.ai/basics/deepseek-v3.1
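A minimal sketch of what running it locally can look like with llama-cpp-python, assuming the repo follows Unsloth's usual quant naming; the UD-TQ1_0 tag and the shard filename below are assumptions, so check the repo's file listing and the guide for the exact names:

```python
# Sketch: download the Dynamic 1-bit quant and load it with llama-cpp-python.
# Requires: pip install huggingface_hub llama-cpp-python
from huggingface_hub import snapshot_download
from llama_cpp import Llama

# Fetch only the 1-bit files; "UD-TQ1_0" is an assumed quant name,
# verify it against the files in unsloth/DeepSeek-V3.1-GGUF.
local_dir = snapshot_download(
    repo_id="unsloth/DeepSeek-V3.1-GGUF",
    allow_patterns=["*UD-TQ1_0*"],
)

llm = Llama(
    # Hypothetical first-shard filename; llama.cpp loads the remaining shards.
    model_path=f"{local_dir}/UD-TQ1_0/DeepSeek-V3.1-UD-TQ1_0-00001-of-00004.gguf",
    n_ctx=8192,       # context length; raise it if RAM allows
    n_gpu_layers=-1,  # offload whatever fits onto the GPU, rest stays in RAM
)
out = llm("Write a one-line docstring for a quicksort function.", max_tokens=64)
print(out["choices"][0]["text"])
```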
You can now run Kimi K2 Thinking locally with our Dynamic 1-bit GGUFs: unsloth/Kimi-K2-Thinking-GGUF
We shrank the 1T-parameter model to 245GB (-62%) and retained ~85% of its accuracy on Aider Polyglot. Run on >247GB RAM for fast inference.
We also collaborated with the Moonshot AI Kimi team on a system prompt fix!
Guide + fix details: https://docs.unsloth.ai/models/kimi-k2-thinking-how-to-run-locally
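Since the fix concerns the system prompt, a chat-style call is where it matters. A sketch with llama-cpp-python, whose chat API applies the template embedded in the GGUF; the filename and system prompt text below are placeholders, not the actual fix, which is documented in the linked guide:

```python
# Sketch: chat with the K2 Thinking GGUF via llama-cpp-python's chat API,
# which applies the chat template stored in the GGUF file.
from llama_cpp import Llama

llm = Llama(
    model_path="Kimi-K2-Thinking-UD-TQ1_0.gguf",  # hypothetical local filename
    n_ctx=16384,
)
resp = llm.create_chat_completion(
    messages=[
        # Placeholder system prompt; see the guide for the recommended one.
        {"role": "system", "content": "You are Kimi, a helpful assistant."},
        {"role": "user", "content": "Explain what a Dynamic 1-bit GGUF is."},
    ],
    max_tokens=256,
)
print(resp["choices"][0]["message"]["content"])
```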
Qwen3-Next can now be run locally! (30GB RAM)
Instruct GGUF: unsloth/Qwen3-Next-80B-A3B-Instruct-GGUF
Thinking GGUF: unsloth/Qwen3-Next-80B-A3B-Thinking-GGUF
The model comes in Thinking and Instruct versions and uses a new architecture that gives ~10x faster inference than Qwen3-32B.
Step-by-step guide: https://docs.unsloth.ai/models/qwen3-next
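The ~10x figure follows from the architecture: the "A3B" in the model name means only ~3B of the 80B parameters are active per token, while a dense 32B model reads all of its weights on every token. A quick back-of-the-envelope check (the bandwidth-bound assumption is ours, not from the post):

```python
# Decode speed is roughly memory-bandwidth bound, i.e. proportional to the
# number of weights read per generated token.
active_params_moe = 3e9   # Qwen3-Next-80B-A3B: ~3B active parameters per token
dense_params = 32e9       # Qwen3-32B: all 32B parameters touched per token
speedup = dense_params / active_params_moe
print(f"~{speedup:.0f}x fewer weights read per token")  # ~11x, matching the ~10x claim
```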
huihui-ai/Huihui-Kimi-Linear-REAP-35B-A3B-Instruct-abliterated Text Generation • 35B • Updated Nov 27, 2025 • 65 downloads • 5 likes
huihui-ai/Huihui-Orchestrator-8B-abliterated Text Generation • 8B • Updated Nov 30, 2025 • 53 downloads • 5 likes
ArliAI/GLM-4.5-Air-Derestricted-W8A8-INT8 Text Generation • 107B • Updated Nov 24, 2025 • 25 downloads • 7 likes