Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
wenhuachΒ 
posted an update about 21 hours ago
Post
98
πŸš€ AutoRound(https://github.com/intel/auto-round) is now supported by SGLang!

After integrations with TorchAO, Transformers, and VLLM, AutoRound-quantized models are now officially compatible with SGLang β€” bringing faster and more flexible deployment to your LLM workflows.

πŸ’‘ We’ve also enhanced the RTN mode (--iters 0), cutting quantization costs significantly for low-resource users.

⭐ Star our repo and stay tuned for more exciting updates!
In this post