AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders Paper • 2510.19779 • Published 7 days ago • 58
Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency Paper • 2510.08431 • Published 20 days ago • 8
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention Paper • 2509.24006 • Published about 1 month ago • 115 • 4
Efficient Hyperparameter Tuning via Trajectory Invariance Principle Paper • 2509.25049 • Published 30 days ago • 4