view article Article Making LLMs Smaller Without Breaking Them: A GLU-Aware Pruning Approach Nov 24, 2024 • 15
Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging Paper • 2406.16330 • Published Jun 24, 2024 • 1