Snowflake/snowflake-arctic-embed-m-v2.0 Sentence Similarity • Updated Apr 24, 2025 • 74.8k • 101
view article Article Making LLMs Smaller Without Breaking Them: A GLU-Aware Pruning Approach Nov 24, 2024 • 19
Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging Paper • 2406.16330 • Published Jun 24, 2024 • 1