Efficient Large Vision-Language Model Collection ERGO: LVLM trained with RL on efficiency objectives; https://github.com/nota-github/ERGO • 3 items • Updated 1 day ago • 20
Efficient MoE-based LLM Collection Mixture-of-Experts Large Language Models with Advanced Quantization • 4 items • Updated 1 day ago • 18
Shortened LLaMA: A Simple Depth Pruning for Large Language Models Paper • 2402.02834 • Published Feb 5, 2024 • 17