Lost in Sampling: Assessing Lexical Reachability in LLMs via the Word Coverage Score (WCS) Paper • 2605.27268 • Published 4 days ago • 10
The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations Paper • 2507.13302 • Published Jul 17, 2025 • 6
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong Paper • 2501.09775 • Published Jan 16, 2025 • 32
How Stable is Stable Diffusion under Recursive InPainting (RIP)? Paper • 2407.09549 • Published Jun 27, 2024 • 2