"Be My Cheese?": Cultural Nuance Benchmarking for Machine Translation in Multilingual LLMs Paper • 2602.04729 • Published Feb 4
Red Teaming Multimodal Language Models: Evaluating Harm Across Prompt Modalities and Models Paper • 2509.15478 • Published Nov 21, 2025