arxiv:2604.10866
huxiaomeng
gregH
AI & ML interests
None yet
Recent Activity
submitted a paper about 9 hours ago
OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models authored a paper about 22 hours ago
RADAR: Robust AI-Text Detection via Adversarial Learning authored a paper about 22 hours ago
Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by
Exploring Refusal Loss Landscapes