Tamper-Resistant Safeguards for Open-Weight LLMs Collection Models & datasets from the paper "Tamper-Resistant Safeguards for Open-Weight LLMs" (https://arxiv.org/pdf/2408.00761) • 9 items • Updated Feb 15, 2025 • 5
Deep Ignorance Collection This collection contains the model and data artifacts from O'Brien et al. (2025). https://deepignorance.ai • 40 items • Updated 4 days ago • 10