Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs Paper • 2508.06601 • Published 14 days ago • 6
Improving Black-box Robustness with In-Context Rewriting Collection 24 items • Updated Feb 20, 2024 • 1