SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law Paper • 2507.18576 • Published Jul 24 • 7
Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models Paper • 2501.18533 • Published Jan 30 • 1
ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time Paper • 2410.06625 • Published Oct 9, 2024
Sherlock: Self-Correcting Reasoning in Vision-Language Models Paper • 2505.22651 • Published May 28 • 49