arxiv:2411.11504
Boxi Cao
Bowieee
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
When Models Outthink Their Safety: Mitigating Self-Jailbreak in Large
Reasoning Models with Chain-of-Guardrails
liked
a dataset
8 months ago
agentica-org/DeepCoder-Preview-Dataset
liked
a dataset
8 months ago
inclusionAI/AReaL-boba-Data