floyed shen
floyed
AI & ML interests
None yet
Recent Activity
submitted
a paper
about 1 hour ago
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
upvoted
a
paper
about 1 hour ago
DeepEyesV2: Toward Agentic Multimodal Model
authored
a paper
6 days ago
Safety Instincts: LLMs Learn to Trust Their Internal Compass for Self-Defense