Critique to Verify: Accurate and Honest Test-Time Scaling with RL-Trained Verifiers (https://arxiv.org/abs/2509.23152)
Zhicheng YANG
yangzhch6
AI & ML interests
reasoning with LLMs
Recent Activity
upvoted a paper about 11 hours ago
ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual
Recall updated
a dataset 17 days ago
yangzhch6/Accordion-Thinking-Synthetic-Data published
a dataset 17 days ago
yangzhch6/Accordion-Thinking-Synthetic-Data Organizations
None yet