DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle Paper • 2512.04324 • Published Dec 3, 2025 • 150
Focusing by Contrastive Attention: Enhancing VLMs' Visual Reasoning Paper • 2509.06461 • Published Sep 8, 2025 • 19
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR Paper • 2508.14029 • Published Aug 19, 2025 • 118
RAVine: Reality-Aligned Evaluation for Agentic Search Paper • 2507.16725 • Published Jul 22, 2025 • 29
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning Paper • 2507.16812 • Published Jul 22, 2025 • 63
A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published Jul 17, 2025 • 259
RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs Paper • 2507.03253 • Published Jul 4, 2025 • 18
Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models Paper • 2503.15888 • Published Mar 20, 2025 • 1
Training a Utility-based Retriever Through Shared Context Attribution for Retrieval-Augmented Language Models Paper • 2504.00573 • Published Apr 1, 2025 • 2
Efficient Inference for Large Reasoning Models: A Survey Paper • 2503.23077 • Published Mar 29, 2025 • 46
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation Paper • 2503.19622 • Published Mar 25, 2025 • 31
Context-Faithful LLMs Collection Usage Instructions can be found at https://github.com/byronBBL/Context-DPO?tab=readme-ov-file#context-faithful-models • 4 items • Updated Feb 17, 2025 • 1