When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance Paper • 2509.22193 • Published Sep 26 • 37
Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published Jul 1 • 79
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published Mar 7 • 79
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs Paper • 2402.12030 • Published Feb 19, 2024 • 3
Deep learning-based stereo camera multi-video synchronization Paper • 2303.12916 • Published Mar 22, 2023
CroissantLLM: A Truly Bilingual French-English Language Model Paper • 2402.00786 • Published Feb 1, 2024 • 26