Ministral 3 Collection Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 36 items • Updated about 24 hours ago • 20
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 8 days ago • 226
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 6 days ago • 116
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 137
INTELLECT-3 Collection INTELLECT-3: A 100B+ MoE trained with large-scale RL • 4 items • Updated 10 days ago • 11
AICC: Parse HTML Finer, Make Models Better -- A 7.3T AI-Ready Corpus Built by a Model-Based HTML Parser Paper • 2511.16397 • Published 18 days ago • 7
The SA-FARI Dataset: Segment Anything in Footage of Animals for Recognition and Identification Paper • 2511.15622 • Published 19 days ago • 1
view article Article Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks +2 18 days ago • 21
BBox DocVQA: A Large Scale Bounding Box Grounded Dataset for Enhancing Reasoning in Document Visual Question Answer Paper • 2511.15090 • Published 20 days ago • 1
CASTELLA: Long Audio Dataset with Captions and Temporal Boundaries Paper • 2511.15131 • Published 20 days ago • 1
view article Article We’re open-sourcing our text-to-image model and the process behind it 26 days ago • 73
E-MM1 Collection Multimodal embedding model, supporting datasets, and a paper describing the process going into building both the datasets and the models 🤗 • 6 items • Updated 18 days ago • 10