MMFineReason Collection High-quality STEM reasoning dataset for Multimodal LLM post-training. • 13 items • Updated about 14 hours ago • 17
OpenDataArena/MMFineReason-SFT-123K-Qwen3-VL-235B-Thinking Viewer • Updated 1 day ago • 123k • 20 • 19
MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods Paper • 2601.21821 • Published 2 days ago • 45
Scientific Image Synthesis: Benchmarking, Methodologies, and Downstream Utility Paper • 2601.17027 • Published 14 days ago • 40
ChartVerse: Scaling Chart Reasoning via Reliable Programmatic Synthesis from Scratch Paper • 2601.13606 • Published 12 days ago • 10
ODA-Mixture Collection High-quality mixture datasets for post-training covering multiple domains. • 7 items • Updated 15 days ago • 4
ODA-Math Collection High-quality mathematical datasets for post-training. • 5 items • Updated 15 days ago • 1
Closing the Data Loop: Using OpenDataArena to Engineer Superior Training Datasets Paper • 2601.09733 • Published Dec 30, 2025 • 8