A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos Paper • 2512.16978 • Published 16 days ago • 4
A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos Paper • 2512.16978 • Published 16 days ago • 4
A Culturally-diverse Multilingual Multimodal Video Benchmark & Model Paper • 2506.07032 • Published Jun 8, 2025
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM Paper • 2503.04724 • Published Mar 6, 2025 • 72
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM Paper • 2503.04724 • Published Mar 6, 2025 • 72
BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities Paper • 2412.07769 • Published Dec 10, 2024 • 30
BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities Paper • 2412.07769 • Published Dec 10, 2024 • 30