VIDEOP2R: Video Understanding from Perception to Reasoning Paper • 2511.11113 • Published 25 days ago • 111
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning Paper • 2412.03248 • Published Dec 4, 2024 • 27