MATRIX: Mask Track Alignment for Interaction-aware Video Generation Paper โข 2510.07310 โข Published Oct 8 โข 35 โข 3
Visual Representation Alignment for Multimodal Large Language Models Paper โข 2509.07979 โข Published Sep 9 โข 83 โข 7