Describe, Don't Dictate: Semantic Image Editing with Natural Language Intent Paper • 2508.20505 • Published Aug 28 • 4
MotionSight: Boosting Fine-Grained Motion Understanding in Multimodal LLMs Paper • 2506.01674 • Published Jun 2 • 28