arxiv:2509.22186
Bin Wang
wanderkid
AI & ML interests
Computer Vision, Multimodal Large Language Model
Recent Activity
upvoted
a
paper
about 10 hours ago
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning
upvoted
a
paper
12 days ago
DocDancer: Towards Agentic Document-Grounded Information Seeking
liked
a model
about 1 month ago
opendatalab/TRivia-3B