Ziyang Ding's picture

4 4

Ziyang Ding

sdudzy

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 22 hours ago

MM-UAVBench: How Well Do Multimodal Large Language Models See, Think, and Plan in Low-Altitude UAV Scenarios?

liked a dataset 13 days ago

daisq/MM-UAVBench

upvoted a paper 3 months ago

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

View all activity

Organizations

None yet

upvoted a paper about 22 hours ago

MM-UAVBench: How Well Do Multimodal Large Language Models See, Think, and Plan in Low-Altitude UAV Scenarios?

Paper • 2512.23219 • Published Dec 29, 2025 • 4

liked a dataset 13 days ago

daisq/MM-UAVBench

Viewer • Updated 19 days ago • 6.5k • 3.63k • 5

upvoted a paper 3 months ago

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Paper • 2510.08540 • Published Oct 9, 2025 • 109

liked 2 Spaces 7 months ago

Detect AI-generated Image

SAM2 Video Predictor

Segment and propagate masks in videos

liked a model 10 months ago

MaxyLee/DeepPerception

Image-Text-to-Text • 8B • Updated Mar 19, 2025 • 6 • 2

authored a paper 11 months ago

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

Paper • 2503.12797 • Published Mar 17, 2025 • 32

upvoted a paper 11 months ago

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

Paper • 2503.12797 • Published Mar 17, 2025 • 32

upvoted a paper about 1 year ago

Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

Paper • 2501.05767 • Published Jan 10, 2025 • 29