
Shehan Munasinghe

shehan97
https://shehanmunasinghe.github.io/
  • shehan_u_e_m
  • shehanmunasinghe

AI & ML interests

Computer Vision, Multi-modal learning

Recent Activity

authored a paper 19 days ago
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
upvoted a paper 6 months ago
Sekai: A Video Dataset towards World Exploration
upvoted a paper 6 months ago
CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark

Organizations

Mohamed Bin Zayed University of Artificial Intelligence

authored a paper 19 days ago

VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

Paper • 2411.04923 • Published Nov 7, 2024 • 24
authored a paper about 2 years ago

PG-Video-LLaVA: Pixel Grounding Large Video-Language Models

Paper • 2311.13435 • Published Nov 22, 2023 • 19