Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shehan Munasinghe's picture
2 11 2

Shehan Munasinghe

shehan97
SasikaA073's profile picture Saeid's profile picture seeniameenullah's profile picture
·
https://shehanmunasinghe.github.io/
  • shehan_u_e_m
  • shehanmunasinghe

AI & ML interests

Computer Vision, Multi-modal learning

Recent Activity

authored a paper about 2 months ago
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
upvoted a paper 7 months ago
Sekai: A Video Dataset towards World Exploration
upvoted a paper 7 months ago
CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark
View all activity

Organizations

Mohamed Bin Zayed University of Artificial Intelligence's profile picture

commented 2 papers about 1 year ago

VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

Paper • 2411.04923 • Published Nov 7, 2024 • 23 •
3

VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

Paper • 2411.04923 • Published Nov 7, 2024 • 23 •
3
New activity in MBZUAI/swiftformer-xs about 2 years ago

Adding `safetensors` variant of this model

1
#1 opened over 2 years ago by
SFconvertbot
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs