Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
sikang99 's Collections
SLAM
3DGS NeRF
Diffusion Models
VLM, MLLM
Diffusion Model
Reinforcement Learning
Vision Processing
Simulation
VLA Models
AI Agents
3D Generation
Video Generation

VLM, MLLM

updated Jul 1, 2025
Upvote
-

  • UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence with Spatial Reasoning and Understanding

    Paper • 2506.23219 • Published Jun 29, 2025 • 7
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs