arxiv:2601.16208
Jihan Yang PRO
jihanyang
AI & ML interests
Computer Vision, Multimodality, Embodied AI
Recent Activity
upvoted a paper 17 days ago
Beyond Language Modeling: An Exploration of Multimodal Pretraining upvoted a paper 22 days ago
Solaris: Building a Multiplayer Video World Model in Minecraft liked a dataset about 1 month ago
nyu-visionx/scale-rae-data