Xudong Xu's picture

265 12

Xudong Xu

Sheldoooon

·

https://sheldontsui.github.io/

SheldonTsui

AI & ML interests

AIGC for Embodied AI

Recent Activity

upvoted a paper 1 day ago

RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation

upvoted a paper 3 months ago

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

upvoted a paper 3 months ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

View all activity

Organizations

upvoted a paper 1 day ago

RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation

Paper • 2601.05241 • Published 2 days ago • 21

upvoted 8 papers 3 months ago

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

Paper • 2509.22281 • Published Sep 26, 2025 • 32

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26, 2025 • 139

SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent

Paper • 2509.20414 • Published Sep 24, 2025 • 9

Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets

Paper • 2509.21245 • Published Sep 25, 2025 • 39

PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation

Paper • 2509.20358 • Published Sep 24, 2025 • 14

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24, 2025 • 99

How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective

Paper • 2509.18905 • Published Sep 23, 2025 • 29

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published Sep 22, 2025 • 145

liked a dataset 4 months ago

InternRobotics/MesaTask-10K

Updated Sep 29, 2025 • 4.16k • 15

upvoted a paper 4 months ago

Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation

Paper • 2509.12815 • Published Sep 16, 2025 • 40

liked 2 models 4 months ago

lhjiang/anysplat

Image-to-3D • Updated Sep 17, 2025 • 15.8k • 11

InternRobotics/F1-VLA

Robotics • 4B • Updated Sep 9, 2025 • 32 • 32

liked 6 datasets 4 months ago

InternRobotics/MotionMillion

Viewer • Updated Nov 17, 2025 • 1.25M • 1.12k • 37

InternRobotics/InternData-N1

Updated 1 day ago • 46.1k • 44

InternRobotics/InternData-A1

Viewer • Updated about 15 hours ago • 5.61M • 26k • 65

InternRobotics/InternData-M1

Viewer • Updated 29 days ago • 1.66M • 4.41k • 27

InternRobotics/InternScenes

Updated Sep 19, 2025 • 16.5k • 33

InternRobotics/OmniWorld

Viewer • Updated 2 days ago • 6.35B • 25.7k • 77

liked a model 4 months ago

InternRobotics/InternVLA-N1-wo-dagger

Robotics • 8B • Updated Nov 25, 2025 • 124 • 40