Spatial-Aware VLA Pretraining through Visual-Physical Alignment from Human Videos Paper โข 2512.13080 โข Published 17 days ago โข 15