What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards Paper • 2512.00425 • Published 29 days ago • 49
variante/llava-1.5-7b-llara-D-RT2-Style-VIMA-80k Image-Text-to-Text • 7B • Updated Aug 28, 2024 • 6
LLaRA Collection Models released with LLaRA: Supercharging Robot Learning Data for Vision-Language Policy • 7 items • Updated Aug 28, 2024 • 1
Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5 Image-Text-to-Text • 4B • Updated Feb 3 • 406 • 59
variante/llava-1.5-7b-llara-D-inBC-VIMA-80k Image-Text-to-Text • 7B • Updated Jul 13, 2024 • 11 • 1
variante/llava-1.5-7b-llara-D-inBC-Aux-D-VIMA-80k Image-Text-to-Text • 7B • Updated Jul 13, 2024 • 13 • 1
variante/llava-1.5-7b-llara-D-inBC-Aux-B-VIMA-80k Image-Text-to-Text • 7B • Updated Jul 15, 2024 • 9 • 2
Theia: Distilling Diverse Vision Foundation Models for Robot Learning Paper • 2407.20179 • Published Jul 29, 2024 • 47