Microsoft Research has presented an object-centric residual reinforcement learning method that trains a lightweight corrective robot policy entirely in simulation, adds it to a frozen vision-language-action model, and reports zero-shot real-robot gains across five manipulation tasks from 42...