Paper page - Learning while Deploying: Fleet-Scale Reinforcement Learning for Generalist Robot Policies
…Starting from a pretrained VLA policy, LWD closes the loop between deployment, shared physical experience, policy improvement , and redeployment by using autonomous rollouts and human interventions collected across a robot fleet. To…
