paperarXivTrust 82 · PrimaryPublished 5d agoLive · 3d ago

Behavior Uncloning: Distilling Mode Redirection into Policy Weights without Inference-Time Steering

Behavior-cloned policies often learn multiple behavior modes from demonstration datasets, including modes that are unsafe or otherwise undesired at deployment. For example, a policy trained on diverse handover demonstrations may learn to pass a knife blade-first. Standard remedies such as data curation and inference-time steering either require access to the original demonstrations for full retraining or add substantial inference-time overhead. To address this gap, we propose MoRE(Mode Redirection), which redirects policy rollouts toward desired behavior modes through a short "uncloning" step.

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Covers

newsPredicting model behavior before release by simulating deployment

Related across the graph

newsPredicting model behavior before release by simulating deployment

Topics

cs.AI