paper · arXiv · Trust 82 · Primary · Published 5d ago · Live · 3d ago

A Linear Matching Bandit Approach to Online Multi-Human Multi-Robot Teaming

We study online multi-human multi-robot teaming as a linear matching bandit, in which a learner assigns robots with unknown features, drawn from a fixed pool, to distinct sets of human agents over multiple rounds. To solve this problem, we propose LinMatch, an online learning algorithm that updates confidence intervals for the unknown features and selects an optimistic matching under uncertainty. The contributions of this work are twofold. First, we recast the optimistic matching problem in each round as a linear program of maximum weighted matchi
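The loop the abstract describes (estimate unknown features, keep confidence intervals, pick an optimistic matching each round) can be illustrated with a small sketch. This is not the paper's LinMatch algorithm. It assumes a reward model the abstract does not spell out: each robot has an unknown feature vector, each human team has a known context vector, and the expected reward of a pairing is their inner product. It also swaps the linear program for a brute-force search over assignments, which only scales to tiny instances. The names `RobotEstimate`, `optimistic_matching`, `simulate`, and the exploration weight `alpha` are hypothetical.

```python
import itertools
import math
import random


class RobotEstimate:
    """Ridge-regression estimate of one robot's unknown feature vector (assumed model)."""

    def __init__(self, dim, reg=1.0):
        # a_inv starts as (reg * I)^-1; b accumulates reward-weighted contexts.
        self.a_inv = [[(1.0 / reg if i == j else 0.0) for j in range(dim)]
                      for i in range(dim)]
        self.b = [0.0] * dim

    def _a_inv_times(self, v):
        return [sum(row[k] * v[k] for k in range(len(v))) for row in self.a_inv]

    def ucb(self, x, alpha):
        """Estimated reward for context x plus alpha times the confidence width."""
        theta = self._a_inv_times(self.b)
        ax = self._a_inv_times(x)
        mean = sum(t * xi for t, xi in zip(theta, x))
        width = math.sqrt(max(sum(xi * ai for xi, ai in zip(x, ax)), 0.0))
        return mean + alpha * width

    def update(self, x, reward):
        """Sherman-Morrison rank-one update of a_inv after observing a reward."""
        ax = self._a_inv_times(x)
        denom = 1.0 + sum(xi * ai for xi, ai in zip(x, ax))
        for i in range(len(x)):
            for j in range(len(x)):
                self.a_inv[i][j] -= ax[i] * ax[j] / denom
        for i in range(len(x)):
            self.b[i] += reward * x[i]


def optimistic_matching(estimates, contexts, alpha):
    """Brute-force maximum-weight matching on UCB scores (stand-in for the LP)."""
    scores = [[est.ucb(x, alpha) for x in contexts] for est in estimates]
    best = max(
        itertools.permutations(range(len(estimates)), len(contexts)),
        key=lambda assign: sum(scores[r][t] for t, r in enumerate(assign)),
    )
    return list(best)  # best[t] is the robot assigned to team t


def simulate(rounds=300, seed=0):
    """Run the optimistic loop on a toy instance; return the final greedy matching."""
    rng = random.Random(seed)
    true_features = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]  # hidden from the learner
    contexts = [[1.0, 0.0], [0.0, 1.0]]                   # one known vector per team
    estimates = [RobotEstimate(dim=2) for _ in true_features]
    for _ in range(rounds):
        assign = optimistic_matching(estimates, contexts, alpha=1.0)
        for team, robot in enumerate(assign):
            x = contexts[team]
            mean = sum(f * xi for f, xi in zip(true_features[robot], x))
            estimates[robot].update(x, mean + rng.gauss(0.0, 0.1))
    # alpha=0 drops the exploration bonus and matches on estimated rewards only.
    return optimistic_matching(estimates, contexts, alpha=0.0)


if __name__ == "__main__":
    print(simulate())
```

For larger pools, the brute-force search would normally be replaced by a polynomial-time assignment solver or by the linear-programming formulation the abstract mentions.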

Topics

cs.LG