paperarXivTrust 82 · PrimaryPublished 8d agoLive · 7d ago
LISA: Likelihood Score Alignment for Visual-condition Controllable Generation
The prevalent dual-branch paradigm, i.e., training a side network to encode visual conditions and fusing its intermediate-layer features to a frozen pretrained main network, has shown remarkable success in visual-condition controllable generation. Despite its widespread adoption, the role of the side branch and its training efficiency remain underexplored. In this paper, we first revisit this mainstream paradigm through the lens of score-based generative modeling: 1) The main network preserves visual perceptual quality by providing a prior unconditional score. 2) The side network steers condit
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
