paperarXivTrust 82 · PrimaryPublished 4d agoLive · 3d ago

T2LDM++: A Self-Conditioned Representation Guided Diffusion Model for Realistic Text-to-LiDAR Scene Generation

Recent progress in Text-to-Image generation benefits from large-scale Text-Image pairs. However, the scarcity of Text-LiDAR pairs often causes over-smoothed scenes and limited controllability. In this paper, we rethink the limitations of Text-LiDAR generation task, focusing on alleviating insufficient training priors and constructing controllable Text-LiDAR data. We propose a \textbf{T}ext-\textbf{to}-\textbf{L}iDAR \textbf{D}iffusion \textbf{M}odel for LiDAR scene generation, T2LDM++, with a Self-Conditioned Representation Guidance (SCRG). Specifically, to alleviate object over-smoothing, SCR

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Has model

modelDiffuse-XL

Covers

newsDiffusionGemma: 4x faster text generation

Related across the graph

newsDiffusionGemma: 4x faster text generation modelDiffuse-XL

Topics

cs.CV