Read original ↗
paperarXivTrust 82 · PrimaryPublished 3d agoLive · 2d ago

LuxEmo: Expressive Text-to-Speech Corpus for Luxembourgish

State-of-the-art speech datasets predominantly focus on widely spoken languages, often overlooking low-resource languages such as Luxembourgish, which remain underrepresented in speech technology research. In this work, we introduce LuxEmo, a 21-hour conversational expressive speech corpus for Luxembourgish with 4 emotion categories. LuxEmo is derived from Radio Télévision Luxembourg (RTL) youth broadcasts, using automated detection followed by human validation. We propose a semi-automatic curation workflow combining voice activity detection, denoising, language identification, LuxASR-based se

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Has model

Implements (incoming)

Related across the graph

Topics