paper · arXiv · Trust 82 · Primary · Published 3d ago · Live · 2d ago

Tone-Conditioned Curriculum Learning for Low-Resource Bantu Speech Recognition

Southern Bantu languages are spoken by over 80 million people, yet current foundation ASR models still produce zero-shot WER above 100%, which limits practical use in education and public services. We addressed this gap with a tone-conditioned curriculum framework for six Southern Bantu languages that combined hybrid difficulty scoring, gated adapters driven by tonal statistics, and staged curriculum training. We trained on a community corpus and tested transfer to NCHLT to measure robustness beyond matched evaluation. Results revealed clear interactions between architecture and language, with W2
