Read original ↗
paperarXivTrust 82 · PrimaryPublished 3d agoLive · 2d ago

Adapting Foundation ASR Models to Dysarthric Speech: A Case Study

Automatic speech recognition (ASR) systems often perform poorly in dysarthric speech, limiting their usefulness to affected speakers in everyday communication. This paper presents a personalized ASR system for a dysarthric speaker, built by adapting a foundation ASR model to speaker-specific data. Using the TEQST tool, we collected 92 hours of read speech and later added 8.8 hours of user corrections gathered through a deployed mobile application. Starting from Whisper, fine-tuning reduced word error rate to 15.8% with only 1.4 hours of adaptation data, reached 10.7% with 22.5 hours, and achie

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Has model

Implements

Implements (incoming)

Related across the graph

Topics