paperarXivTrust 82 · PrimaryPublished 3d agoLive · 2d ago

Adapting Foundation ASR Models to Dysarthric Speech: A Case Study

Automatic speech recognition (ASR) systems often perform poorly in dysarthric speech, limiting their usefulness to affected speakers in everyday communication. This paper presents a personalized ASR system for a dysarthric speaker, built by adapting a foundation ASR model to speaker-specific data. Using the TEQST tool, we collected 92 hours of read speech and later added 8.8 hours of user corrections gathered through a deployed mobile application. Starting from Whisper, fine-tuning reduced word error rate to 15.8% with only 1.4 hours of adaptation data, reached 10.7% with 22.5 hours, and achie

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Has model

modelWhisper-Lite

Implements

repoamplitudesoldierheed/AI-Voice-Changer-Real-Time-Desktop

Implements (incoming)

repoattevon-llc/OpenTranscribe repolgy1027/matrix-live-diarizer

Related across the graph

repoattevon-llc/OpenTranscribe modelWhisper-Lite repoamplitudesoldierheed/AI-Voice-Changer-Real-Time-Desktop repolgy1027/matrix-live-diarizer

Topics

cs.CL