Project · 2026
Whisper Clinical Speech
LoRA fine-tuning of OpenAI Whisper for dysarthric speech recognition using TORGO.
Parameter-efficient fine-tuning of OpenAI’s Whisper for clinical speech recognition — targeting dysarthria (TORGO) with cross-domain evaluation on aphasia (AphasiaBank).
ASR models trained on healthy speech fail catastrophically on clinical populations. Whisper Large-v3 achieves ~5% WER on LibriSpeech but 45–74% WER on dysarthric speech. This project closes that gap using LoRA adaptation with minimal compute.