David S. Barbera

Project · 2026

Whisper Clinical Speech

LoRA fine-tuning of OpenAI Whisper for dysarthric speech recognition using TORGO.

HuggingFace Model License: MIT Python 3.10+

Parameter-efficient fine-tuning of OpenAI’s Whisper for clinical speech recognition — targeting dysarthria (TORGO) with cross-domain evaluation on aphasia (AphasiaBank).

ASR models trained on healthy speech fail catastrophically on clinical populations. Whisper Large-v3 achieves ~5% WER on LibriSpeech but 45–74% WER on dysarthric speech. This project closes that gap using LoRA adaptation with minimal compute.

← All projects