Whisper#

"Faster Whisper" is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.

Availability / Target HPC systems#

JupyterHub: best suited for interactive use (only for Tier3 HPC accounts)
TinyGPU/Alex: best suited for processing of larger files in batch mode

Whisper on JupyterHub#

Interactive usage of Whisper is available as a custom kernel in JupyterHub. The following steps are necessary:

Access JupyterHub according to this guide.
Select Whisper Transcription Web Service from the job profiles.
Choose whisper-demo from the available notebook types.
You can now use Whisper.
After you finished your work, remember to stop your instance manually by going back to the hub control panel (File > Hub Control Panel) and selecting Stop My Server. Closing the browser or logging out from JupyterHub will NOT free the resources!