Spin up an instance of g4dn.xlarge with Deep Learning Base OSS Nvidia Driver GPU AMI (Amazon Linux 2023). Log in to the instance via Session Manager.
Make sure the GPU is available.
nvidia-smi
Install https://github.com/SYSTRAN/faster-whisper and cuDNN.
pip3 install faster-whisper
pip3 install nvidia-cublas-cu12 'nvidia-cudnn-cu12==9.*'
export LD_LIBRARY_PATH=`python3 -c 'import os; import nvidia.cublas.lib; import nvidia.cudnn.lib; print(os.path.dirname(nvidia.cublas.lib.__file__) + ":" + os.path.dirname(nvidia.cudnn.lib.__file__))'`
Run the example script of https://github.com/SYSTRAN/faster-whisper?tab=readme-ov-file#faster-whisper.
python3 test.py
Here is an example log of time to transcript 30 minutes audio.
2025-03-30 06:21:31.513916
2025-03-30 06:23:53.890056