numpy torch torchaudio tqdm more-itertools transformers>=4.19.0 ffmpeg-python==0.2.0 pyannote.audio soundfile>=0.12.0