interspeech

2025-07-01 18:17:27 -04:00 · 2023-06-01 16:54:16 +01:00
parent 9797a67391
commit 93ed6cfa93
1 changed files with 2 additions and 1 deletions
--- a/README.md
+++ b/README.md
@ -54,6 +54,7 @@ This repository provides fast automatic speech recognition (70x realtime with la

 <h2 align="left", id="highlights">New🚨</h2>

+- _WhisperX_ accepted at INTERSPEECH 2023 
 - v3 transcript segment-per-sentence: using nltk sent_tokenize for better subtitlting & better diarization
 - v3 released, 70x speed-up open-sourced. Using batched whisper with [faster-whisper](https://github.com/guillaumekln/faster-whisper) backend!
 - v2 released, code cleanup, imports whisper library VAD filtering is now turned on by default, as in the paper.
@ -276,7 +277,7 @@ If you use this in your research, please cite the paper:
@article{bain2022whisperx,
  title={WhisperX: Time-Accurate Speech Transcription of Long-Form Audio},
  author={Bain, Max and Huh, Jaesung and Han, Tengda and Zisserman, Andrew},
-  journal={arXiv preprint, arXiv:2303.00747},
+  journal={INTERSPEECH 2023},
  year={2023}
 }
 ```