Accept alternative VAD methods. Extend to use Silero VAD.

2025-07-01 18:17:27 -04:00 · 2024-09-26 10:28:52 +02:00
parent 10b05fc43f
commit 79eb8fa53d
8 changed files with 262 additions and 101 deletions
--- a/README.md
+++ b/README.md
@ -278,7 +278,7 @@ Bug finding and pull requests are also highly appreciated to keep this project g

 * [ ] Add benchmarking code (TEDLIUM for spd/WER & word segmentation)

-* [ ] Allow silero-vad as alternative VAD option
+* [x] Allow silero-vad as alternative VAD option

 * [ ] Improve diarization (word level). *Harder than first thought...*

@ -300,7 +300,9 @@ Borrows important alignment code from [PyTorch tutorial on forced alignment](htt
 And uses the wonderful pyannote VAD / Diarization https://github.com/pyannote/pyannote-audio


-Valuable VAD & Diarization Models from [pyannote audio](https://github.com/pyannote/pyannote-audio)
+Valuable VAD & Diarization Models from:
+- [pyannote audio][https://github.com/pyannote/pyannote-audio]
+- [silero vad][https://github.com/snakers4/silero-vad]

 Great backend from [faster-whisper](https://github.com/guillaumekln/faster-whisper) and [CTranslate2](https://github.com/OpenNMT/CTranslate2)