mirror of
https://github.com/m-bain/whisperX.git
synced 2025-07-01 18:17:27 -04:00
Update README.md
added additional instructions to use PyAnnote modules
This commit is contained in:
@ -54,6 +54,8 @@ This repository refines the timestamps of openAI's Whisper model via forced alig
|
|||||||
- Character level timestamps (see `*.char.ass` file output)
|
- Character level timestamps (see `*.char.ass` file output)
|
||||||
- Diarization (still in beta, add `--diarization`)
|
- Diarization (still in beta, add `--diarization`)
|
||||||
|
|
||||||
|
To enable VAD filtering and Diarization, include your Hugging Face access token that you can generate from [Here](https://huggingface.co/settings/tokens) after the `--hf_token` argument and accept the user agreement for the following models: [Segmentation](https://huggingface.co/pyannote/segmentation) , [Voice Activity Detection (VAD)](https://huggingface.co/pyannote/voice-activity-detection) , and [Speaker Diarization](https://huggingface.co/pyannote/speaker-diarization)
|
||||||
|
|
||||||
|
|
||||||
<h2 align="left" id="setup">Setup ⚙️</h2>
|
<h2 align="left" id="setup">Setup ⚙️</h2>
|
||||||
Install this package using
|
Install this package using
|
||||||
|
Reference in New Issue
Block a user