From 99b6e79fbffaba9301888d254996a975beb516c0 Mon Sep 17 00:00:00 2001
From: Mahmoud Ashraf <32404268+MahmoudAshraf97@users.noreply.github.com>
Date: Thu, 26 Jan 2023 00:56:10 +0200
Subject: [PATCH] Update README.md

added additional instructions to use PyAnnote modules
---
 README.md | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/README.md b/README.md
index 9fae9fc..af5dec7 100644
--- a/README.md
+++ b/README.md
@@ -54,6 +54,8 @@ This repository refines the timestamps of openAI's Whisper model via forced alig
 - Character level timestamps (see `*.char.ass` file output)
 - Diarization (still in beta, add `--diarization`)
+To enable VAD filtering and Diarization, include your Hugging Face access token that you can generate from [Here](https://huggingface.co/settings/tokens) after the `--hf_token` argument and accept the user agreement for the following models: [Segmentation](https://huggingface.co/pyannote/segmentation) , [Voice Activity Detection (VAD)](https://huggingface.co/pyannote/voice-activity-detection) , and [Speaker Diarization](https://huggingface.co/pyannote/speaker-diarization)
+