From 681d4c6749f6bc2eed9ef27b7ca9384cbe3a6d05 Mon Sep 17 00:00:00 2001
From: m-bain <36994049+m-bain@users.noreply.github.com>
Date: Wed, 14 Dec 2022 19:02:32 +0000
Subject: [PATCH] Update README.md change desc
---
 README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 60859f7..d982e57 100644
--- a/README.md
+++ b/README.md
@@ -1,13 +1,13 @@

WhisperX

-Whisper Automatic Speech Recognition with improved timestamp accuracy using forced alignment.
+Whisper-Based Automatic Speech Recognition with improved timestamp accuracy using forced alignment.

What is it

-This repository refines the timestamps of openAI's Whisper model via forced aligment with phoneme-level ASR models (e.g. wav2vec2)
+This repository refines the timestamps of openAI's Whisper model via forced aligment with phoneme-based ASR models (e.g. wav2vec2.0)

 **Whisper** is an Automatic Speech Recognition model [developed by OpenAI](https://github.com/openai/whisper), trained on a large dataset of diverse audio. Whilst it does produces highly accurate transcriptions, the corresponding timestamps are at the utterance-level, not per word, and can be inaccurate by several seconds.