diff --git a/README.md b/README.md
index 60859f7..d982e57 100644
--- a/README.md
+++ b/README.md
@@ -1,13 +1,13 @@

WhisperX

-Whisper Automatic Speech Recognition with improved timestamp accuracy using forced alignment.
+Whisper-Based Automatic Speech Recognition with improved timestamp accuracy using forced alignment.

What is it

-This repository refines the timestamps of openAI's Whisper model via forced aligment with phoneme-level ASR models (e.g. wav2vec2)
+This repository refines the timestamps of openAI's Whisper model via forced aligment with phoneme-based ASR models (e.g. wav2vec2.0)
**Whisper** is an Automatic Speech Recognition model [developed by OpenAI](https://github.com/openai/whisper), trained on a large dataset of diverse audio. Whilst it does produces highly accurate transcriptions, the corresponding timestamps are at the utterance-level, not per word, and can be inaccurate by several seconds.