mirror of
https://github.com/m-bain/whisperX.git
synced 2025-07-01 18:17:27 -04:00
Update README.md
change desc
This commit is contained in:
@ -1,13 +1,13 @@
|
|||||||
<h1 align="center">WhisperX</h1>
|
<h1 align="center">WhisperX</h1>
|
||||||
|
|
||||||
<p align="center">Whisper Automatic Speech Recognition with improved timestamp accuracy using forced alignment.
|
<p align="center">Whisper-Based Automatic Speech Recognition with improved timestamp accuracy using forced alignment.
|
||||||
|
|
||||||
</p>
|
</p>
|
||||||
|
|
||||||
|
|
||||||
<h2 align="center">What is it</h2>
|
<h2 align="center">What is it</h2>
|
||||||
|
|
||||||
This repository refines the timestamps of openAI's Whisper model via forced aligment with phoneme-level ASR models (e.g. wav2vec2)
|
This repository refines the timestamps of openAI's Whisper model via forced aligment with phoneme-based ASR models (e.g. wav2vec2.0)
|
||||||
|
|
||||||
|
|
||||||
**Whisper** is an Automatic Speech Recognition model [developed by OpenAI](https://github.com/openai/whisper), trained on a large dataset of diverse audio. Whilst it does produces highly accurate transcriptions, the corresponding timestamps are at the utterance-level, not per word, and can be inaccurate by several seconds.
|
**Whisper** is an Automatic Speech Recognition model [developed by OpenAI](https://github.com/openai/whisper), trained on a large dataset of diverse audio. Whilst it does produces highly accurate transcriptions, the corresponding timestamps are at the utterance-level, not per word, and can be inaccurate by several seconds.
|
||||||
|
Reference in New Issue
Block a user