mirror of
https://github.com/m-bain/whisperX.git
synced 2025-07-01 18:17:27 -04:00
Update README.md
This commit is contained in:
@ -223,7 +223,7 @@ Transcription differences from openai's whisper:
|
|||||||
|
|
||||||
- Transcript words which do not contain characters in the alignment models dictionary e.g. "2014." or "£13.60" cannot be aligned and therefore are not given a timing.
|
- Transcript words which do not contain characters in the alignment models dictionary e.g. "2014." or "£13.60" cannot be aligned and therefore are not given a timing.
|
||||||
- Overlapping speech is not handled particularly well by whisper nor whisperx
|
- Overlapping speech is not handled particularly well by whisper nor whisperx
|
||||||
- Diarization is far from perfect (working on this with custom model v4 -- see contact me).
|
- Diarization is far from perfect
|
||||||
- Language specific wav2vec2 model is needed
|
- Language specific wav2vec2 model is needed
|
||||||
|
|
||||||
|
|
||||||
@ -267,7 +267,7 @@ Bug finding and pull requests are also highly appreciated to keep this project g
|
|||||||
<h2 align="left" id="contact">Contact/Support 📇</h2>
|
<h2 align="left" id="contact">Contact/Support 📇</h2>
|
||||||
|
|
||||||
|
|
||||||
Contact maxhbain@gmail.com for queries. WhisperX v4 development is underway with with siginificantly improved diarization. To support v4 and get early access, get in touch.
|
Contact maxhbain@gmail.com for queries.
|
||||||
|
|
||||||
<a href="https://www.buymeacoffee.com/maxhbain" target="_blank"><img src="https://cdn.buymeacoffee.com/buttons/default-orange.png" alt="Buy Me A Coffee" height="41" width="174"></a>
|
<a href="https://www.buymeacoffee.com/maxhbain" target="_blank"><img src="https://cdn.buymeacoffee.com/buttons/default-orange.png" alt="Buy Me A Coffee" height="41" width="174"></a>
|
||||||
|
|
||||||
|
Reference in New Issue
Block a user