From 3a91aa13842095372813a2058210b931125ec0b8 Mon Sep 17 00:00:00 2001
From: m-bain <36994049+m-bain@users.noreply.github.com>
Date: Sun, 18 Dec 2022 12:43:17 +0000
Subject: [PATCH] Update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 2ee095a..eb68222 100644
--- a/README.md
+++ b/README.md
@@ -75,7 +75,7 @@ https://user-images.githubusercontent.com/36994049/208298819-6f462b2c-8cae-4c54-

Limitations ⚠️

-- Currently only tested for _english_ language, results may vary with different languages.
+- Not thoroughly tested, especially for non-english, results may vary -- please post issue to let me know its results on your data
 - Whisper normalises spoken numbers e.g. "fifty seven" to arabic numerals "57". Need to perform this normalization after alignment, so the phonemes can be aligned. Currently just ignores numbers.
 - Assumes the initial whisper timestamps are accurate to some degree (within margin of 2 seconds, adjust if needed -- bigger margins more prone to alignment errors)
 - Hacked this up quite quickly, there might be some errors, please raise an issue if you encounter any.