From c912f96ed35f59a602075c817510403f3f1fa9e7 Mon Sep 17 00:00:00 2001 From: Max Bain Date: Fri, 23 Dec 2022 00:50:32 +0000 Subject: [PATCH] add examples.md --- EXAMPLES.md | 37 +++++++++++++++++++++++++++++++++++++ 1 file changed, 37 insertions(+) create mode 100644 EXAMPLES.md diff --git a/EXAMPLES.md b/EXAMPLES.md new file mode 100644 index 0000000..c39c0c4 --- /dev/null +++ b/EXAMPLES.md @@ -0,0 +1,37 @@ +# More Examples + +## Other Languages + +For non-english ASR, it is best to use the `large` whisper model. Alignment models are automatically picked by the chosen language from the default [lists](https://github.com/m-bain/whisperX/blob/e909f2f766b23b2000f2d95df41f9b844ac53e49/whisperx/transcribe.py#L22). + +Currently support default models tested for {en, fr, de, es, it, ja, zh, nl} + + +If the detected language is not in this list, you need to find a phoneme-based ASR model from [huggingface model hub](https://huggingface.co/models) and test it on your data. + +### French + whisperx --model large --language fr examples/sample_fr_01.wav + + +https://user-images.githubusercontent.com/36994049/208298804-31c49d6f-6787-444e-a53f-e93c52706752.mov + + +### German + whisperx --model large --language de examples/sample_de_01.wav + + +https://user-images.githubusercontent.com/36994049/208298811-e36002ba-3698-4731-97d4-0aebd07e0eb3.mov + + +### Italian + whisperx --model large --language de examples/sample_it_01.wav + + +https://user-images.githubusercontent.com/36994049/208298819-6f462b2c-8cae-4c54-b8e1-90855794efc7.mov + + +### Japanese + whisperx --model large --language ja examples/sample_ja_01.wav + + +https://user-images.githubusercontent.com/19920981/208731743-311f2360-b73b-4c60-809d-aaf3cd7e06f4.mov