Commit Graph

117 Commits

SHA1 Message Date
bc2776017e v2 lets go 2023-04-01 00:09:29 +01:00
11a78d7ced handle tmp wav file better 2023-04-01 00:06:40 +01:00
b9ca701d69 .wav conversion, handle audio with no detected speech 2023-03-31 23:02:38 +01:00
d0fa028045 fix tfile naming 2023-03-30 19:24:42 +01:00
ae4a9de307 add vad model external dl 2023-03-30 18:57:55 +01:00
18b63d46e2 skeleton v2 2023-03-30 05:31:57 +01:00
1e7c2c337b Merge pull request #148 from FernanOrtega/main
Update decoding.py
2023-03-24 07:57:43 -07:00
33dd3b9bcd Update decoding.py
Changes from https://github.com/openai/whisper/pull/914/
2023-03-24 11:56:41 +01:00
d1b4ff8228 Merge pull request #114 from mshakirDr/patch-1
Fix hugging face error
2023-03-23 15:12:09 -07:00
809700e286 remove soundfile version constraint 2023-03-06 00:20:31 +00:00
cea42ca470 Fix hugging face error
Model should be loaded with an id to avoid this error:
huggingface_hub.utils._validators.HFValidationError: Repo id must use alphanumeric chars or '-', '_', '.', '--' and '..' are forbidden, '-' and '.' cannot start or end the name, max length is 96: 'pyannote\segmentation'.
2023-03-04 19:12:13 +01:00
d1d420e70c Merge pull request #111 from Barabazs/patch-1
fix: force soundfile version update for mp3 support
2023-03-04 11:46:57 +00:00
844eb30710 fix: force soundfile version update for mp3 support 2023-03-04 11:01:26 +01:00
31e6fe7e36 Merge pull request #107 from JCGoran/fix/python3.7_compatibility
Added Python 3.7 compatibility
2023-03-02 15:31:36 +00:00
cfcede41f6 Added Python 3.7 compatibility
- removed use of walrus operator in favor of `np.cumsum`
2023-03-02 15:46:07 +01:00
186b06e032 paper drop 2023-03-02 12:04:16 +00:00
847a3cd85b Merge pull request #96 from smly/fix-batch-processing
FIX: Assertion error in batch processing
v1.0.0
2023-02-22 12:11:01 +00:00
2b1ffa12b8 Merge pull request #97 from smly/gpu-vad-filter
GPU acceleration when using VAD filters
2023-02-21 18:57:14 +00:00
57f5957e0e Pass device to pyannote.audio.Inference 2023-02-22 03:48:20 +09:00
27fe502344 Fix assertion error in batch processing 2023-02-22 02:45:13 +09:00
f7093e60d3 Merge pull request #90 from Pikauba/translation_starting_point_improvement
Improvement to transcription starting point with VAD
2023-02-18 21:59:57 +00:00
a1d2229416 Improvement to transcription starting point with VAD 2023-02-18 11:12:23 -05:00
4cb167a225 Merge pull request #74 from Camb-ai/level-bug-fix
added if clause for checking 'level-1'
2023-02-14 19:22:22 +00:00
2e307814dd added if clause for checking 2023-02-10 14:48:51 +05:30
d687cf3358 Merge pull request #58 from MahmoudAshraf97/main
added turkish wav2vec2 model
2023-02-01 22:11:51 +00:00
0a3fd11562 update readme 2023-02-01 22:09:11 +00:00
29e95b746b Merge pull request #57 from TengdaHan/main
support batch processing
2023-02-01 20:37:54 +00:00
039af89a86 support batch processing 2023-02-01 19:41:20 +00:00
9f26112d5c added turkish wav2vec2 model 2023-02-01 21:38:50 +02:00
fd2a093754 Merge pull request #55 from jonatasgrosman/main
FIX: Error when loading Hugging Face's models with embedded LM
2023-02-01 10:27:45 +00:00
31f069752f Merge pull request #53 from MahmoudAshraf97/main
Add more languages to models list
2023-02-01 10:27:25 +00:00
4cdf7ef856 Merge pull request #48 from Barabazs/main
doc: format checklist
2023-02-01 10:26:58 +00:00
d294e29ad9 fix: error when loading huggingface model with embedded language model 2023-01-31 23:24:26 -03:00
0eae9e1f50 added several wav2vec2 models by jonatasgrosman
since his models were used in other languages before and I tested the arabic model myself, I assumed it's safe to include all the available models
2023-02-01 03:02:10 +02:00
1b08661e42 change arabic model to jonatasgrosman 2023-01-31 19:32:31 +02:00
a49799294b add arabic wav2vec2 model form elgeish 2023-01-31 19:07:48 +02:00
d83c74a79f doc: format checklist 2023-01-29 16:07:58 +01:00
acaefa09a1 Merge pull request #46 from Barabazs/main
Add sponsor link to sidebar
2023-01-28 19:05:36 +00:00
76f79f600a fix short seg timestamps bug 2023-01-28 19:04:19 +00:00
33073f9bba Create FUNDING.yml 2023-01-28 19:43:27 +01:00
50f3965fdb fix tsv file ext 2023-01-28 17:39:07 +00:00
df2b1b70cb increase vad cut default 2023-01-28 14:49:53 +00:00
c19cf407d8 handle non-alignable whole segments 2023-01-28 13:53:03 +00:00
8081ef2dcd add custom vad binarization for vad cut 2023-01-28 00:22:33 +00:00
c6dbac76c8 cut up vad segments when too long to prevent OOM 2023-01-28 00:01:39 +00:00
69673eb39b buy-me-a-coffee 2023-01-27 15:12:49 +00:00
5b8c8a7bd3 pandas fix 2023-01-27 15:05:08 +00:00
7f2159a953 Merge branch 'main' of https://github.com/m-bain/whisperX into main 2023-01-26 10:46:36 +00:00
16d24b1c96 only pad timestamps if not using VAD 2023-01-26 10:46:13 +00:00
d20a2a4ea2 typo in --diarize flag 2023-01-26 10:28:54 +00:00