51da22771f
feat: add verbose output ( #759 )
...
---------
Co-authored-by: Abhishek Sharma <abhishek@zipteams.com >
Co-authored-by: Barabazs <31799121+Barabazs@users.noreply.github.com >
2025-01-01 13:07:52 +01:00
15ad5bf7df
feat: update versions for pyannote:3.3.2 and faster-whisper:1.1.0 ( #936 )
...
* chore: bump faster-whisper to 1.1.0
* chore: bump pyannote to 3.3.2
* feat: add multilingual option in load_model function
---------
Co-authored-by: Barabazs <31799121+Barabazs@users.noreply.github.com >
2024-12-31 10:41:09 +01:00
7fdbd21fe3
feat: add support for faster-whisper 1.0.3 ( #875 )
...
---------
Co-authored-by: Barabazs <31799121+Barabazs@users.noreply.github.com >
2024-12-31 10:07:42 +01:00
3ff625c561
feat: update faster-whisper to 1.0.2 ( #814 )
...
* Update faster-whisper to 1.0.2 to enable model distil-large-v3
* feat: add hotwords option to default_asr_options
---------
Co-authored-by: Barabazs <31799121+Barabazs@users.noreply.github.com >
2024-12-31 09:41:22 +01:00
bbaa2f0d1a
update kwargs
2024-02-22 15:59:14 +00:00
2686f74bc9
Get rid of numeral_symbol_tokens variable in printed message
2024-01-19 22:25:21 +00:00
6bb2f1cd48
Added Vad custom option
2024-01-01 14:56:51 +05:30
71a5281bde
support for large-v3
2023-11-25 12:09:00 +00:00
bd3aa03b6f
Move load_model after WhisperModel
2023-11-16 08:59:28 -03:00
48d651e5ea
Update asr.py and make the model parameter be used
2023-11-16 15:29:24 +08:00
14a7cab8eb
Pass patience and beam_size to faster-whisper.
2023-10-14 13:51:29 +02:00
79801167ac
Fix: Allow vad options to be configurable by correctly passing down to FasterWhisperPipeline.
2023-10-05 10:06:34 -04:00
ffd6167b26
Merge pull request #473 from sorgfresser/fix-faster-whisper-threads
2023-09-19 16:53:34 -07:00
0ae0d49d1d
add faster whisper threading
2023-09-14 11:47:51 +02:00
15451d0f1c
fix: correct defaut_asr_options with new options (patch 0.8)
2023-09-04 17:08:19 +02:00
9647f60fca
Merge branch 'main' into add-merge-chunk-size-as-argument
2023-08-29 10:05:05 -06:00
eb771cf56d
feat: Add merge chunks chunk_size as arguments.
...
Suggest from https://github.com/m-bain/whisperX/issues/200#issuecomment-1666507780
2023-08-29 23:09:02 +08:00
ea7bb91a56
Update asr.py
2023-08-17 14:49:57 +02:00
72685d0398
Update asr.py
2023-08-16 16:15:24 +02:00
4acb5b3abc
Update asr.py
2023-08-16 16:11:46 +02:00
225f6b4d69
fix suppress_numerals
2023-07-29 19:34:51 +02:00
864976af23
fix issue by resetting tokenizer
2023-07-29 18:56:33 +02:00
9d736dca1c
add some warning if languages do not match
2023-07-29 18:20:59 +02:00
d87f6268d0
fix preset language
2023-07-29 18:13:36 +02:00
d7f1d16f19
suppress numerals change logic
2023-06-05 15:44:17 +01:00
74a00eecd7
suppress numerals fix
2023-06-05 15:33:04 +01:00
b026407fd9
Merge branch 'v3' of https://github.com/m-bain/whisperX into v3
...
Conflicts:
whisperx/asr.py
2023-06-05 15:30:02 +01:00
a323cff654
--suppress_numerals option, ensures non-numerical words, for wav2vec2 alignment
2023-06-05 15:27:42 +01:00
5a47f458ac
Added download path parameter.
2023-05-27 11:38:54 +02:00
7c5468116f
Merge branch 'm-bain:main' into transcribe_keywords
2023-05-20 16:03:40 +02:00
a1c705b3a7
fix tokenizer is None
2023-05-20 15:52:45 +02:00
715435db42
add tokenizer is None case
2023-05-20 15:42:21 +02:00
1fc965bc1a
add task, language keyword to transcribe
2023-05-20 15:30:25 +02:00
53396adb21
add device_index
2023-05-20 13:02:46 +02:00
d8a2b4ffc9
Merge pull request #246 from m-bain/v3
...
V3
2023-05-13 12:18:09 +01:00
fd8f1003cf
add translate, fix word_timestamp error
2023-05-13 12:14:06 +01:00
eabf35dff0
Custom result types
2023-05-08 20:45:34 +02:00
b50aafb17b
Fix tuple unpacking
2023-05-08 20:03:42 +02:00
24008aa1ed
fix long segments, break into sentences using nltk, improve align logic, improve diarize (sentence-based)
2023-05-07 15:32:58 +01:00
4e2ac4e4e9
torch2.0, remove compile for now, round to times to 3 decimal
2023-05-04 20:38:13 +01:00
2d59eb9726
Add torch compile to log mel spectrogram
2023-05-03 23:17:44 +02:00
b9c8c5072b
Pad language detection if audio is too short
2023-04-30 18:34:18 +02:00
cb176a186e
added num_workers to fix pickling error
2023-04-29 19:51:05 +02:00
558d980535
v3 init
2023-04-24 21:08:43 +01:00
6a72b61564
clamp end_timestamp to prevent infinite loop
2023-04-11 20:15:37 +01:00
b9ca701d69
.wav conversion, handle audio with no detected speech
2023-03-31 23:02:38 +01:00
ae4a9de307
add vad model external dl
2023-03-30 18:57:55 +01:00
18b63d46e2
skeleton v2
2023-03-30 05:31:57 +01:00