b9c8c5072b
Pad language detection if audio is too short
2023-04-30 18:34:18 +02:00
a903e57cf1
Merge pull request #199 from thomasmol/v3
2023-04-29 23:35:42 +01:00
cb176a186e
added num_workers to fix pickling error
2023-04-29 19:51:05 +02:00
cc7e168d2b
add checkout command
v3.0.0
2023-04-25 12:14:23 +01:00
db97f29678
update pip install
2023-04-25 11:19:23 +01:00
25be8210e5
add v3 tag for install
2023-04-25 10:07:34 +01:00
0efad26066
pass compute_type
2023-04-24 21:26:44 +01:00
2a29f0ec6a
add compute types
2023-04-24 21:24:22 +01:00
558d980535
v3 init
2023-04-24 21:08:43 +01:00
da458863d7
allow custom model_dir for torchaudio models
v2.0.1
2023-04-14 21:40:36 +01:00
cf252a8592
allow custom path for vad model
2023-04-14 15:02:58 +01:00
6a72b61564
clamp end_timestamp to prevent infinite loop
2023-04-11 20:15:37 +01:00
48ed89834e
Merge pull request #169 from invisprints/v2-opt-load-model
Optimize the inference process and reduce the memory usage
2023-04-09 13:39:13 +01:00
bb15c9428f
optimize the inference loop
2023-04-09 15:58:55 +08:00
9482d324d0
Merge pull request #162 from dev-nomi/cli_argument_type
Added vad_filter type
2023-04-05 13:40:04 -07:00
4146e56d5b
Added vad_filter type
2023-04-05 17:11:29 +05:00
118e7deedb
Merge pull request #161 from diasks2/fix_typo
Fix typo in utils.py
2023-04-04 19:00:18 -07:00
70a4a0a25c
Fix typo
2023-04-05 10:50:49 +09:00
40948a3d00
pin whisper version to 20230314 to avoid breaking changes
2023-04-04 12:42:34 -07:00
c8be6ac94d
update python example
2023-04-03 12:18:31 -07:00
a582a59493
mkdir for torch cache in case it doesn't exist
2023-04-01 13:05:40 -07:00
861379edc3
Merge pull request #157 from Ryan5453/fix/whisper-req
Fix Requirements
2023-03-31 16:40:19 -07:00
4af345434a
Update requirements.txt
2023-03-31 19:36:38 -04:00
634799b3be
hf token only for diarization
2023-03-31 16:15:40 -07:00
189aeac83e
v2 lets goo
v2.0.0
2023-04-01 00:10:45 +01:00
bc2776017e
v2 lets go
2023-04-01 00:09:29 +01:00
11a78d7ced
handle tmp wav file better
2023-04-01 00:06:40 +01:00
b9ca701d69
.wav conversion, handle audio with no detected speech
2023-03-31 23:02:38 +01:00
d0fa028045
fix tfile naming
2023-03-30 19:24:42 +01:00
ae4a9de307
add vad model external dl
2023-03-30 18:57:55 +01:00
18b63d46e2
skeleton v2
2023-03-30 05:31:57 +01:00
1e7c2c337b
Merge pull request #148 from FernanOrtega/main
Update decoding.py
2023-03-24 07:57:43 -07:00
33dd3b9bcd
Update decoding.py
Changes from https://github.com/openai/whisper/pull/914/
2023-03-24 11:56:41 +01:00
d1b4ff8228
Merge pull request #114 from mshakirDr/patch-1
Fix hugging face error
2023-03-23 15:12:09 -07:00
809700e286
remove soundfile version constraint
2023-03-06 00:20:31 +00:00
cea42ca470
Fix hugging face error
Model should be loaded with an id to avoid this error:
huggingface_hub.utils._validators.HFValidationError: Repo id must use alphanumeric chars or '-', '_', '.', '--' and '..' are forbidden, '-' and '.' cannot start or end the name, max length is 96: 'pyannote\segmentation'.
2023-03-04 19:12:13 +01:00
d1d420e70c
Merge pull request #111 from Barabazs/patch-1
fix: force soundfile version update for mp3 support
2023-03-04 11:46:57 +00:00
844eb30710
fix: force soundfile version update for mp3 support
2023-03-04 11:01:26 +01:00
31e6fe7e36
Merge pull request #107 from JCGoran/fix/python3.7_compatibility
Added Python 3.7 compatibility
2023-03-02 15:31:36 +00:00
cfcede41f6
Added Python 3.7 compatibility
- removed use of walrus operator in favor of `np.cumsum`
2023-03-02 15:46:07 +01:00
186b06e032
paper drop
2023-03-02 12:04:16 +00:00
847a3cd85b
Merge pull request #96 from smly/fix-batch-processing
FIX: Assertion error in batch processing
v1.0.0
2023-02-22 12:11:01 +00:00
2b1ffa12b8
Merge pull request #97 from smly/gpu-vad-filter
GPU acceleration when using VAD filters
2023-02-21 18:57:14 +00:00
57f5957e0e
Pass device to pyannote.audio.Inference
2023-02-22 03:48:20 +09:00
27fe502344
Fix assertion error in batch processing
2023-02-22 02:45:13 +09:00
f7093e60d3
Merge pull request #90 from Pikauba/translation_starting_point_improvement
Improvement to transcription starting point with VAD
2023-02-18 21:59:57 +00:00
a1d2229416
Improvement to transcription starting point with VAD
2023-02-18 11:12:23 -05:00
4cb167a225
Merge pull request #74 from Camb-ai/level-bug-fix
added if clause for checking 'level-1'
2023-02-14 19:22:22 +00:00
2e307814dd
added if clause for checking
2023-02-10 14:48:51 +05:30
d687cf3358
Merge pull request #58 from MahmoudAshraf97/main
added turkish wav2vec2 model
2023-02-01 22:11:51 +00:00