whisperX

mirror of https://github.com/m-bain/whisperX.git synced 2025-07-01 18:17:27 -04:00

Author	SHA1	Message	Date
Simon	b9c8c5072b	Pad language detection if audio is too short	2023-04-30 18:34:18 +02:00
Max Bain	a903e57cf1	Merge pull request #199 from thomasmol/v3	2023-04-29 23:35:42 +01:00
Thomas Mol	cb176a186e	added num_workers to fix pickling error	2023-04-29 19:51:05 +02:00
m-bain	cc7e168d2b	add checkout command v3.0.0	2023-04-25 12:14:23 +01:00
m-bain	db97f29678	update pip install	2023-04-25 11:19:23 +01:00
m-bain	25be8210e5	add v3 tag for install	2023-04-25 10:07:34 +01:00
Max Bain	0efad26066	pass compute_type	2023-04-24 21:26:44 +01:00
Max Bain	2a29f0ec6a	add compute types	2023-04-24 21:24:22 +01:00
Max Bain	558d980535	v3 init	2023-04-24 21:08:43 +01:00
Max Bain	da458863d7	allow custom model_dir for torchaudio models v2.0.1	2023-04-14 21:40:36 +01:00
Max Bain	cf252a8592	allow custom path for vad model	2023-04-14 15:02:58 +01:00
m-bain	6a72b61564	clamp end_timestamp to prevent infinite loop	2023-04-11 20:15:37 +01:00
m-bain	48ed89834e	Merge pull request #169 from invisprints/v2-opt-load-model Optimize the inference process and reduce the memory usage	2023-04-09 13:39:13 +01:00
invisprints	bb15c9428f	opti the inference loop	2023-04-09 15:58:55 +08:00
m-bain	9482d324d0	Merge pull request #162 from dev-nomi/cli_argument_type Added vad_filter type	2023-04-05 13:40:04 -07:00
dev-nomi	4146e56d5b	Added vad_filter type	2023-04-05 17:11:29 +05:00
m-bain	118e7deedb	Merge pull request #161 from diasks2/fix_typo Fix typo in utils.py	2023-04-04 19:00:18 -07:00
Kevin Dias	70a4a0a25c	Fix typo	2023-04-05 10:50:49 +09:00
m-bain	40948a3d00	fix whisper version to 20230314 for no breaking	2023-04-04 12:42:34 -07:00
m-bain	c8be6ac94d	update python example	2023-04-03 12:18:31 -07:00
m-bain	a582a59493	mkdir for torch cache in case it doesnt exist	2023-04-01 13:05:40 -07:00
m-bain	861379edc3	Merge pull request #157 from Ryan5453/fix/whisper-req Fix Requirements	2023-03-31 16:40:19 -07:00
Ryan	4af345434a	Update requirements.txt	2023-03-31 19:36:38 -04:00
m-bain	634799b3be	hf token only for diarization	2023-03-31 16:15:40 -07:00
Max Bain	189aeac83e	v2 lets goo v2.0.0	2023-04-01 00:10:45 +01:00
Max Bain	bc2776017e	v2 lets go	2023-04-01 00:09:29 +01:00
Max Bain	11a78d7ced	handle tmp wav file better	2023-04-01 00:06:40 +01:00
Max Bain	b9ca701d69	.wav conversion, handle audio with no detected speech	2023-03-31 23:02:38 +01:00
Max Bain	d0fa028045	fix tfile naming	2023-03-30 19:24:42 +01:00
Max Bain	ae4a9de307	add vad model external dl	2023-03-30 18:57:55 +01:00
Max Bain	18b63d46e2	skeleton v2	2023-03-30 05:31:57 +01:00
m-bain	1e7c2c337b	Merge pull request #148 from FernanOrtega/main Update decoding.py	2023-03-24 07:57:43 -07:00
Fernando O. Gallego	33dd3b9bcd	Update decoding.py Changes from https://github.com/openai/whisper/pull/914/	2023-03-24 11:56:41 +01:00
m-bain	d1b4ff8228	Merge pull request #114 from mshakirDr/patch-1 Fix hugging face error	2023-03-23 15:12:09 -07:00
m-bain	809700e286	remove soundfile version constraint	2023-03-06 00:20:31 +00:00
Muhammad Shakir	cea42ca470	Fix hugging face error Model should be loaded with an id to avoid this error: huggingface_hub.utils._validators.HFValidationError: Repo id must use alphanumeric chars or '-', '_', '.', '--' and '..' are forbidden, '-' and '.' cannot start or end the name, max length is 96: 'pyannote\segmentation'.	2023-03-04 19:12:13 +01:00
m-bain	d1d420e70c	Merge pull request #111 from Barabazs/patch-1 fix: force soundfile version update for mp3 support	2023-03-04 11:46:57 +00:00
Barabazs	844eb30710	fix: force soundfile version update for mp3 support	2023-03-04 11:01:26 +01:00
m-bain	31e6fe7e36	Merge pull request #107 from JCGoran/fix/python3.7_compatibility Added Python 3.7 compatibility	2023-03-02 15:31:36 +00:00
JCGoran	cfcede41f6	Added Python 3.7 compatibility - removed use of walrus operator in favor of `np.cumsum`	2023-03-02 15:46:07 +01:00
m-bain	186b06e032	paper drop	2023-03-02 12:04:16 +00:00
m-bain	847a3cd85b	Merge pull request #96 from smly/fix-batch-processing FIX: Assertion error in batch processing v1.0.0	2023-02-22 12:11:01 +00:00
m-bain	2b1ffa12b8	Merge pull request #97 from smly/gpu-vad-filter GPU acceleration when using VAD filters	2023-02-21 18:57:14 +00:00
smly	57f5957e0e	Pass device to pyannote.audio.Inference	2023-02-22 03:48:20 +09:00
smly	27fe502344	Fix assertion error in batch processing	2023-02-22 02:45:13 +09:00
m-bain	f7093e60d3	Merge pull request #90 from Pikauba/translation_starting_point_improvement Improvement to transcription starting point with VAD	2023-02-18 21:59:57 +00:00
Antoine Dufour	a1d2229416	Improvement to transcription starting point with VAD	2023-02-18 11:12:23 -05:00
m-bain	4cb167a225	Merge pull request #74 from Camb-ai/level-bug-fix added if clause for checking 'level-1'	2023-02-14 19:22:22 +00:00
arnavmehta7	2e307814dd	added if clause for checking	2023-02-10 14:48:51 +05:30
m-bain	d687cf3358	Merge pull request #58 from MahmoudAshraf97/main added turkish wav2vec2 model	2023-02-01 22:11:51 +00:00

1 2 3

142 Commits