whisperX

mirror of https://github.com/m-bain/whisperX.git synced 2025-07-01 18:17:27 -04:00

Author	SHA1	Message	Date
Abhishek Sharma	51da22771f	feat: add verbose output (#759 ) --------- Co-authored-by: Abhishek Sharma <abhishek@zipteams.com> Co-authored-by: Barabazs <31799121+Barabazs@users.noreply.github.com>	2025-01-01 13:07:52 +01:00
Icaro Bombonato	15ad5bf7df	feat: update versions for pyannote:3.3.2 and faster-whisper:1.1.0 (#936 ) * chore: bump faster-whisper to 1.1.0 * chore: bump pyannote to 3.3.2 * feat: add multilingual option in load_model function --------- Co-authored-by: Barabazs <31799121+Barabazs@users.noreply.github.com>	2024-12-31 10:41:09 +01:00
Hasan Naseer	7fdbd21fe3	feat: add support for faster-whisper 1.0.3 (#875 ) --------- Co-authored-by: Barabazs <31799121+Barabazs@users.noreply.github.com>	2024-12-31 10:07:42 +01:00
moritzbrantner	3ff625c561	feat: update faster-whisper to 1.0.2 (#814 ) * Update faster-whisper to 1.0.2 to enable model distil-large-v3 * feat: add hotwords option to default_asr_options --------- Co-authored-by: Barabazs <31799121+Barabazs@users.noreply.github.com>	2024-12-31 09:41:22 +01:00
Max Bain	bbaa2f0d1a	update kwargs	2024-02-22 15:59:14 +00:00
kossaisbai	2686f74bc9	Get rid of numeral_symbol_tokens variable in printed message	2024-01-19 22:25:21 +00:00
Full Name	6bb2f1cd48	Added Vad custom option	2024-01-01 14:56:51 +05:30
MahmoudAshraf97	71a5281bde	support for `large-v3`	2023-11-25 12:09:00 +00:00
Douglas Trajano	bd3aa03b6f	Move load_model after WhisperModel	2023-11-16 08:59:28 -03:00
kaka1909	48d651e5ea	Update asr.py and make the model parameter be used	2023-11-16 15:29:24 +08:00
Jakub Kukul	14a7cab8eb	Pass patience and beam_size to faster-whisper.	2023-10-14 13:51:29 +02:00
Andrew Bettke	79801167ac	Fix: Allow vad options to be configurable by correctly passing down to FasterWhisperPipeline.	2023-10-05 10:06:34 -04:00
Max Bain	ffd6167b26	Merge pull request #473 from sorgfresser/fix-faster-whisper-threads	2023-09-19 16:53:34 -07:00
Simon Sorg	0ae0d49d1d	add faster whisper threading	2023-09-14 11:47:51 +02:00
Remc	15451d0f1c	fix: correct defaut_asr_options with new options (patch 0.8)	2023-09-04 17:08:19 +02:00
Max Bain	9647f60fca	Merge branch 'main' into add-merge-chunk-size-as-argument	2023-08-29 10:05:05 -06:00
陳鈞	eb771cf56d	feat: Add merge chunks chunk_size as arguments. Suggest from https://github.com/m-bain/whisperX/issues/200#issuecomment-1666507780	2023-08-29 23:09:02 +08:00
awerks	ea7bb91a56	Update asr.py	2023-08-17 14:49:57 +02:00
awerks	72685d0398	Update asr.py	2023-08-16 16:15:24 +02:00
awerks	4acb5b3abc	Update asr.py	2023-08-16 16:11:46 +02:00
briguetjo	225f6b4d69	fix suppress_numerals	2023-07-29 19:34:51 +02:00
briguetjo	864976af23	fix issue by resetting tokenizer	2023-07-29 18:56:33 +02:00
briguetjo	9d736dca1c	add some warning if languages do not match	2023-07-29 18:20:59 +02:00
briguetjo	d87f6268d0	fix preset language	2023-07-29 18:13:36 +02:00
Max Bain	d7f1d16f19	suppress numerals change logic	2023-06-05 15:44:17 +01:00
Max Bain	74a00eecd7	suppress numerals fix	2023-06-05 15:33:04 +01:00
Max Bain	b026407fd9	Merge branch 'v3' of https://github.com/m-bain/whisperX into v3 Conflicts: whisperx/asr.py	2023-06-05 15:30:02 +01:00
Max Bain	a323cff654	--suppress_numerals option, ensures non-numerical words, for wav2vec2 alignment	2023-06-05 15:27:42 +01:00
prameshbajra	5a47f458ac	Added download path parameter.	2023-05-27 11:38:54 +02:00
Simon	7c5468116f	Merge branch 'm-bain:main' into transcribe_keywords	2023-05-20 16:03:40 +02:00
Simon	a1c705b3a7	fix tokenizer is None	2023-05-20 15:52:45 +02:00
Simon	715435db42	add tokenizer is None case	2023-05-20 15:42:21 +02:00
Simon	1fc965bc1a	add task, language keyword to transcribe	2023-05-20 15:30:25 +02:00
Simon	53396adb21	add device_index	2023-05-20 13:02:46 +02:00
Max Bain	d8a2b4ffc9	Merge pull request #246 from m-bain/v3 V3	2023-05-13 12:18:09 +01:00
Max Bain	fd8f1003cf	add translate, fix word_timestamp error	2023-05-13 12:14:06 +01:00
Simon	eabf35dff0	Custom result types	2023-05-08 20:45:34 +02:00
Simon	b50aafb17b	Fix tuple unpacking	2023-05-08 20:03:42 +02:00
Max Bain	24008aa1ed	fix long segments, break into sentences using nltk, improve align logic, improve diarize (sentence-based)	2023-05-07 15:32:58 +01:00
Max Bain	4e2ac4e4e9	torch2.0, remove compile for now, round to times to 3 decimal	2023-05-04 20:38:13 +01:00
Simon	2d59eb9726	Add torch compile to log mel spectrogram	2023-05-03 23:17:44 +02:00
Simon	b9c8c5072b	Pad language detection if audio is too short	2023-04-30 18:34:18 +02:00
Thomas Mol	cb176a186e	added num_workers to fix pickling error	2023-04-29 19:51:05 +02:00
Max Bain	558d980535	v3 init	2023-04-24 21:08:43 +01:00
m-bain	6a72b61564	clamp end_timestamp to prevent infinite loop	2023-04-11 20:15:37 +01:00
Max Bain	b9ca701d69	.wav conversion, handle audio with no detected speech	2023-03-31 23:02:38 +01:00
Max Bain	ae4a9de307	add vad model external dl	2023-03-30 18:57:55 +01:00
Max Bain	18b63d46e2	skeleton v2	2023-03-30 05:31:57 +01:00

48 Commits