whisperX

mirror of https://github.com/m-bain/whisperX.git synced 2025-07-01 18:17:27 -04:00

Author	SHA1	Message	Date
canoalberto	942c336b8f	Fixes --model_dir path	2023-12-27 14:03:54 -05:00
pere	5dfbfcbdc0	Adding Norwegian Bokmål and Norwegian Nynorsk Adding Wav2Vec2-models for Norwegian Bokmål and Norwegian Nynorsk. The models are testet together with WhisperX, and works great. For Bokmål I have added the 1B model, even if I see fairly little difference between that and the 300M model. For Norwegian Nynorsk only a 300M exist.The quality of the Wav2Vec models are also reported here: https://arxiv.org/abs/2307.01672	2023-12-19 08:48:21 +01:00
Max Bain	089cd5ab21	Merge pull request #585 from kurianbenoy/ml-asr Add alignment model for Malayalam	2023-12-10 17:35:14 -06:00
Mahmoud Ashraf	f865dfe710	fix typo	2023-12-04 17:38:50 +03:00
Mahmoud Ashraf	4acbdd75be	add "yue" to supported languages that was added along with Large-V3	2023-12-04 17:27:54 +03:00
MahmoudAshraf97	71a5281bde	support for `large-v3`	2023-11-25 12:09:00 +00:00
Remc	20161935a1	feat: pass model to 3.1 in code	2023-11-17 11:12:16 +01:00
Kurian Benoy	5756b0fb13	Update alignment.py	2023-11-17 05:21:23 +05:30
Kurian Benoy	aaaa3de810	Update alignment.py	2023-11-17 05:18:19 +05:30
Douglas Trajano	bd3aa03b6f	Move load_model after WhisperModel	2023-11-16 08:59:28 -03:00
Max Bain	f5c544ff90	Merge pull request #581 from davidmartinrius/catalan_align_model Add align model for catalan language.	2023-11-16 10:54:24 +00:00
David Martin Rius	9f41c49fe5	Add align model for catalan language.	2023-11-16 11:43:36 +01:00
kaka1909	48d651e5ea	Update asr.py and make the model parameter be used	2023-11-16 15:29:24 +08:00
Max Bain	4ece2369d7	Merge pull request #556 from sorgfresser/remove-space-segment-align no align based on space	2023-11-11 02:03:56 +00:00
hidenori-endo	6703d2774b	Drop ffmpeg-python dependency	2023-11-10 03:26:47 +09:00
Simon Sorg	0c7f32f55c	no align based on space	2023-11-03 19:47:00 +01:00
Simon Sorg	6936dd6991	default t	2023-11-03 18:50:15 +01:00
amosal	d4a600b568	REMOVE duplicated code	2023-10-31 18:55:50 +01:00
amosal	afd5ef1d58	FIX warnings for word options	2023-10-31 18:55:35 +01:00
Max Bain	c6fe379d9e	Merge pull request #517 from jkukul/support-language-names-as-parameters Support language names in `--language` parameter.	2023-10-25 11:16:30 -07:00
Max Bain	66808f6147	Merge pull request #529 from MahmoudAshraf97/main	2023-10-16 10:53:18 -07:00
Mahmoud Ashraf	b69956d725	.	2023-10-16 20:43:37 +03:00
Mahmoud Ashraf	02c0323777	fix	2023-10-15 16:25:15 +03:00
Jakub Kukul	14a7cab8eb	Pass patience and beam_size to faster-whisper.	2023-10-14 13:51:29 +02:00
Marco Vela	a5356509b6	fix(diarize): key error on empty track	2023-10-10 14:50:41 -05:00
Jakub Kukul	1001a055db	Support language names in --language.	2023-10-10 13:55:47 +02:00
Mahmoud Ashraf	8049dba2f7	fix minimum input length for torch wav2vec2 models	2023-10-06 00:41:23 +03:00
Andrew Bettke	79801167ac	Fix: Allow vad options to be configurable by correctly passing down to FasterWhisperPipeline.	2023-10-05 10:06:34 -04:00
Manohar Reddy	a0b6459c8b	fix: ZeroDivisionError when --print_progress True	2023-09-27 20:10:43 +05:30
Max Bain	2a11ce3ef0	Merge pull request #487 from piuy11/main Update alignment.py	2023-09-26 14:17:46 -07:00
Remc	b17908473d	correct 3.0 pyannote weights	2023-09-26 17:18:20 +02:00
piuy11	f137f31de6	Update alignment.py	2023-09-25 15:33:06 +09:00
Max Bain	ffd6167b26	Merge pull request #473 from sorgfresser/fix-faster-whisper-threads	2023-09-19 16:53:34 -07:00
Simon Sorg	0ae0d49d1d	add faster whisper threading	2023-09-14 11:47:51 +02:00
darwintree	c6d9e6cb67	chore(writer): improve text display(ja etc) in json file	2023-09-10 22:02:47 +08:00
awerks	2ca99ce909	A solution to long subitles Example usage: subtitles_proccessor = SubtitlesProcessor(output["segments"], detected_language, max_line_length = 50, min_char_length_splitter = 35) subtitles_proccessor.save("subtitles.srt", advanced_splitting = True)	2023-09-04 21:49:34 +02:00
Remc	15451d0f1c	fix: correct defaut_asr_options with new options (patch 0.8)	2023-09-04 17:08:19 +02:00
陳鈞	5223de2a41	fix: UnboundLocalError: local variable 'align_language' referenced before assignment	2023-08-30 01:11:09 +08:00
陳鈞	f505702dc7	chore(writer): Join words without spaces for ja, zh fix #248, fix #310	2023-08-30 01:11:09 +08:00
Max Bain	9647f60fca	Merge branch 'main' into add-merge-chunk-size-as-argument	2023-08-29 10:05:05 -06:00
Max Bain	a8bfac6bef	Merge pull request #427 from awerks/main Update alignment.py	2023-08-29 10:03:46 -06:00
陳鈞	eb771cf56d	feat: Add merge chunks chunk_size as arguments. Suggest from https://github.com/m-bain/whisperX/issues/200#issuecomment-1666507780	2023-08-29 23:09:02 +08:00
invisprints	cc81ab7db7	fix missing prefix Fixed missing the speaker part when enable --highlight_words	2023-08-25 12:08:16 +08:00
awerks	4e28492dbd	Update alignment.py	2023-08-17 14:57:53 +02:00
awerks	6cb7267dc2	Update alignment.py	2023-08-17 14:56:54 +02:00
awerks	abbb66b58e	Update alignment.py	2023-08-17 14:53:53 +02:00
awerks	ea7bb91a56	Update asr.py	2023-08-17 14:49:57 +02:00
awerks	d2d840f06c	Update utils.py	2023-08-17 14:45:23 +02:00
Max Bain	0a1137e41c	Merge pull request #429 from sorgfresser/no-segments-writer fix writer fail on segments 0	2023-08-17 13:20:38 +01:00
Simon Sorg	0767597bff	fix writer fail on segments 0	2023-08-17 14:18:16 +02:00

1 2 3 4

197 Commits