whisperX

mirror of https://github.com/m-bain/whisperX.git synced 2025-07-01 18:17:27 -04:00

Author	SHA1	Message	Date
Max Bain	d0fa028045	fix tfile naming	2023-03-30 19:24:42 +01:00
Max Bain	ae4a9de307	add vad model external dl	2023-03-30 18:57:55 +01:00
Max Bain	18b63d46e2	skeleton v2	2023-03-30 05:31:57 +01:00
Muhammad Shakir	cea42ca470	Fix hugging face error Model should be loaded with an id to avoid this error: huggingface_hub.utils._validators.HFValidationError: Repo id must use alphanumeric chars or '-', '_', '.', '--' and '..' are forbidden, '-' and '.' cannot start or end the name, max length is 96: 'pyannote\segmentation'.	2023-03-04 19:12:13 +01:00
m-bain	847a3cd85b	Merge pull request #96 from smly/fix-batch-processing FIX: Assertion error in batch processing	2023-02-22 12:11:01 +00:00
smly	57f5957e0e	Pass device to pyannote.audio.Inference	2023-02-22 03:48:20 +09:00
smly	27fe502344	Fix assertion error in batch processing	2023-02-22 02:45:13 +09:00
Antoine Dufour	a1d2229416	Improvement to transcription starting point with VAD	2023-02-18 11:12:23 -05:00
Max Bain	0a3fd11562	update readme	2023-02-01 22:09:11 +00:00
Tengda Han	039af89a86	support batch processing	2023-02-01 19:41:20 +00:00
Max Bain	50f3965fdb	fix tsv file ext	2023-01-28 17:39:07 +00:00
Max Bain	df2b1b70cb	increase vad cut default	2023-01-28 14:49:53 +00:00
Max Bain	c19cf407d8	handle non-alignable whole segments	2023-01-28 13:53:03 +00:00
Max Bain	c6dbac76c8	cut up vad segments when too long to prevent OOM	2023-01-28 00:01:39 +00:00
Mahmoud Ashraf	e7773358a3	Update transcribe.py added the ability to include HF access token in order to use PyAnnote models	2023-01-26 00:42:35 +02:00
Max Bain	286a2f2c14	clean up logic, use pandas where possibl	2023-01-25 18:42:52 +00:00
Max Bain	eec6d1f8d8	missing word timestamps	2023-01-24 16:37:19 +00:00
Max Bain	d1600e5b0f	Merge branch 'main' of https://github.com/m-bain/whisperX into main Conflicts: whisperx/transcribe.py whisperx/utils.py	2023-01-24 15:38:05 +00:00
Max Bain	d395c21b83	new logic, diarization, vad filtering	2023-01-24 15:02:08 +00:00
Max Bain	ba102feb7f	vad filter	2023-01-20 12:54:20 +00:00
Rhenan Bartels	7adead16e0	Update pt model to wav2vec2-large-xlsr-53-portuguese	2023-01-11 19:50:34 -03:00
Felippe Chemello	7459bf8ad0	Add PT (pt-br) align support	2023-01-11 12:11:41 -03:00
m-bain	d51353a4b6	uncomment .ass	2023-01-08 18:02:36 +00:00
Max Bain	78c87d3bfd	handle negative / tiny duration segments, final	2023-01-08 14:01:10 +00:00
Max Bain	a6eb33778b	additional waveform segment check	2023-01-08 12:24:35 +00:00
Max Bain	44b62064f6	fix starting timestamp for multiple fail-to-aligned words	2023-01-07 14:59:11 +00:00
Max Bain	5a668a7d80	fallback on whisper alignment failures, update readme	2023-01-05 11:15:19 +00:00
m-bain	93d661f2e4	fix whisper hallucination outside of audio length	2022-12-29 10:54:23 +00:00
Yehor Smoliakov	97526f1111	Add Ukrainian wav2vec2 model	2022-12-24 15:05:13 +02:00
Max Bain	c6fa7df3cc	add chinese, dutch. python usage. readme update	2022-12-23 00:41:12 +00:00
Max Bain	e909f2f766	support huggingface + model select based on lang.	2022-12-20 19:54:55 +00:00
Yasutaka Odo	93e568b3bf	replace magic strings	2022-12-21 02:11:08 +09:00
Yasutaka Odo	d7546def91	fix errors when recognizing non-japanese audios	2022-12-21 01:42:45 +09:00
Yasutaka Odo	c8000f6e3f	resolve conflicts	2022-12-21 01:20:35 +09:00
Yasutaka Odo	2ad813c57d	undo unnecessary formatting	2022-12-21 01:03:52 +09:00
Yasutaka Odo	5d7c3b521c	fix error message	2022-12-20 22:29:18 +09:00
Max Bain	cbaeb85034	restructure readme,	2022-12-19 19:41:39 +00:00
Max Bain	228b857597	add back word .srt, update readme	2022-12-19 19:12:50 +00:00
Yasutaka Odo	50cda426ba	add preliminary japanese support	2022-12-19 22:28:28 +09:00
Max Bain	45e9509227	multilingual init	2022-12-18 12:21:24 +00:00
m-bain	59a390d868	remove merge str	2022-12-17 23:47:32 +00:00
Max Bain	575c61b5a9	Merge branch 'main' of https://github.com/m-bain/whisperX into main Conflicts: whisperx/transcribe.py	2022-12-17 17:26:11 +00:00
Max Bain	645d55903a	add .ass output	2022-12-17 17:24:48 +00:00
m-bain	b47bb769a0	change regex sub to align_dictionary sub This allows the alignment method to be used for other languages -- depending on what characters are in the align_model dictionary.	2022-12-17 15:04:08 +00:00
m-bain	938341c05a	remove choices error in args	2022-12-16 12:51:32 +00:00
m-bain	9f9a00ff4c	default condition_on_previous_text to False	2022-12-15 19:44:49 +00:00
Max Bain	9f6fa61160	init commit	2022-12-14 18:59:12 +00:00

47 Commits