whisperX

mirror of https://github.com/m-bain/whisperX.git synced 2025-07-01 18:17:27 -04:00

Author	SHA1	Message	Date
Matheus Bach	c72c627d10	add on_progress callback	2025-01-25 22:29:55 -03:00
Reinis Ivanovs	36d2622e27	feat: add Latvian align model	2025-01-25 09:45:17 +01:00
liupeng	ffbc73664c	change the docstrings and comments to English	2025-01-13 22:56:48 +08:00
liupeng	289eadfc76	fix a merge error.	2025-01-13 20:26:27 +08:00
bfs18	22a93f2932	Merge branch 'main' into main	2025-01-13 19:34:21 +08:00
Barabazs	2f93e029c7	feat: add SegmentData type for temporary processing during alignment	2025-01-13 10:45:50 +01:00
Barabazs	024bc8481b	refactor: consolidate segment data handling in alignment function	2025-01-13 10:45:50 +01:00
Barabazs	f286e7f3de	refactor: improve type hints and clean up imports	2025-01-13 10:45:50 +01:00
liupeng	4ebfb078c5	make no beam consistent with backtrack.	2025-01-09 23:13:11 +08:00
liupeng	65b2332e13	make align a bit faster.	2025-01-09 19:33:26 +08:00
liupeng	69281f3a29	support timestamps for numbers.	2025-01-09 15:23:40 +08:00
Barabazs	9a8967f27e	refactor: add type hints	2025-01-05 11:48:24 +01:00
bnitsan	1c6d9327bc	feat: use model_dir as cache_dir for wav2vec2 (#681)	2025-01-01 13:22:27 +01:00
pere	9b9e03c4cc	feat: update Norwegian models (#687). Updated Norwegian Bokmål and Norwegian Nynorsk models. Co-authored-by: Barabazs <31799121+Barabazs@users.noreply.github.com>	2024-12-16 11:08:48 +01:00
Ismael Ruiz Ranz	19eff8e79a	feat: add new align models (#922). Co-authored-by: Barabazs <31799121+Barabazs@users.noreply.github.com>	2024-12-16 11:06:43 +01:00
Ruhollah Majdoddin	6f3bc5b7b8	Added Romanian phoneme-based ASR model (#791). Co-authored-by: Barabazs <31799121+Barabazs@users.noreply.github.com>	2024-12-16 08:09:53 +01:00
Jan Panoch	3f339f9515	Update alignment.py - remove commented-out alignment modules for hr language	2024-08-09 13:00:12 +02:00
Jan Panoch	9a9b6171e6	Update alignment.py - trying another hr alignment	2024-08-08 08:37:55 +02:00
Jan Panoch	59b4d88d1d	Update alignment.py - trying another hr alignment file	2024-08-08 08:29:11 +02:00
Jan Panoch	6f70aa6beb	Update alignment.py - added Croatian (hr) language	2024-08-08 08:10:55 +02:00
Jan Panoch	912920c591	Update alignment.py - added alignment for sk and sl languages	2024-08-07 10:05:17 +02:00
pere	5dfbfcbdc0	Adding Norwegian Bokmål and Norwegian Nynorsk. Adding Wav2Vec2 models for Norwegian Bokmål and Norwegian Nynorsk. The models are tested together with WhisperX and work great. For Bokmål I have added the 1B model, even though I see fairly little difference between that and the 300M model. For Norwegian Nynorsk only a 300M model exists. The quality of the Wav2Vec models is also reported here: https://arxiv.org/abs/2307.01672	2023-12-19 08:48:21 +01:00
Kurian Benoy	5756b0fb13	Update alignment.py	2023-11-17 05:21:23 +05:30
Kurian Benoy	aaaa3de810	Update alignment.py	2023-11-17 05:18:19 +05:30
David Martin Rius	9f41c49fe5	Add align model for Catalan language.	2023-11-16 11:43:36 +01:00
Simon Sorg	0c7f32f55c	no align based on space	2023-11-03 19:47:00 +01:00
Mahmoud Ashraf	b69956d725	.	2023-10-16 20:43:37 +03:00
Mahmoud Ashraf	02c0323777	fix	2023-10-15 16:25:15 +03:00
Mahmoud Ashraf	8049dba2f7	fix minimum input length for torch wav2vec2 models	2023-10-06 00:41:23 +03:00
Manohar Reddy	a0b6459c8b	fix: ZeroDivisionError when --print_progress True	2023-09-27 20:10:43 +05:30
piuy11	f137f31de6	Update alignment.py	2023-09-25 15:33:06 +09:00
awerks	4e28492dbd	Update alignment.py	2023-08-17 14:57:53 +02:00
awerks	6cb7267dc2	Update alignment.py	2023-08-17 14:56:54 +02:00
awerks	abbb66b58e	Update alignment.py	2023-08-17 14:53:53 +02:00
awerks	65688208c9	Update alignment.py	2023-08-16 16:18:00 +02:00
awerks	1bb4839b0f	Update alignment.py	2023-08-16 16:13:28 +02:00
awerks	14e593f60b	Update alignment.py	2023-08-16 16:08:25 +02:00
Max Bain	18d5fdc995	Add Telugu language to alignment.py	2023-08-10 12:13:52 +01:00
Ayushi-Desynova	423667f00b	Update alignment.py	2023-08-09 17:08:56 +05:30
Max Bain	8c0fee90d3	Update alignment.py	2023-07-24 10:47:41 +01:00
Ahmad Bilal	734ecc2844	Add Urdu model support for alignment	2023-07-17 19:29:41 +05:00
Max Bain	8d8c027a92	Merge pull request #278 from Mr-Turtleeeee/add_align_for_vi Add wav2vec model for Vietnamese	2023-05-29 12:54:37 +01:00
Max Bain	4cbd3030cc	no sentence split on mr. mrs. dr...	2023-05-29 12:48:14 +01:00
Max Bain	c65e7ba9b4	Merge pull request #280 from Thebys/patch-1	2023-05-27 11:18:27 +01:00
Max Bain	bc8a03881a	Merge pull request #281 from m-bain/v3 fix Unequal Stack Size VAD error	2023-05-26 20:37:57 +01:00
Max Bain	42b4909bc0	fix Unequal Stack Size VAD error	2023-05-26 20:36:03 +01:00
Thebys	bb15d6b68e	Add Czech alignment model. This PR adds the following Czech alignment model: https://huggingface.co/comodoro/wav2vec2-xls-r-300m-cs-250. I have successfully tested this with several Czech audio recordings up to 3 hours long, and the results are satisfactory. However, I received the following warnings and I am not sure how relevant they are: ``` Lightning automatically upgraded your loaded checkpoint from v1.5.4 to v2.0.2. To apply the upgrade to your files permanently, run `python -m pytorch_lightning.utilities.upgrade_checkpoint --file C:\Users\Thebys\.cache\torch\whisperx-vad-segmentation.bin` Model was trained with pyannote.audio 0.0.1, yours is 2.1.1. Bad things might happen unless you revert pyannote.audio to 0.x. Model was trained with torch 1.10.0+cu102, yours is 2.0.0. Bad things might happen unless you revert torch to 1.x. ```	2023-05-26 21:17:01 +02:00
Max Bain	23d405e1cf	Merge branch 'main' into add_align_for_vi	2023-05-26 17:14:09 +01:00
Youssef Boulaoaune	1d9d630fb9	added Korean wav2vec2 model	2023-05-26 20:33:16 +09:00
iambestfeeddddd	9c042c2d28	Add wav2vec model for Vietnamese	2023-05-26 16:46:55 +07:00

80 Commits