Commit Graph

  • 1b08661e42 change arabic model to jonatasgrosman Mahmoud Ashraf 2023-01-31 19:32:31 +02:00
  • a49799294b add arabic wav2vec2 model form elgeish Mahmoud Ashraf 2023-01-31 19:07:48 +02:00
  • d83c74a79f doc: format checklist Barabazs 2023-01-29 16:07:58 +01:00
  • acaefa09a1 Merge pull request #46 from Barabazs/main m-bain 2023-01-28 19:05:36 +00:00
  • 76f79f600a fix short seg timestamps bug Max Bain 2023-01-28 19:04:19 +00:00
  • 33073f9bba Create FUNDING.yml Barabazs 2023-01-28 19:43:27 +01:00
  • 50f3965fdb fix tsv file ext Max Bain 2023-01-28 17:39:07 +00:00
  • df2b1b70cb increase vad cut default Max Bain 2023-01-28 14:49:53 +00:00
  • c19cf407d8 handle non-alignable whole segments Max Bain 2023-01-28 13:53:03 +00:00
  • 8081ef2dcd add custom vad binarization for vad cut Max Bain 2023-01-28 00:22:33 +00:00
  • c6dbac76c8 cut up vad segments when too long to prevent OOM Max Bain 2023-01-28 00:01:39 +00:00
  • 69673eb39b buy-me-a-coffee Max Bain 2023-01-27 15:12:49 +00:00
  • 5b8c8a7bd3 pandas fix Max Bain 2023-01-27 15:05:08 +00:00
  • 7f2159a953 Merge branch 'main' of https://github.com/m-bain/whisperX into main Max Bain 2023-01-26 10:46:36 +00:00
  • 16d24b1c96 only pad timestamps if not using VAD Max Bain 2023-01-26 10:46:13 +00:00
  • d20a2a4ea2 typo in --diarize flag m-bain 2023-01-26 10:28:54 +00:00
  • 312f1cc50c Merge pull request #40 from MahmoudAshraf97/main m-bain 2023-01-26 00:34:03 +00:00
  • 99b6e79fbf Update README.md Mahmoud Ashraf 2023-01-26 00:56:10 +02:00
  • e7773358a3 Update transcribe.py Mahmoud Ashraf 2023-01-26 00:42:35 +02:00
  • 6b2aa4ff3e Merge pull request #1 from MahmoudAshraf97/patch-1 Mahmoud Ashraf 2023-01-26 00:37:38 +02:00
  • c3de5e9580 Update README.md Mahmoud Ashraf 2023-01-26 00:36:29 +02:00
  • 58d7191949 add diarize Max Bain 2023-01-25 19:40:41 +00:00
  • 286a2f2c14 clean up logic, use pandas where possibl Max Bain 2023-01-25 18:42:52 +00:00
  • eec6d1f8d8 missing word timestamps stable Max Bain 2023-01-24 16:37:19 +00:00
  • d1600e5b0f Merge branch 'main' of https://github.com/m-bain/whisperX into main Max Bain 2023-01-24 15:38:05 +00:00
  • d395c21b83 new logic, diarization, vad filtering Max Bain 2023-01-24 15:02:08 +00:00
  • ba102feb7f vad filter Max Bain 2023-01-20 12:54:20 +00:00
  • 4569cb982a fix file_ass display bug m-bain 2023-01-12 12:57:12 +00:00
  • ce281eb6f6 Merge pull request #28 from aosfatos/update/wav2vec2-large-xlsr-53-portuguese m-bain 2023-01-12 09:10:02 +00:00
  • 7adead16e0 Update pt model to wav2vec2-large-xlsr-53-portuguese Rhenan Bartels 2023-01-11 19:50:34 -03:00
  • a4edb130ef Merge pull request #27 from FelippeChemello/main m-bain 2023-01-11 15:35:15 +00:00
  • 7459bf8ad0 Add PT (pt-br) align support Felippe Chemello 2023-01-11 12:11:41 -03:00
  • d51353a4b6 uncomment .ass m-bain 2023-01-08 18:02:36 +00:00
  • 78c87d3bfd handle negative / tiny duration segments, final Max Bain 2023-01-08 14:01:10 +00:00
  • a6eb33778b additional waveform segment check Max Bain 2023-01-08 12:24:35 +00:00
  • 857bcca238 Merge branch 'main' of https://github.com/m-bain/whisperX into main Max Bain 2023-01-07 15:00:22 +00:00
  • 44b62064f6 fix starting timestamp for multiple fail-to-aligned words Max Bain 2023-01-07 14:59:11 +00:00
  • 2aa074e0e6 remove duplicate line m-bain 2023-01-05 13:12:11 +00:00
  • 5a668a7d80 fallback on whisper alignment failures, update readme Max Bain 2023-01-05 11:15:19 +00:00
  • 93d661f2e4 fix whisper hallucination outside of audio length m-bain 2022-12-29 10:54:23 +00:00
  • 644b04e8d1 Merge pull request #13 from egorsmkv/patch-1 m-bain 2022-12-25 20:32:56 +00:00
  • 97526f1111 Add Ukrainian wav2vec2 model Yehor Smoliakov 2022-12-24 15:05:13 +02:00
  • c912f96ed3 add examples.md Max Bain 2022-12-23 00:50:32 +00:00
  • c6fa7df3cc add chinese, dutch. python usage. readme update Max Bain 2022-12-23 00:41:12 +00:00
  • e909f2f766 support huggingface + model select based on lang. Max Bain 2022-12-20 19:54:55 +00:00
  • 8b2f40d02a Change resolution for Japanese example video Yasutaka Odo 2022-12-21 02:44:11 +09:00
  • 93e568b3bf replace magic strings Yasutaka Odo 2022-12-21 02:11:08 +09:00
  • d7546def91 fix errors when recognizing non-japanese audios Yasutaka Odo 2022-12-21 01:42:45 +09:00
  • 7a3575dfdf fix formatting Yasutaka Odo 2022-12-21 01:23:02 +09:00
  • c8000f6e3f resolve conflicts Yasutaka Odo 2022-12-21 01:20:35 +09:00
  • 2ad813c57d undo unnecessary formatting Yasutaka Odo 2022-12-21 01:03:52 +09:00
  • 080dc1b337 fix path for sample japanese audio Yasutaka Odo 2022-12-21 00:42:39 +09:00
  • f6cd4b9bed correct example command for japanese Yasutaka Odo 2022-12-21 00:39:48 +09:00
  • 0e61f10443 Merge branch 'main' of https://github.com/yasutak/whisperX into main Yasutaka Odo 2022-12-21 00:31:38 +09:00
  • 5d7c3b521c fix error message Yasutaka Odo 2022-12-20 22:29:18 +09:00
  • cbaeb85034 restructure readme, Max Bain 2022-12-19 19:41:39 +00:00
  • 228b857597 add back word .srt, update readme Max Bain 2022-12-19 19:12:50 +00:00
  • 398ede98d5 Add missing space Yasutaka Odo 2022-12-20 04:02:25 +09:00
  • d1288204c8 add sample Japanese audio Yasutaka Odo 2022-12-19 23:41:23 +09:00
  • f00e9cb149 add sample audio for japanese example Yasutaka Odo 2022-12-19 23:40:05 +09:00
  • 5437be01db Update README.md Yasutaka Odo 2022-12-19 23:30:01 +09:00
  • 1c8ea78975 Update README.md Yasutaka Odo 2022-12-19 23:12:04 +09:00
  • 479088ac31 Update README.md Yasutaka Odo 2022-12-19 23:09:05 +09:00
  • 62adf12d4e add preliminary japanese support Yasutaka Odo 2022-12-19 22:34:59 +09:00
  • 50cda426ba add preliminary japanese support Yasutaka Odo 2022-12-19 22:28:28 +09:00
  • 6b64cb079a add arch figure, citation m-bain 2022-12-18 18:43:33 +00:00
  • 3a91aa1384 Update README.md m-bain 2022-12-18 12:43:17 +00:00
  • 87d4af8e17 add other language examples m-bain 2022-12-18 12:39:16 +00:00
  • 45e9509227 multilingual init Max Bain 2022-12-18 12:21:24 +00:00
  • 59a390d868 remove merge str m-bain 2022-12-17 23:47:32 +00:00
  • 550b8b760d update license to bsd-4 m-bain 2022-12-17 17:38:12 +00:00
  • 6b41216902 Update README.md m-bain 2022-12-17 17:34:38 +00:00
  • 575c61b5a9 Merge branch 'main' of https://github.com/m-bain/whisperX into main Max Bain 2022-12-17 17:26:11 +00:00
  • 645d55903a add .ass output Max Bain 2022-12-17 17:24:48 +00:00
  • b47bb769a0 change regex sub to align_dictionary sub m-bain 2022-12-17 15:04:08 +00:00
  • 938341c05a remove choices error in args m-bain 2022-12-16 12:51:32 +00:00
  • 9f9a00ff4c default condition_on_previous_text to False m-bain 2022-12-15 19:44:49 +00:00
  • 55a8cce437 add torchaudio to reqs m-bain 2022-12-15 13:42:11 +00:00
  • fb2b727e82 Update README.md m-bain 2022-12-15 01:01:48 +00:00
  • 5a9d0fa06b Update README.md m-bain 2022-12-15 00:31:20 +00:00
  • 681d4c6749 Update README.md m-bain 2022-12-14 19:02:32 +00:00
  • 9f6fa61160 init commit Max Bain 2022-12-14 18:59:12 +00:00
  • 9791862c45 Create README.md m-bain 2022-12-13 18:18:40 +00:00