|
0a3fd11562
|
update readme
|
2023-02-01 22:09:11 +00:00 |
|
|
039af89a86
|
support batch processing
|
2023-02-01 19:41:20 +00:00 |
|
|
50f3965fdb
|
fix tsv file ext
|
2023-01-28 17:39:07 +00:00 |
|
|
df2b1b70cb
|
increase vad cut default
|
2023-01-28 14:49:53 +00:00 |
|
|
c19cf407d8
|
handle non-alignable whole segments
|
2023-01-28 13:53:03 +00:00 |
|
|
c6dbac76c8
|
cut up vad segments when too long to prevent OOM
|
2023-01-28 00:01:39 +00:00 |
|
|
e7773358a3
|
Update transcribe.py
added the ability to include HF access token in order to use PyAnnote models
|
2023-01-26 00:42:35 +02:00 |
|
|
286a2f2c14
|
clean up logic, use pandas where possibl
|
2023-01-25 18:42:52 +00:00 |
|
|
eec6d1f8d8
|
missing word timestamps
|
2023-01-24 16:37:19 +00:00 |
|
|
d1600e5b0f
|
Merge branch 'main' of https://github.com/m-bain/whisperX into main
Conflicts:
whisperx/transcribe.py
whisperx/utils.py
|
2023-01-24 15:38:05 +00:00 |
|
|
d395c21b83
|
new logic, diarization, vad filtering
|
2023-01-24 15:02:08 +00:00 |
|
|
ba102feb7f
|
vad filter
|
2023-01-20 12:54:20 +00:00 |
|
|
7adead16e0
|
Update pt model to wav2vec2-large-xlsr-53-portuguese
|
2023-01-11 19:50:34 -03:00 |
|
|
7459bf8ad0
|
Add PT (pt-br) align support
|
2023-01-11 12:11:41 -03:00 |
|
|
d51353a4b6
|
uncomment .ass
|
2023-01-08 18:02:36 +00:00 |
|
|
78c87d3bfd
|
handle negative / tiny duration segments, final
|
2023-01-08 14:01:10 +00:00 |
|
|
a6eb33778b
|
additional waveform segment check
|
2023-01-08 12:24:35 +00:00 |
|
|
44b62064f6
|
fix starting timestamp for multiple fail-to-aligned words
|
2023-01-07 14:59:11 +00:00 |
|
|
5a668a7d80
|
fallback on whisper alignment failures, update readme
|
2023-01-05 11:15:19 +00:00 |
|
|
93d661f2e4
|
fix whisper hallucination outside of audio length
|
2022-12-29 10:54:23 +00:00 |
|
|
97526f1111
|
Add Ukrainian wav2vec2 model
|
2022-12-24 15:05:13 +02:00 |
|
|
c6fa7df3cc
|
add chinese, dutch. python usage. readme update
|
2022-12-23 00:41:12 +00:00 |
|
|
e909f2f766
|
support huggingface + model select based on lang.
|
2022-12-20 19:54:55 +00:00 |
|
|
93e568b3bf
|
replace magic strings
|
2022-12-21 02:11:08 +09:00 |
|
|
d7546def91
|
fix errors when recognizing non-japanese audios
|
2022-12-21 01:42:45 +09:00 |
|
|
c8000f6e3f
|
resolve conflicts
|
2022-12-21 01:20:35 +09:00 |
|
|
2ad813c57d
|
undo unnecessary formatting
|
2022-12-21 01:03:52 +09:00 |
|
|
5d7c3b521c
|
fix error message
|
2022-12-20 22:29:18 +09:00 |
|
|
cbaeb85034
|
restructure readme,
|
2022-12-19 19:41:39 +00:00 |
|
|
228b857597
|
add back word .srt, update readme
|
2022-12-19 19:12:50 +00:00 |
|
|
50cda426ba
|
add preliminary japanese support
|
2022-12-19 22:28:28 +09:00 |
|
|
45e9509227
|
multilingual init
|
2022-12-18 12:21:24 +00:00 |
|
|
59a390d868
|
remove merge str
|
2022-12-17 23:47:32 +00:00 |
|
|
575c61b5a9
|
Merge branch 'main' of https://github.com/m-bain/whisperX into main
Conflicts:
whisperx/transcribe.py
|
2022-12-17 17:26:11 +00:00 |
|
|
645d55903a
|
add .ass output
|
2022-12-17 17:24:48 +00:00 |
|
|
b47bb769a0
|
change regex sub to align_dictionary sub
This allows the alignment method to be used for other languages -- depending on what characters are in the align_model dictionary.
|
2022-12-17 15:04:08 +00:00 |
|
|
938341c05a
|
remove choices error in args
|
2022-12-16 12:51:32 +00:00 |
|
|
9f9a00ff4c
|
default condition_on_previous_text to False
|
2022-12-15 19:44:49 +00:00 |
|
|
9f6fa61160
|
init commit
|
2022-12-14 18:59:12 +00:00 |
|