29e95b746b
Merge pull request #57 from TengdaHan/main
...
support batch processing
2023-02-01 20:37:54 +00:00
039af89a86
support batch processing
2023-02-01 19:41:20 +00:00
fd2a093754
Merge pull request #55 from jonatasgrosman/main
...
FIX: Error when loading Hugging Face's models with embedded LM
2023-02-01 10:27:45 +00:00
31f069752f
Merge pull request #53 from MahmoudAshraf97/main
...
Add more languages to models list
2023-02-01 10:27:25 +00:00
4cdf7ef856
Merge pull request #48 from Barabazs/main
...
doc: format checklist
2023-02-01 10:26:58 +00:00
d294e29ad9
fix: error when loading huggingface model with embedded language model
2023-01-31 23:24:26 -03:00
0eae9e1f50
added several wav2vec2 models by jonatasgrosman
...
since his models were used in other languages before and I tested the arabic model myself, I assumed it's safe to include all the available models
2023-02-01 03:02:10 +02:00
1b08661e42
change arabic model to jonatasgrosman
2023-01-31 19:32:31 +02:00
a49799294b
add arabic wav2vec2 model form elgeish
2023-01-31 19:07:48 +02:00
d83c74a79f
doc: format checklist
2023-01-29 16:07:58 +01:00
acaefa09a1
Merge pull request #46 from Barabazs/main
...
Add sponsor link to sidebar
2023-01-28 19:05:36 +00:00
76f79f600a
fix short seg timestamps bug
2023-01-28 19:04:19 +00:00
33073f9bba
Create FUNDING.yml
2023-01-28 19:43:27 +01:00
50f3965fdb
fix tsv file ext
2023-01-28 17:39:07 +00:00
df2b1b70cb
increase vad cut default
2023-01-28 14:49:53 +00:00
c19cf407d8
handle non-alignable whole segments
2023-01-28 13:53:03 +00:00
8081ef2dcd
add custom vad binarization for vad cut
2023-01-28 00:22:33 +00:00
c6dbac76c8
cut up vad segments when too long to prevent OOM
2023-01-28 00:01:39 +00:00
69673eb39b
buy-me-a-coffee
2023-01-27 15:12:49 +00:00
5b8c8a7bd3
pandas fix
2023-01-27 15:05:08 +00:00
7f2159a953
Merge branch 'main' of https://github.com/m-bain/whisperX into main
2023-01-26 10:46:36 +00:00
16d24b1c96
only pad timestamps if not using VAD
2023-01-26 10:46:13 +00:00
d20a2a4ea2
typo in --diarize flag
2023-01-26 10:28:54 +00:00
312f1cc50c
Merge pull request #40 from MahmoudAshraf97/main
...
Added arguments and instructions to enable the usage VAD and Diarization
2023-01-26 00:34:03 +00:00
99b6e79fbf
Update README.md
...
added additional instructions to use PyAnnote modules
2023-01-26 00:56:10 +02:00
e7773358a3
Update transcribe.py
...
added the ability to include HF access token in order to use PyAnnote models
2023-01-26 00:42:35 +02:00
6b2aa4ff3e
Merge pull request #1 from MahmoudAshraf97/patch-1
...
Update README.md
2023-01-26 00:37:38 +02:00
c3de5e9580
Update README.md
...
fixed model name
2023-01-26 00:36:29 +02:00
58d7191949
add diarize
2023-01-25 19:40:41 +00:00
286a2f2c14
clean up logic, use pandas where possibl
2023-01-25 18:42:52 +00:00
eec6d1f8d8
missing word timestamps
2023-01-24 16:37:19 +00:00
d1600e5b0f
Merge branch 'main' of https://github.com/m-bain/whisperX into main
...
Conflicts:
whisperx/transcribe.py
whisperx/utils.py
2023-01-24 15:38:05 +00:00
d395c21b83
new logic, diarization, vad filtering
2023-01-24 15:02:08 +00:00
ba102feb7f
vad filter
2023-01-20 12:54:20 +00:00
4569cb982a
fix file_ass display bug
...
sentence start time on .ass files had a bug where if the first word did not have a timestamp, it would set sentence start_time to 0, but this needs to be the local 0 not actual file 0 (i.e. it should be segment['start'])
2023-01-12 12:57:12 +00:00
ce281eb6f6
Merge pull request #28 from aosfatos/update/wav2vec2-large-xlsr-53-portuguese
...
Update Portuguese model to wav2vec2-large-xlsr-53-portuguese
2023-01-12 09:10:02 +00:00
7adead16e0
Update pt model to wav2vec2-large-xlsr-53-portuguese
2023-01-11 19:50:34 -03:00
a4edb130ef
Merge pull request #27 from FelippeChemello/main
...
Add PT (pt-br) align support
2023-01-11 15:35:15 +00:00
7459bf8ad0
Add PT (pt-br) align support
2023-01-11 12:11:41 -03:00
d51353a4b6
uncomment .ass
2023-01-08 18:02:36 +00:00
78c87d3bfd
handle negative / tiny duration segments, final
2023-01-08 14:01:10 +00:00
a6eb33778b
additional waveform segment check
2023-01-08 12:24:35 +00:00
857bcca238
Merge branch 'main' of https://github.com/m-bain/whisperX into main
2023-01-07 15:00:22 +00:00
44b62064f6
fix starting timestamp for multiple fail-to-aligned words
2023-01-07 14:59:11 +00:00
2aa074e0e6
remove duplicate line
2023-01-05 13:12:11 +00:00
5a668a7d80
fallback on whisper alignment failures, update readme
2023-01-05 11:15:19 +00:00
93d661f2e4
fix whisper hallucination outside of audio length
2022-12-29 10:54:23 +00:00
644b04e8d1
Merge pull request #13 from egorsmkv/patch-1
...
Add Ukrainian wav2vec2 model
2022-12-25 20:32:56 +00:00
97526f1111
Add Ukrainian wav2vec2 model
2022-12-24 15:05:13 +02:00
c912f96ed3
add examples.md
2022-12-23 00:50:32 +00:00