Commit Graph

  • 53396adb21 add device_index Simon 2023-05-20 13:02:46 +02:00
  • 63fb5fc46f Suggest using pytorch-cuda 11.8 instead of 11.7 Tijs Zwinkels 2023-05-16 12:07:09 +02:00
  • d8a2b4ffc9 Merge pull request #246 from m-bain/v3 (tag: v3.1.1) Max Bain 2023-05-13 12:18:09 +01:00
  • 9ffb7e7a23 Merge branch 'v3' of https://github.com/m-bain/whisperX into v3 Max Bain 2023-05-13 12:16:33 +01:00
  • fd8f1003cf add translate, fix word_timestamp error Max Bain 2023-05-13 12:14:06 +01:00
  • 46b416296f Merge pull request #123 from koldbrandt/danish_alignment Max Bain 2023-05-09 23:10:24 +01:00
  • 7642390d0a Merge branch 'main' into danish_alignment Max Bain 2023-05-09 23:10:13 +01:00
  • 8b05ad4dae Merge pull request #235 from sorgfresser/main Max Bain 2023-05-09 23:05:02 +01:00
  • 5421f1d7ca remove v3 tag on pip install Max Bain 2023-05-09 13:42:50 +01:00
  • 91e959ec4f Merge branch 'm-bain:main' into main Simon 2023-05-08 20:46:25 +02:00
  • eabf35dff0 Custom result types Simon 2023-05-08 20:45:34 +02:00
  • 4919ad21fc Merge pull request #233 from sorgfresser/main Max Bain 2023-05-08 19:05:47 +01:00
  • b50aafb17b Fix tuple unpacking Simon 2023-05-08 20:03:42 +02:00
  • 2efa136114 update python usage example Max Bain 2023-05-08 17:20:38 +01:00
  • 0b839f3f01 Update README.md Max Bain 2023-05-07 20:36:08 +01:00
  • 1caddfb564 Merge pull request #225 from m-bain/v3 (tag: v3.1.0) Max Bain 2023-05-07 20:31:16 +01:00
  • 7ad554c64f Merge branch 'main' into v3 Max Bain 2023-05-07 20:30:57 +01:00
  • 4603f010a5 update readme, setup, add option to return char_timestamps Max Bain 2023-05-07 20:28:33 +01:00
  • 24008aa1ed fix long segments, break into sentences using nltk, improve align logic, improve diarize (sentence-based) Max Bain 2023-05-07 15:32:58 +01:00
  • 07361ba1d7 add device to dia pipeline @sorgfresser Max Bain 2023-05-05 11:53:51 +01:00
  • 4e2ac4e4e9 torch2.0, remove compile for now, round to times to 3 decimal (tag: v3.0.2) Max Bain 2023-05-04 20:38:13 +01:00
  • d2116b98ca Merge pull request #210 from sorgfresser/v3 Max Bain 2023-05-04 20:32:06 +01:00
  • d8f0ef4a19 Set diarization device manually Simon 2023-05-04 16:25:34 +02:00
  • 1b62c61c71 Merge pull request #216 from aramlang/blank_id-fix Max Bain 2023-05-04 01:13:23 +01:00
  • 2d59eb9726 Add torch compile to log mel spectrogram Simon 2023-05-03 23:17:44 +02:00
  • cb53661070 Enable Hebrew support aramlang 2023-05-03 11:26:12 -05:00
  • 2a6830492c Fix pyannote to specific commit Simon 2023-05-02 20:25:56 +02:00
  • da3aabe181 Merge branch 'm-bain:v3' into v3 Simon 2023-05-02 18:55:43 +02:00
  • 067189248f Use pyannote develop branch and torch version 2 Simon 2023-05-02 18:44:43 +02:00
  • b666523004 add v3 pre-release comment, and v4 progress update Max Bain 2023-05-02 15:10:40 +01:00
  • 69e038cbc4 Merge pull request #209 from SohaibAnwaar/feat-dockerfile Max Bain 2023-05-02 14:55:30 +01:00
  • 9fb51412c0 Merge pull request #208 from arnavmehta7/patch-1 Max Bain 2023-05-02 10:55:13 +01:00
  • a693a779fa feat: adding the docker file sohaibanwaar 2023-05-02 13:28:20 +05:00
  • 64ca208cc8 Fixed the word_start variable not initialized bug. Arnav Mehta 2023-05-02 13:13:02 +05:30
  • 5becc99e56 Version bump pyannote, pytorch Simon 2023-05-01 13:47:41 +02:00
  • e24ca9e0a2 Merge pull request #205 from prashanthellina/v3-fix-diarization (tag: v3.0.1) Max Bain 2023-04-30 21:08:45 +01:00
  • 601c91140f references #202, attempt to fix speaker diarization failing in v3 Prashanth Ellina 2023-04-30 17:33:24 +00:00
  • 31a9ec7466 Merge pull request #204 from sorgfresser/v3 Max Bain 2023-04-30 18:29:46 +01:00
  • b9c8c5072b Pad language detection if audio is too short Simon 2023-04-30 18:34:18 +02:00
  • a903e57cf1 Merge pull request #199 from thomasmol/v3 Max Bain 2023-04-29 23:35:42 +01:00
  • cb176a186e added num_workers to fix pickling error Thomas Mol 2023-04-29 19:51:05 +02:00
  • 5b85c5433f Update setup.py Max Bain 2023-04-28 16:47:04 +01:00
  • cc7e168d2b add checkout command (tag: v3.0.0) m-bain 2023-04-25 12:14:23 +01:00
  • db97f29678 update pip install m-bain 2023-04-25 11:19:23 +01:00
  • 25be8210e5 add v3 tag for install m-bain 2023-04-25 10:07:34 +01:00
  • 0efad26066 pass compute_type Max Bain 2023-04-24 21:26:44 +01:00
  • 2a29f0ec6a add compute types Max Bain 2023-04-24 21:24:22 +01:00
  • 558d980535 v3 init Max Bain 2023-04-24 21:08:43 +01:00
  • da458863d7 allow custom model_dir for torchaudio models (tag: v2.0.1) Max Bain 2023-04-14 21:40:36 +01:00
  • cf252a8592 allow custom path for vad model Max Bain 2023-04-14 15:02:58 +01:00
  • 6a72b61564 clamp end_timestamp to prevent infinite loop m-bain 2023-04-11 20:15:37 +01:00
  • 48ed89834e Merge pull request #169 from invisprints/v2-opt-load-model m-bain 2023-04-09 13:39:13 +01:00
  • bb15c9428f opti the inference loop invisprints 2023-04-09 15:58:55 +08:00
  • 9482d324d0 Merge pull request #162 from dev-nomi/cli_argument_type m-bain 2023-04-05 13:40:04 -07:00
  • 4146e56d5b Added vad_filter type dev-nomi 2023-04-05 17:11:29 +05:00
  • 118e7deedb Merge pull request #161 from diasks2/fix_typo m-bain 2023-04-04 19:00:18 -07:00
  • 70a4a0a25c Fix typo Kevin Dias 2023-04-05 10:50:49 +09:00
  • 40948a3d00 fix whisper version to 20230314 for no breaking m-bain 2023-04-04 12:42:34 -07:00
  • c8be6ac94d update python example m-bain 2023-04-03 12:18:31 -07:00
  • a582a59493 mkdir for torch cache in case it doesnt exist m-bain 2023-04-01 13:05:40 -07:00
  • 861379edc3 Merge pull request #157 from Ryan5453/fix/whisper-req m-bain 2023-03-31 16:40:19 -07:00
  • 4af345434a Update requirements.txt Ryan 2023-03-31 19:36:38 -04:00
  • 634799b3be hf token only for diarization m-bain 2023-03-31 16:15:40 -07:00
  • 189aeac83e v2 lets goo (tag: v2.0.0) Max Bain 2023-04-01 00:10:45 +01:00
  • bc2776017e v2 lets go Max Bain 2023-04-01 00:09:29 +01:00
  • 11a78d7ced handle tmp wav file better Max Bain 2023-04-01 00:06:40 +01:00
  • b9ca701d69 .wav conversion, handle audio with no detected speech Max Bain 2023-03-31 23:02:38 +01:00
  • d0fa028045 fix tfile naming Max Bain 2023-03-30 19:24:42 +01:00
  • ae4a9de307 add vad model external dl Max Bain 2023-03-30 18:57:55 +01:00
  • 18b63d46e2 skeleton v2 Max Bain 2023-03-30 05:31:57 +01:00
  • 1e7c2c337b Merge pull request #148 from FernanOrtega/main m-bain 2023-03-24 07:57:43 -07:00
  • 33dd3b9bcd Update decoding.py Fernando O. Gallego 2023-03-24 11:56:41 +01:00
  • d1b4ff8228 Merge pull request #114 from mshakirDr/patch-1 m-bain 2023-03-23 15:12:09 -07:00
  • d31f6e0b8a Merge branch 'm-bain:main' into danish_alignment Marcus Brandt 2023-03-06 10:52:47 +01:00
  • 809700e286 remove soundfile version constraint m-bain 2023-03-06 00:20:31 +00:00
  • cea42ca470 Fix hugging face error Muhammad Shakir 2023-03-04 19:12:13 +01:00
  • c8404d9805 added a danish alignment model Marcus Brandt 2023-03-04 13:20:40 +01:00
  • d1d420e70c Merge pull request #111 from Barabazs/patch-1 m-bain 2023-03-04 11:46:57 +00:00
  • 844eb30710 fix: force soundfile version update for mp3 support Barabazs 2023-03-04 11:01:26 +01:00
  • 31e6fe7e36 Merge pull request #107 from JCGoran/fix/python3.7_compatibility m-bain 2023-03-02 15:31:36 +00:00
  • cfcede41f6 Added Python 3.7 compatibility JCGoran 2023-03-02 15:09:02 +01:00
  • 186b06e032 paper drop m-bain 2023-03-02 12:04:16 +00:00
  • 847a3cd85b Merge pull request #96 from smly/fix-batch-processing (tag: v1.0.0) m-bain 2023-02-22 12:11:01 +00:00
  • 2b1ffa12b8 Merge pull request #97 from smly/gpu-vad-filter m-bain 2023-02-21 18:57:14 +00:00
  • 57f5957e0e Pass device to pyannote.audio.Inference smly 2023-02-22 03:48:20 +09:00
  • 27fe502344 Fix assertion error in batch processing smly 2023-02-22 02:45:13 +09:00
  • f7093e60d3 Merge pull request #90 from Pikauba/translation_starting_point_improvement m-bain 2023-02-18 21:59:57 +00:00
  • a1d2229416 Improvement to transcription starting point with VAD Antoine Dufour 2023-02-18 11:12:23 -05:00
  • 4cb167a225 Merge pull request #74 from Camb-ai/level-bug-fix m-bain 2023-02-14 19:22:22 +00:00
  • 2e307814dd added if clause for checking arnavmehta7 2023-02-10 14:48:51 +05:30
  • d687cf3358 Merge pull request #58 from MahmoudAshraf97/main m-bain 2023-02-01 22:11:51 +00:00
  • 0a3fd11562 update readme Max Bain 2023-02-01 22:09:11 +00:00
  • 29e95b746b Merge pull request #57 from TengdaHan/main m-bain 2023-02-01 20:37:54 +00:00
  • 039af89a86 support batch processing Tengda Han 2023-02-01 19:41:20 +00:00
  • 9f26112d5c added turkish wav2vec2 model Mahmoud Ashraf 2023-02-01 21:38:50 +02:00
  • fd2a093754 Merge pull request #55 from jonatasgrosman/main m-bain 2023-02-01 10:27:45 +00:00
  • 31f069752f Merge pull request #53 from MahmoudAshraf97/main m-bain 2023-02-01 10:27:25 +00:00
  • 4cdf7ef856 Merge pull request #48 from Barabazs/main m-bain 2023-02-01 10:26:58 +00:00
  • d294e29ad9 fix: error when loading huggingface model with embedded language model Jonatas Grosman 2023-01-31 23:24:26 -03:00
  • 0eae9e1f50 added several wav2vec2 models by jonatasgrosman Mahmoud Ashraf 2023-02-01 03:02:10 +02:00