Binbin Zhang
8aa68ad750
[doc] rename conda env to wekws ( #77 )
2022-08-27 16:23:10 +08:00
Binbin Zhang
c9a262866f
[wekws] rename kws to wekws ( #76 )
* [wekws] rename kws to wekws
* fix lint
2022-08-27 11:57:44 +08:00
Wall.E
51f0fe6dc3
fixed the parameter transfer problem for criterion ( #75 )
* fixed the parameter transfer problem for criterion
Co-authored-by: yangyyt <yuntingyang@yuntingdeMacBook-Pro.local>
2022-07-13 23:50:48 +08:00
胡大炮
141d40704f
[fix bug] add optimizer.zero_grad() in kws/utils/executor.py ( #72 ) ( #73 )
* fix bug in kws/utils/executor.py (#72 )
* [fix bug] add zero_grad() above backward() in kws/utils/executor.py (#72 )
2022-06-05 22:39:26 +08:00
ryoha
41a3432198
fix export in export_onnx ( #71 )
2022-05-29 09:30:53 +08:00
Binbin Zhang
663a31d9ea
Update doc.yml ( #68 )
2022-04-14 16:06:02 +08:00
Cyan
015748b94e
learning rate no longer restarts from 0.001 when continuing training from checkpoint ( #67 )
* add .gitattributes
* add long wav
* fix some bugs
* updated lint error
* revert hi_xiaowen/run.sh to its original state
* remove the space
* better one
* remove 'num_keyword' parameter
* remove files
* flake8 check
* override the score and compute_det file
* remove defaultdict
* remove import defaultdict
* learning rate no longer restarts from 0.001 when continuing training from checkpoint
* fix intent bug with initial learning rate != 0.001
2022-04-14 16:02:18 +08:00
Cyan
7d142b9528
[examples] refactor FAR computation to support long audio test ( #64 )
* add .gitattributes
* add long wav
* fix some bugs
* updated lint error
* revert hi_xiaowen/run.sh to its original state
* remove the space
* better one
* remove 'num_keyword' parameter
* remove files
* flake8 check
* override the score and compute_det file
* remove defaultdict
* remove import defaultdict
2022-03-24 14:35:07 +08:00
Menglong Xu
ff4b47f94d
[kws] update cross_entropy loss ( #62 )
* [kws] update cross_entropy loss
replace nn.CrossEntropyLoss() with F.cross_entropy()
* format
2022-03-15 19:34:28 +08:00
Menglong Xu
66fcfa2ce5
[doc] add result on GSC dataset ( #61 )
* [examples] reset grad_clip
* [doc] add basic result of mdtc model
2022-02-13 19:53:19 +08:00
Menglong Xu
d805c55560
[examples] update to use torchrun launch ( #60 )
2022-02-11 14:51:00 +08:00
lxiao336
db2685d1a4
[tools] add a bash script that trims silence using sox and split-based multi-processing ( #56 )
Co-authored-by: hp <shawl336@163.com>
2022-01-15 13:54:44 +08:00
Binbin Zhang
57021924cb
[kws] support onnx export ( #53 )
2022-01-15 13:50:34 +08:00
Binbin Zhang
665df6113e
[doc] update qr code ( #55 )
2021-12-21 17:38:01 +08:00
Menglong Xu
f622c55b04
[kws] update parameter for plotting det curve ( #54 )
2021-12-17 20:52:45 +08:00
Menglong Xu
768900307a
[kws] add code for plotting det curve ( #52 )
* [kws] add code for plotting det curve
* format
* format
* format
* format
* [kws] add code for plotting det curve
format
format
format
format
* set xlim and ylim by parameter
* set xlim and ylim optional
* update help information
* update parser type
* Update run.sh
2021-12-16 18:21:04 +08:00
Menglong Xu
20891f90e6
Merge pull request #51 from wenet-e2e/binbin-activation
[kws] put activation in model, so the activation could be exported in…
2021-12-15 21:32:52 +08:00
Binbin Zhang
8943acb51f
[kws] put activation in model, so the activation could be exported in script model
2021-12-15 21:10:27 +08:00
Menglong Xu
6a58993390
[examples] update to use torchrun launch ( #50 )
2021-12-15 21:03:59 +08:00
Menglong Xu
566baca343
[examples] update ds_tcn config for hey_snips ( #49 )
2021-12-15 19:31:58 +08:00
Binbin Zhang
dc1ac8fecd
[examples] use big model for ds_tcn ( #47 )
2021-12-15 11:12:24 +08:00
Binbin Zhang
e3bfcf9f4e
[doc] add quantize result ( #46 )
2021-12-15 11:07:59 +08:00
Binbin Zhang
f86a797b10
[kws] add static quantize ( #44 )
* [kws] add static quantize
* refine lint error in shuffle_list.py
* refine lint
* fix topo
2021-12-14 14:32:54 +08:00
ZY
05bb5d1bdb
[doc] update readme ( #45 )
2021-12-14 09:38:57 +08:00
Binbin Zhang
a61db05ff4
[examples] add weight decay in aishell ( #43 )
2021-12-13 19:53:24 +08:00
Binbin Zhang
171309bd9e
[bin] use torchrun to launch ddp training ( #42 )
2021-12-13 19:45:55 +08:00
Binbin Zhang
28652a766f
[kws] support fuse and quantization ( #41 )
2021-12-13 19:16:50 +08:00
Binbin Zhang
fd255fd7c6
[examples] update spec aug parameters in hi xiaowen ( #40 )
2021-12-10 15:48:16 +08:00
Binbin Zhang
bc8d9f1c37
[examples] fix mdtc small config in hi_xiaowen ( #39 )
2021-12-09 17:53:34 +08:00
Menglong Xu
4a875776e5
[example] support hey_snips_kws_4.0 dataset ( #38 )
* [example] support hey_snips_kws_4.0 dataset
* format
* format
2021-12-08 23:46:05 +08:00
Menglong Xu
1eda27647b
[fix bug] add sigmoid() in score.py ( #37 )
The sigmoid() in kws/model/kws_model.py:KWSModel() was moved into kws/model/loss.py:max_pooling_loss()
To compute the posterior score correctly, the sigmoid() should also be added to kws/bin/score.py:main()
2021-12-08 13:41:20 +08:00
xiaohou
bd504c3cee
[bin] check torch script ( #36 )
* update run.sh
* [network] add code to check whether the model can be exported to torch script or not
2021-12-07 11:17:47 +08:00
xiaohou
afbc1d2960
[example] add testing code for speech command dataset ( #32 )
* update run.sh
* update run.sh
* rename test.py to compute_accuracy.py
* update run.sh
2021-12-07 10:56:30 +08:00
Binbin Zhang
b55ae111ae
[model] refactor tcn and ds_tcn share the same base class ( #35 )
2021-12-07 10:52:14 +08:00
Binbin Zhang
92a4c19ffe
[examples] use ds_tcn as default model ( #34 )
* [examples] use ds_tcn as default model
* fix scoring gpu id
2021-12-07 10:36:38 +08:00
Binbin Zhang
93bf93565e
[doc] add wechat group discuss ( #33 )
2021-12-06 22:00:54 +08:00
Binbin Zhang
c7c5bd3edc
[kws] refine tcn and ds_tcn, add batchnorm ( #31 )
* [kws] fix seed type
* [kws] refine tcn and ds_tcn, add batch norm
2021-12-06 17:24:48 +08:00
xiaohou
37f56db5af
[examples] add speechcommand train ( #30 )
* [example] added code for training speech command dataset
* update kws_model.py
* update kws_model.py
* format
* format
* add more comments to explain the new classifier designed for speech command classification task
* add copyright info
* update copyright info of classifier.py
2021-12-06 17:14:33 +08:00
xiaohou
8be4bef405
[examples] speech command data prepare ( #27 )
* [examples] added speech command data preparation code
* update
* update path.sh
2021-12-06 12:00:25 +08:00
Binbin Zhang
5241491e95
[kws] fix seed type ( #26 )
2021-12-06 11:38:41 +08:00
Binbin Zhang
a5a54782cc
[kws] use **kwargs for optim to reduce code ( #25 )
2021-12-06 11:23:44 +08:00
Menglong Xu
88444ab177
[examples] correct a spelling mistake ( #24 )
2021-12-05 21:24:06 +08:00
Binbin Zhang
8cfd4ed4f2
[kws] fix weight_decay key error ( #23 )
2021-12-04 17:19:46 +08:00
Binbin Zhang
dfe8b2536b
Revert "[recipe] support speech command dataset ( #21 )" ( #22 )
This reverts commit c48c959807e7e80cdd514be9bd019b16e3b816eb.
2021-12-04 13:55:58 +08:00
xiaohou
c48c959807
[recipe] support speech command dataset ( #21 )
* [recipe] support speech command dataset
* format
* format
* format
* update run.sh
2021-12-03 21:07:42 +08:00
Binbin Zhang
6416f9c7db
Merge pull request #20 from wenet-e2e/fix-jingyonghou
fix bug in checking kernel size, kernel size should be an odd number
2021-12-03 10:29:27 +08:00
jingyong hou
8ea11dc572
fix bug in checking kernel size, kernel size should be an odd number
2021-12-03 10:25:02 +08:00
Binbin Zhang
56bdce8775
[doc] add reference ( #18 )
* [doc] add reference
* [doc] refine format
* [doc] refine format
* fix url
2021-11-30 20:56:26 +08:00
Binbin Zhang
1909fcd360
Merge pull request #17 from wenet-e2e/dev_jingyonghou
[network] update mdtc.py to prevent possible errors and remove useless functions
2021-11-30 17:20:09 +08:00
xiaohou
5236d42800
Update mdtc.py
2021-11-30 17:14:30 +08:00