blessyyyu
255eec296a
flask8 examine
2022-03-23 18:41:03 +08:00
blessyyyu
07b7beabad
remove files
2022-03-23 18:32:01 +08:00
blessyyyu
db9fc7a738
remove 'num_keyword' parameter
2022-03-23 18:28:06 +08:00
blessyyyu
b2130d7458
better one
2022-03-23 17:53:48 +08:00
blessyyyu
1ebc3bff88
remove the space
2022-03-23 14:37:52 +08:00
blessyyyu
c2572d9abf
Merge branch 'master' of https://github.com/blessyyyu/wenet-kws
2022-03-23 14:32:04 +08:00
blessyyyu
538c1702ad
back the hi_xiaowen/run.sh to the same
2022-03-23 14:16:30 +08:00
Cyan
84a6843391
Merge branch 'wenet-e2e:master' into master
2022-03-23 14:05:09 +08:00
blessyyyu
17a67fe579
updated lint error
2022-03-23 14:04:00 +08:00
blessyyyu
074a501a82
fix some bugs
2022-03-23 11:14:52 +08:00
blessyyyu
c270cbe38f
add long wav
2022-03-22 17:14:48 +08:00
blessyyyu
e54cf59b56
add .gitattributes
2022-03-22 17:00:47 +08:00
Menglong Xu
ff4b47f94d
[kws] update cross_entropy loss ( #62 )
...
* [kws] update cross_entropy loss
replace nn.CrossEntropyLoss() with F.cross_entropy()
* format
2022-03-15 19:34:28 +08:00
Menglong Xu
66fcfa2ce5
[doc] add result on GSC dataset ( #61 )
...
* [examples] reset grad_clip
* [doc] add basic result of mdtc model
2022-02-13 19:53:19 +08:00
Menglong Xu
d805c55560
[examples] update to use torchrun launch ( #60 )
2022-02-11 14:51:00 +08:00
lxiao336
db2685d1a4
[tools] add a bash script that trimmes silence using sox and split-based multi-processing ( #56 )
...
Co-authored-by: hp <shawl336@163.com>
2022-01-15 13:54:44 +08:00
Binbin Zhang
57021924cb
[kws] support onnx export ( #53 )
2022-01-15 13:50:34 +08:00
Binbin Zhang
665df6113e
[doc] update qr code ( #55 )
2021-12-21 17:38:01 +08:00
Menglong Xu
f622c55b04
[kws] update parameter for plotting det curve ( #54 )
2021-12-17 20:52:45 +08:00
Menglong Xu
768900307a
[kws] add code for plotting det curve ( #52 )
...
* [kws] add code for plotting det curve
* format
* format
* format
* format
* [kws] add code for plotting det curve
format
format
format
format
* set xlim and ylim by parameter
* set xlim and ylim optional
* update help information
* update parser type
* Update run.sh
2021-12-16 18:21:04 +08:00
Menglong Xu
20891f90e6
Merge pull request #51 from wenet-e2e/binbin-activation
...
[kws] put activation in model, so the activation could be exported in…
2021-12-15 21:32:52 +08:00
Binbin Zhang
8943acb51f
[kws] put activation in model, so the activation could be exported in script model
2021-12-15 21:10:27 +08:00
Menglong Xu
6a58993390
[examples] update to use torchrun launch ( #50 )
2021-12-15 21:03:59 +08:00
Menglong Xu
566baca343
[examples] update ds_tcn config for hey_snips ( #49 )
2021-12-15 19:31:58 +08:00
Binbin Zhang
dc1ac8fecd
[examples] use big model for ds_tcn ( #47 )
2021-12-15 11:12:24 +08:00
Binbin Zhang
e3bfcf9f4e
[doc] add quantize result ( #46 )
2021-12-15 11:07:59 +08:00
Binbin Zhang
f86a797b10
[kws] add static quantize ( #44 )
...
* [kws] add static quantize
* refine lint error in shuffle_list.py
* refine lint
* fix topo
2021-12-14 14:32:54 +08:00
ZY
05bb5d1bdb
[doc] update readme ( #45 )
2021-12-14 09:38:57 +08:00
Binbin Zhang
a61db05ff4
[examples] add weight decay in aishell ( #43 )
2021-12-13 19:53:24 +08:00
Binbin Zhang
171309bd9e
[bin] use torchrun to launch ddp training ( #42 )
2021-12-13 19:45:55 +08:00
Binbin Zhang
28652a766f
[kws] support fuse and quantization ( #41 )
2021-12-13 19:16:50 +08:00
Binbin Zhang
fd255fd7c6
[examples] update spec aug parameters in hi xiaowen( #40 )
2021-12-10 15:48:16 +08:00
Binbin Zhang
bc8d9f1c37
[examples] fix mdtc small config in hi_xiaowen ( #39 )
2021-12-09 17:53:34 +08:00
Menglong Xu
4a875776e5
[example] support hey_snips_kws_4.0 dataset ( #38 )
...
* [example] support hey_snips_kws_4.0 dataset
* format
* format
2021-12-08 23:46:05 +08:00
Menglong Xu
1eda27647b
[fix bug] add sigmoid() in score.py ( #37 )
...
The sigmoid() in kws/model/kws_model.py:KWSModel() was moved into kws/model/loss.py:max_pooling_loss()
To compute the posterior score correctly, the sigmoid() should also be added to kws/bin/score.py:main()
2021-12-08 13:41:20 +08:00
xiaohou
bd504c3cee
[bin] check torch script ( #36 )
...
* update run.sh
* [network] add code to check whether the model can be exported to torch script or not
2021-12-07 11:17:47 +08:00
xiaohou
afbc1d2960
[example] add testing code for speech command dataset ( #32 )
...
* update run.sh
* update run.sh
* rename test.py to compute_accuracy.py
* update run,sh
2021-12-07 10:56:30 +08:00
Binbin Zhang
b55ae111ae
[model] refactor tcn and ds_tcn share the same base class ( #35 )
2021-12-07 10:52:14 +08:00
Binbin Zhang
92a4c19ffe
[examples] use ds_tcn as default model ( #34 )
...
* [examples] use ds_tcn as default model
* fix scoring gpu id
2021-12-07 10:36:38 +08:00
Binbin Zhang
93bf93565e
[doc] add wechat group discuss ( #33 )
2021-12-06 22:00:54 +08:00
Binbin Zhang
c7c5bd3edc
[kws] refine tcn and ds_tcp, add batchnorm ( #31 )
...
* [kws] fix seed type
* [kws] refine tcn and ds_tcn, add batch norm
2021-12-06 17:24:48 +08:00
xiaohou
37f56db5af
[exampels] add speechcommand train ( #30 )
...
* [example] added code for training speech command dataset
* update kes_model.py
* update kes_model.py
* format
* format
* add more comments to explain the new classifier designed for speech command classification task
* add copyrigh info
* update copyrigh info of classifier.py
2021-12-06 17:14:33 +08:00
xiaohou
8be4bef405
[examples] speech command data prepare ( #27 )
...
* [examples] added speech command data preparation code
* update
* updata path.sh
2021-12-06 12:00:25 +08:00
Binbin Zhang
5241491e95
[kws] fix seed type ( #26 )
2021-12-06 11:38:41 +08:00
Binbin Zhang
a5a54782cc
[kws] use **kvargs for optim to reduce code ( #25 )
2021-12-06 11:23:44 +08:00
Menglong Xu
88444ab177
[examples] correct a spelling mistake ( #24 )
2021-12-05 21:24:06 +08:00
Binbin Zhang
8cfd4ed4f2
[kws] fix weight_decay key error ( #23 )
2021-12-04 17:19:46 +08:00
Binbin Zhang
dfe8b2536b
Revert "[recipe] suport speech command dataset ( #21 )" ( #22 )
...
This reverts commit c48c959807e7e80cdd514be9bd019b16e3b816eb.
2021-12-04 13:55:58 +08:00
xiaohou
c48c959807
[recipe] suport speech command dataset ( #21 )
...
* [recipe] suport speech command dataset
* format
* format
* format
* update run.sh
2021-12-03 21:07:42 +08:00
Binbin Zhang
6416f9c7db
Merge pull request #20 from wenet-e2e/fix-jingyonghou
...
fix bug in checking kernel size, kernel size should be a odd number
2021-12-03 10:29:27 +08:00