91 Commits

Author SHA1 Message Date
彭震东
4bacb81f7f
[android] add build.gradle (#89) 2022-09-12 19:09:44 +08:00
彭震东
508938f537
[android] add build.gradle and rename model name (#88) 2022-09-12 17:50:05 +08:00
Binbin Zhang
9f29e033aa
[examples] remove static quantization (#87) 2022-09-12 15:49:28 +08:00
彭震东
0d9237b8c0
[runtime/android] add android runtime (#83)
* [android] init android runtime

* [android] add voice rectangle view

* [android] finished

* [android] fix lint
2022-09-07 15:25:45 +08:00
Binbin Zhang
1ad3102c8c
[fix] fix mdtc training cache (#82) 2022-09-01 18:25:43 +08:00
Binbin Zhang
490a474d4e
[fix] fix training and export error (#81) 2022-08-28 16:49:24 +08:00
Binbin Zhang
50354a38e0
[fix/runtime] fix topo error (#80) 2022-08-28 16:35:19 +08:00
Binbin Zhang
53d7b8f807
[runtime/onnxruntime] add onnxruntime support (#79)
* [runtime/onnxruntime] add onnxruntime support

* add cpplint and clang-format

* fix lint
2022-08-28 13:35:21 +08:00
Binbin Zhang
5037d51ed9
[wekws] add cache support (#78) 2022-08-27 16:44:22 +08:00
Binbin Zhang
8aa68ad750
[doc] rename conda env to wekws (#77) 2022-08-27 16:23:10 +08:00
Binbin Zhang
c9a262866f
[wekws] rename kws to wekws (#76)
* [wekws] rename kws to wekws

* fix lint
2022-08-27 11:57:44 +08:00
Wall.E
51f0fe6dc3
fixed the parameter transfer problem for criterion (#75)
* fixed the parameter transfer problem for criterion

Co-authored-by: yangyyt <yuntingyang@yuntingdeMacBook-Pro.local>
2022-07-13 23:50:48 +08:00
胡大炮
141d40704f
[fix bug] add optimizer.zero_grad() in kws/utils/executor.py (#72) (#73)
* fix bug in kws/utils/executor.py (#72)

* [fix bug] add zero_grad() above backward() in kws/utils/executor.py (#72)
2022-06-05 22:39:26 +08:00
ryoha
41a3432198
fix export in export_onnx (#71) 2022-05-29 09:30:53 +08:00
Binbin Zhang
663a31d9ea
Update doc.yml (#68) 2022-04-14 16:06:02 +08:00
Cyan
015748b94e
learning rate no longer restarts from 0.001 when continuing training from a checkpoint (#67)
* add .gitattributes

* add long wav

* fix some bugs

* updated lint error

* revert hi_xiaowen/run.sh to the original

* remove the space

* better one

* remove 'num_keyword' parameter

* remove files

* flake8 check

* override the score and compute_det file

* remove defaultdict

* remove import defaultdict

* learning rate no longer restarts from 0.001 when continuing training from a checkpoint

* fix intent bug with initial learning rate != 0.001
2022-04-14 16:02:18 +08:00
Cyan
7d142b9528
[examples] refactor FAR computation to support long audio test (#64)
* add .gitattributes

* add long wav

* fix some bugs

* updated lint error

* revert hi_xiaowen/run.sh to the original

* remove the space

* better one

* remove 'num_keyword' parameter

* remove files

* flake8 check

* override the score and compute_det file

* remove defaultdict

* remove import defaultdict
2022-03-24 14:35:07 +08:00
Menglong Xu
ff4b47f94d
[kws] update cross_entropy loss (#62)
* [kws] update cross_entropy loss

replace nn.CrossEntropyLoss() with F.cross_entropy()

* format
2022-03-15 19:34:28 +08:00
Menglong Xu
66fcfa2ce5
[doc] add result on GSC dataset (#61)
* [examples] reset grad_clip

* [doc] add basic result of mdtc model
2022-02-13 19:53:19 +08:00
Menglong Xu
d805c55560
[examples] update to use torchrun launch (#60) 2022-02-11 14:51:00 +08:00
lxiao336
db2685d1a4
[tools] add a bash script that trims silence using sox and split-based multi-processing (#56)
Co-authored-by: hp <shawl336@163.com>
2022-01-15 13:54:44 +08:00
Binbin Zhang
57021924cb
[kws] support onnx export (#53) 2022-01-15 13:50:34 +08:00
Binbin Zhang
665df6113e
[doc] update qr code (#55) 2021-12-21 17:38:01 +08:00
Menglong Xu
f622c55b04
[kws] update parameter for plotting det curve (#54) 2021-12-17 20:52:45 +08:00
Menglong Xu
768900307a
[kws] add code for plotting det curve (#52)
* [kws] add code for plotting det curve

* format

* format

* format

* format

* [kws] add code for plotting det curve

format

format

format

format

* set xlim and ylim by parameter

* set xlim and ylim optional

* update help information

* update parser type

* Update run.sh
2021-12-16 18:21:04 +08:00
Menglong Xu
20891f90e6
Merge pull request #51 from wenet-e2e/binbin-activation
[kws] put activation in model, so the activation could be exported in…
2021-12-15 21:32:52 +08:00
Binbin Zhang
8943acb51f
[kws] put activation in model, so the activation could be exported in script model 2021-12-15 21:10:27 +08:00
Menglong Xu
6a58993390
[examples] update to use torchrun launch (#50) 2021-12-15 21:03:59 +08:00
Menglong Xu
566baca343
[examples] update ds_tcn config for hey_snips (#49) 2021-12-15 19:31:58 +08:00
Binbin Zhang
dc1ac8fecd
[examples] use big model for ds_tcn (#47) 2021-12-15 11:12:24 +08:00
Binbin Zhang
e3bfcf9f4e
[doc] add quantize result (#46) 2021-12-15 11:07:59 +08:00
Binbin Zhang
f86a797b10
[kws] add static quantize (#44)
* [kws] add static quantize

* refine lint error in shuffle_list.py

* refine lint

* fix topo
2021-12-14 14:32:54 +08:00
ZY
05bb5d1bdb
[doc] update readme (#45) 2021-12-14 09:38:57 +08:00
Binbin Zhang
a61db05ff4
[examples] add weight decay in aishell (#43) 2021-12-13 19:53:24 +08:00
Binbin Zhang
171309bd9e
[bin] use torchrun to launch ddp training (#42) 2021-12-13 19:45:55 +08:00
Binbin Zhang
28652a766f
[kws] support fuse and quantization (#41) 2021-12-13 19:16:50 +08:00
Binbin Zhang
fd255fd7c6
[examples] update spec aug parameters in hi_xiaowen (#40) 2021-12-10 15:48:16 +08:00
Binbin Zhang
bc8d9f1c37
[examples] fix mdtc small config in hi_xiaowen (#39) 2021-12-09 17:53:34 +08:00
Menglong Xu
4a875776e5
[example] support hey_snips_kws_4.0 dataset (#38)
* [example] support hey_snips_kws_4.0 dataset

* format

* format
2021-12-08 23:46:05 +08:00
Menglong Xu
1eda27647b
[fix bug] add sigmoid() in score.py (#37)
The sigmoid() in kws/model/kws_model.py:KWSModel() was moved into kws/model/loss.py:max_pooling_loss()
To compute the posterior score correctly, the sigmoid() should also be added to kws/bin/score.py:main()
2021-12-08 13:41:20 +08:00
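The reasoning in the commit above (#37) can be illustrated with a minimal sketch. It is not the code from kws/bin/score.py; the helper names and the sample values are hypothetical, and taking the maximum over frames as the utterance-level score is an assumption. It only shows why raw model outputs need a sigmoid before they can be read as posterior scores once the model itself no longer applies one.

```python
import math


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def posterior_scores(logits):
    """Map raw per-frame, per-keyword outputs (logits) to values in (0, 1)."""
    return [[sigmoid(v) for v in frame] for frame in logits]


def keyword_score(posteriors, keyword_index):
    """Assumed utterance-level score: the keyword's highest posterior over all frames."""
    return max(frame[keyword_index] for frame in posteriors)


# Hypothetical raw outputs: 3 frames, 1 keyword.
logits = [[-2.0], [0.0], [3.0]]
posteriors = posterior_scores(logits)
score = keyword_score(posteriors, 0)
```

Without the sigmoid step, the scores would be unbounded logits, so a detection threshold chosen on a 0 to 1 scale would no longer be meaningful.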
xiaohou
bd504c3cee
[bin] check torch script (#36)
* update run.sh

* [network] add code to check whether the model can be exported to torch script or not
2021-12-07 11:17:47 +08:00
xiaohou
afbc1d2960
[example] add testing code for speech command dataset (#32)
* update run.sh

* update run.sh

* rename test.py to compute_accuracy.py

* update run.sh
2021-12-07 10:56:30 +08:00
Binbin Zhang
b55ae111ae
[model] refactor tcn and ds_tcn share the same base class (#35) 2021-12-07 10:52:14 +08:00
Binbin Zhang
92a4c19ffe
[examples] use ds_tcn as default model (#34)
* [examples] use ds_tcn as default model

* fix scoring gpu id
2021-12-07 10:36:38 +08:00
Binbin Zhang
93bf93565e
[doc] add wechat group discuss (#33) 2021-12-06 22:00:54 +08:00
Binbin Zhang
c7c5bd3edc
[kws] refine tcn and ds_tcn, add batchnorm (#31)
* [kws] fix seed type

* [kws] refine tcn and ds_tcn, add batch norm
2021-12-06 17:24:48 +08:00
xiaohou
37f56db5af
[examples] add speechcommand train (#30)
* [example] added code for training speech command dataset

* update kws_model.py

* update kws_model.py

* format

* format

* add more comments to explain the new classifier designed for speech command classification task

* add copyright info

* update copyright info of classifier.py
2021-12-06 17:14:33 +08:00
xiaohou
8be4bef405
[examples] speech command data prepare (#27)
* [examples] added speech command data preparation code

* update

* update path.sh
2021-12-06 12:00:25 +08:00
Binbin Zhang
5241491e95
[kws] fix seed type (#26) 2021-12-06 11:38:41 +08:00
Binbin Zhang
a5a54782cc
[kws] use **kwargs for optim to reduce code (#25) 2021-12-06 11:23:44 +08:00