wekws

AI/wekws

Author	SHA1	Message	Date
Binbin Zhang	8aa68ad750	[doc] rename conda env to wekws (#77 )	2022-08-27 16:23:10 +08:00
Binbin Zhang	c9a262866f	[wekws] rename kws to wekws (#76 ) * [wekws] rename kws to wekws * fix lint	2022-08-27 11:57:44 +08:00
Wall.E	51f0fe6dc3	fixed the parameter transfer problem for criterion (#75 ) * fixed the parameter transfer problem for criterion Co-authored-by: yangyyt <yuntingyang@yuntingdeMacBook-Pro.local>	2022-07-13 23:50:48 +08:00
胡大炮	141d40704f	[fix bug] add optimizer.zero_grad() in kws/utils/executor.py (#72 ) (#73 ) * fix bug in kws/utils/executor.py (#72) * [fix bug] add zero_grad() above backward() in kws/utils/executor.py (#72)	2022-06-05 22:39:26 +08:00
ryoha	41a3432198	fix export in export_onnx (#71 )	2022-05-29 09:30:53 +08:00
Binbin Zhang	663a31d9ea	Update doc.yml (#68 )	2022-04-14 16:06:02 +08:00
Cyan	015748b94e	learning rate won't initiate from 0.001 when continuing training from checkpoint (#67 ) * add .gitattributes * add long wav * fix some bugs * updated lint error * back the hi_xiaowen/run.sh to the same * remove the space * better one * remove 'num_keyword' parameter * remove files * flask8 examine * override the score and compute_det file * remove defaultdict * remove import defaultdict * learning rate won't initial from 0.001 when continuing training from checkpoint * fix intent bug with initial learning rate != 0.001	2022-04-14 16:02:18 +08:00
Cyan	7d142b9528	[examples] refactor FAR computation to support long audio test (#64 ) * add .gitattributes * add long wav * fix some bugs * updated lint error * back the hi_xiaowen/run.sh to the same * remove the space * better one * remove 'num_keyword' parameter * remove files * flask8 examine * override the score and compute_det file * remove defaultdict * remove import defaultdict	2022-03-24 14:35:07 +08:00
Menglong Xu	ff4b47f94d	[kws] update cross_entropy loss (#62 ) * [kws] update cross_entropy loss replace nn.CrossEntropyLoss() with F.cross_entropy() * format	2022-03-15 19:34:28 +08:00
Menglong Xu	66fcfa2ce5	[doc] add result on GSC dataset (#61 ) * [examples] reset grad_clip * [doc] add basic result of mdtc model	2022-02-13 19:53:19 +08:00
Menglong Xu	d805c55560	[examples] update to use torchrun launch (#60 )	2022-02-11 14:51:00 +08:00
lxiao336	db2685d1a4	[tools] add a bash script that trimmes silence using sox and split-based multi-processing (#56 ) Co-authored-by: hp <shawl336@163.com>	2022-01-15 13:54:44 +08:00
Binbin Zhang	57021924cb	[kws] support onnx export (#53 )	2022-01-15 13:50:34 +08:00
Binbin Zhang	665df6113e	[doc] update qr code (#55 )	2021-12-21 17:38:01 +08:00
Menglong Xu	f622c55b04	[kws] update parameter for plotting det curve (#54 )	2021-12-17 20:52:45 +08:00
Menglong Xu	768900307a	[kws] add code for plotting det curve (#52 ) * [kws] add code for plotting det curve * format * format * format * format * [kws] add code for plotting det curve format format format format * set xlim and ylim by parameter * set xlim and ylim optional * update help information * update parser type * Update run.sh	2021-12-16 18:21:04 +08:00
Menglong Xu	20891f90e6	Merge pull request #51 from wenet-e2e/binbin-activation [kws] put activation in model, so the activation could be exported in…	2021-12-15 21:32:52 +08:00
Binbin Zhang	8943acb51f	[kws] put activation in model, so the activation could be exported in script model	2021-12-15 21:10:27 +08:00
Menglong Xu	6a58993390	[examples] update to use torchrun launch (#50 )	2021-12-15 21:03:59 +08:00
Menglong Xu	566baca343	[examples] update ds_tcn config for hey_snips (#49 )	2021-12-15 19:31:58 +08:00
Binbin Zhang	dc1ac8fecd	[examples] use big model for ds_tcn (#47 )	2021-12-15 11:12:24 +08:00
Binbin Zhang	e3bfcf9f4e	[doc] add quantize result (#46 )	2021-12-15 11:07:59 +08:00
Binbin Zhang	f86a797b10	[kws] add static quantize (#44 ) * [kws] add static quantize * refine lint error in shuffle_list.py * refine lint * fix topo	2021-12-14 14:32:54 +08:00
ZY	05bb5d1bdb	[doc] update readme (#45 )	2021-12-14 09:38:57 +08:00
Binbin Zhang	a61db05ff4	[examples] add weight decay in aishell (#43 )	2021-12-13 19:53:24 +08:00
Binbin Zhang	171309bd9e	[bin] use torchrun to launch ddp training (#42 )	2021-12-13 19:45:55 +08:00
Binbin Zhang	28652a766f	[kws] support fuse and quantization (#41 )	2021-12-13 19:16:50 +08:00
Binbin Zhang	fd255fd7c6	[examples] update spec aug parameters in hi xiaowen(#40 )	2021-12-10 15:48:16 +08:00
Binbin Zhang	bc8d9f1c37	[examples] fix mdtc small config in hi_xiaowen (#39 )	2021-12-09 17:53:34 +08:00
Menglong Xu	4a875776e5	[example] support hey_snips_kws_4.0 dataset (#38 ) * [example] support hey_snips_kws_4.0 dataset * format * format	2021-12-08 23:46:05 +08:00
Menglong Xu	1eda27647b	[fix bug] add sigmoid() in score.py (#37 ) The sigmoid() in kws/model/kws_model.py:KWSModel() was moved into kws/model/loss.py:max_pooling_loss() To compute the posterior score correctly, the sigmoid() should also be added to kws/bin/score.py:main()	2021-12-08 13:41:20 +08:00
xiaohou	bd504c3cee	[bin] check torch script (#36 ) * update run.sh * [network] add code to check whether the model can be exported to torch script or not	2021-12-07 11:17:47 +08:00
xiaohou	afbc1d2960	[example] add testing code for speech command dataset (#32 ) * update run.sh * update run.sh * rename test.py to compute_accuracy.py * update run,sh	2021-12-07 10:56:30 +08:00
Binbin Zhang	b55ae111ae	[model] refactor tcn and ds_tcn share the same base class (#35 )	2021-12-07 10:52:14 +08:00
Binbin Zhang	92a4c19ffe	[examples] use ds_tcn as default model (#34 ) * [examples] use ds_tcn as default model * fix scoring gpu id	2021-12-07 10:36:38 +08:00
Binbin Zhang	93bf93565e	[doc] add wechat group discuss (#33 )	2021-12-06 22:00:54 +08:00
Binbin Zhang	c7c5bd3edc	[kws] refine tcn and ds_tcp, add batchnorm (#31 ) * [kws] fix seed type * [kws] refine tcn and ds_tcn, add batch norm	2021-12-06 17:24:48 +08:00
xiaohou	37f56db5af	[exampels] add speechcommand train (#30 ) * [example] added code for training speech command dataset * update kes_model.py * update kes_model.py * format * format * add more comments to explain the new classifier designed for speech command classification task * add copyrigh info * update copyrigh info of classifier.py	2021-12-06 17:14:33 +08:00
xiaohou	8be4bef405	[examples] speech command data prepare (#27 ) * [examples] added speech command data preparation code * update * updata path.sh	2021-12-06 12:00:25 +08:00
Binbin Zhang	5241491e95	[kws] fix seed type (#26 )	2021-12-06 11:38:41 +08:00
Binbin Zhang	a5a54782cc	[kws] use **kvargs for optim to reduce code (#25 )	2021-12-06 11:23:44 +08:00
Menglong Xu	88444ab177	[examples] correct a spelling mistake (#24 )	2021-12-05 21:24:06 +08:00
Binbin Zhang	8cfd4ed4f2	[kws] fix weight_decay key error (#23 )	2021-12-04 17:19:46 +08:00
Binbin Zhang	dfe8b2536b	Revert "[recipe] suport speech command dataset (#21 )" (#22 ) This reverts commit c48c959807e7e80cdd514be9bd019b16e3b816eb.	2021-12-04 13:55:58 +08:00
xiaohou	c48c959807	[recipe] suport speech command dataset (#21 ) * [recipe] suport speech command dataset * format * format * format * update run.sh	2021-12-03 21:07:42 +08:00
Binbin Zhang	6416f9c7db	Merge pull request #20 from wenet-e2e/fix-jingyonghou fix bug in checking kernel size, kernel size should be a odd number	2021-12-03 10:29:27 +08:00
jingyong hou	8ea11dc572	fix bug in checking kernel size, kernel size should be a odd number	2021-12-03 10:25:02 +08:00
Binbin Zhang	56bdce8775	[doc] add reference (#18 ) * [doc] add reference * [doc] refine format * [doc] refine format * fix url	2021-11-30 20:56:26 +08:00
Binbin Zhang	1909fcd360	Merge pull request #17 from wenet-e2e/dev_jingyonghou [network] update mdtc.py to prevent possible errors and remove useless functions	2021-11-30 17:20:09 +08:00
xiaohou	5236d42800	Update mdtc.py	2021-11-30 17:14:30 +08:00

1 2

82 Commits