marcoyang
/

icefall-multi-kd-pretrain-amp-fp16


icefall-multi-kd-pretrain-amp-fp16 / notes.txt

yangxiaoyu6

add files

2ad1ea3 about 1 year ago



	The two experiments use the same configuration, except for max-duration (md).
	The md=1000 experiment has better pre-training performance.
	Both experiments use fp16.