Xiao Shi Huang et al.: Improving Transformer Optimization Through Better Initialization. (2020)conf/icml/HuangPBV20Improving Transformer Optimization Through Better Initialization.4Xiao Shi Huang1Felipe Pérez2Jimmy Ba3Maksims Volkovs44475-4483ICMLICML20202020provenance information for RDF data of dblp record 'conf/icml/HuangPBV20'2020-12-15T17:40:19+0100