SmallInitEmb

LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence

Stars
45

Statistics for this project are still being loaded, please check back later.

Related Projects