Louis Martin
Louis Martin
Home
Experience
Publications
Projects
Contact
CV
Light
Dark
Automatic
2
Efficient Large Scale Language Modeling with Mixtures of Experts
Mixture of Experts layers (MoEs) enable efficient scaling of language models through conditional computation. This paper presents a detailed empirical study of how autoregressive MoE language models scale in comparison with dense models in a wide …
Rethinking Automatic Evaluation in Sentence Simplification
Cite
×