Analysis of Data Parallel Methods in Training Neural Language Models via Multiple GPUs. CWMT2017.
Release time:2024-01-09
Hits:
- First Author:
- Yinqiao Li, Ambyer Han, Le Bo, Tong Xiao, Jingbo Zhu, Li Zhang.
- Translation or Not:
- no
