SWALP: Stochastic Weight Averaging in Low-Precision Training

Preliminaries

Core Idea


References

  1. Yang G., Zhang T., Kirichenko P., Bai J., Wilson A. G., and De Sa C. SWALP: Stochastic Weight Averaging in Low-Precision Training. ICML, 2019.
  2. Izmailov P., Podoprikhin D., Garipov T., Vetrov D., and Wilson A. G. Averaging Weights Leads to Wider Optima and Better Generalization. arXiv, 2018.