GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

预备知识

核心思想

20250507104959

20250507105029

参考文献

  1. Zhao J., Zhang Z., Chen B., Wang Z., Anandkumar A., and Tian Y. GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection. ICML, 2024. [PDF] [Code]