May 7, 2025 | 1-bit Adam: Communication Efficient Large-Scale Training with Adam's Convergence Speed | A 1-bit SGD optimization method with Adam pre-training | Tags: Note, Low-Precision, Quantization, Error Compensation, Optimizer, Theoretical, ICML2021
May 7, 2025 | CPT: Efficient Deep Neural Network Training via Cyclic Precision | CPT, a precision cycling mechanism similar to CosineAnnealingWarmRestarts | Tags: Note, Low-Precision, Quantization, Generalization, Empirical, ICLR2021
March 11, 2025 | Taming Transformers for High-Resolution Image Synthesis | VQGAN, autoregressive image generation | Tags: Note, GAN, Vector Quantization, Image Synthesis, Seminal, Empirical, CVPR2021