January 25, 2026Fine-Tuning Language Models with Just Forward Passes零阶优化 & 收敛理论NoteOptimizationZeroth-OrderLLMTheoretical2023
July 15, 2025MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers多尺度 Transformer, 探究非 Subword Tokenizer 的可能性NoteTokenizationMultiscaleSeminalEmpiricalNeurIPS2023
May 22, 2025Universal Prompt Tuning for Graph Neural Networks图上特征 prompt 等价各异 graph promptNoteGraphGNNPromptTheoreticalNeurIPS2023
May 21, 2025All in One: Multi-Task Prompting for Graph Neural Networks统一 graph/edge/node-level 的 graph promptNoteGraphGNNPromptEmpiricalKDD2023
March 16, 2025Recommender Systems with Generative RetrievalTIGER, 向量量化生成式检索NoteSequential RecommendationGenerativeVector QuantizationSeminalEmpiricalNeurIPS2023
March 12, 2025Finite Scalar Quantization: VQ-VAE Made SimpleFSQ, 标量量化NoteVector QuantizationCodebook CollapseEmpiricalArXiv2023