Vector Quantization

July 22, 2025

BEiT: BERT Pre-Training of Image Transformers

Visual Tokens & Masked Image Modeling

July 21, 2025

Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction

自回归图像生成: Multi-scale Quantization & Next-scale Prediction

July 20, 2025

TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation

兼顾 Low-level 的 Pixel 信息和 High-level 的 Semantic 信息

July 17, 2025

UniCode$^2$: Cascaded Large-scale Codebooks for Unified Multimodal Understanding and Generation

非常自然的 Image-Codeword+Text-LLM-NextCodeWord-Generation 流程

July 16, 2025

SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer

TiTok 框架 + Softmax 版 Vector Quantization 以期更高的压缩比

July 9, 2025

Is Vector Quantization the Future of Recommendation?

June 30, 2025

Sparse Meets Dense: Unified Generative Recommendations with Cascaded Sparse-Dense Representations

COBRA, 链接离散编码和稠密表示的尝试

June 28, 2025

Adapting Large Language Models by Integrating Collaborative Semantics for Recommendation

LC-Rec, LLM + RQ-VAE + 丰富的多任务训练

June 24, 2025

OneRec: Unifying Retrieve and Rank with Generative Recommender and Preference Alignment

OneRec, 端到端的推荐模型

June 15, 2025

Addressing Representation Collapse in Vector Quantized Models with One Linear Layer

SimVQ, 坐标变换替代可学习 Codebook