July 20, 2025TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation兼顾 Low-level 的 Pixel 信息和 High-level 的 Semantic 信息NoteMLLMVector QuantizationEmpiricalCVPR2025
July 17, 2025UniCode$^2$: Cascaded Large-scale Codebooks for Unified Multimodal Understanding and Generation非常自然的 Image-Codeword+Text-LLM-NextCodeWord-Generation 流程NoteMLLMVector QuantizationEmpirical2025