TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation

研究背景

核心思想

20250720142111

参考文献

  1. Qu L., Zhang H., Liu Y., Wang X., Jiang Y., Gao Y., Ye H., Du D. K., Yuan Z. and Wu X. TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation. CVPR, 2025. [PDF] [Code]