TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation

Preliminaries

Core Idea

References

  1. Qu L., Zhang H., Liu Y., Wang X., Jiang Y., Gao Y., Ye H., Du D. K., Yuan Z. and Wu X. TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation. CVPR, 2025.