July 16, 2025Dynamic Chunking for End-to-End Hierarchical Sequence Modeling符号序列的自动切分, 探究非 Subword Tokenizer 的可能性NoteTokenizationDynamic ChunkingEmpirical2025
July 15, 2025MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers多尺度 Transformer, 探究非 Subword Tokenizer 的可能性NoteTokenizationMultiscaleSeminalEmpiricalNeurIPS2023
July 15, 2025SpaceByte: Towards Deleting Tokenization from Large Language Modeling探究非 Subword Tokenizer 的可能性NoteTokenizationMultiscaleEmpiricalNeurIPS2024