May 11, 2025
Transformers need glasses! Information Over-Squashing in Language Tasks
LLM Representational Collapse
Note, LLM, Representational Collapse, Over-Squashing, Empirical, NeurIPS 2024