LLM

August 27, 2025

HLLM-Creator: Hierarchical LLM-based Personalized Creative Generation

Efficiently generating personalized ad copy with LLMs

August 11, 2025

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Agent Distillation

August 11, 2025

Structured Agent Distillation for Large Language Model

Distilling an agent's reasoning and actions

July 2, 2025

EARN: Efficient Inference Acceleration for LLM-based Generative Recommendation by Register Tokens

Accelerating LLMRec inference by reducing the KV cache size

June 28, 2025

Adapting Large Language Models by Integrating Collaborative Semantics for Recommendation

LC-Rec: LLM + RQ-VAE + rich multi-task training

May 13, 2025

Base of RoPE Bounds Context Length

How the RoPE base affects the model's ability to perceive similar tokens

May 12, 2025

Round and Round We Go! What makes Rotary Positional Encodings useful?

Understanding the high and low frequencies of RoPE

May 11, 2025

Transformers need glasses! Information Over-Squashing in Language Tasks

LLM Representational Collapse

April 5, 2025

Language Representations Can be What Recommenders Need: Findings and Potentials

What next-token embeddings mean for collaborative filtering

April 2, 2025

Physics of Language Models: Part 3.1, Knowledge Storage and Extraction

An experimental study of how LLMs memorize and extract knowledge