Language Representations Can be What Recommenders Need: Findings and Potentials

核心思想

20250323131310

Linear

20250323132631

AlphaRec

20250323133416

其它潜力

20250323133753

20250323133719

个人测试

Movies

# AlphaRec
root: ../../data
dataset: AmazonMovies_Alpha
tasktag: Matching

embedding_dim: 64
num_layers: 2

epochs: 500
batch_size: 4096
optimizer: adam
lr: 5.e-4
weight_decay: 1.e-6

tau: 0.15
num_negs: 256
projector: mlp

monitors: [LOSS, Recall@1, Recall@10, Recall@20, HitRate@10, HitRate@20, NDCG@10, NDCG@20]
which4best: Recall@20
# LightGCN
root: ../../data
dataset: AmazonMovies_Alpha
tasktag: Matching

embedding_dim: 64
num_layers: 2

epochs: 1000
batch_size: 2048
optimizer: adam
lr: 1.e-3
weight_decay: 1.e-3

monitors: [LOSS, Recall@1, Recall@10, Recall@20, HitRate@10, HitRate@20, NDCG@10, NDCG@20]
which4best: NDCG@20

Beauty

# LightGCN

root: ../../data
dataset: Amazon2014Beauty_550811_ROU
tasktag: Matching

embedding_dim: 64
num_layers: 3

epochs: 1000
batch_size: 2048
optimizer: adam
lr: 1.e-3
weight_decay: 1.e-3

monitors: [LOSS, Recall@1, Recall@10, Recall@20, NDCG@10, NDCG@20]
which4best: NDCG@20
# LightGCN + InfoNCE

root: ../../data
dataset: Amazon2014Beauty_550811_ROU
tasktag: Matching

embedding_dim: 64
num_layers: 3
num_negs: 256
tau: 0.15

epochs: 500
batch_size: 2048
optimizer: adam
lr: 5.e-4
weight_decay: 1.e-2

monitors: [LOSS, Recall@1, Recall@10, Recall@20, NDCG@10, NDCG@20]
which4best: NDCG@20
root: ../../data
dataset: Amazon2014Beauty_550811_ROU
tasktag: Matching

embedding_dim: 64
num_layers: 3
tfile: llama2_7b_title.pkl # llama2_13b_title.pkl

epochs: 500
batch_size: 2048
optimizer: adam
lr: 5.e-4
weight_decay: 0.

tau: 0.15
num_negs: 256
projector: mlp

monitors: [LOSS, Recall@1, Recall@10, Recall@20, NDCG@10, NDCG@20]
which4best: NDCG@20
Method R@1 R@10 R@20 N@10 N@20
LightGCN 0.0079 0.0538 0.0836 0.0282 0.0361
LightGCN+InfoNCE 0.0098 0.0544 0.0829 0.0296 0.0371
AlphaRec (Llama2-7B) 0.0104 0.0618 0.0925 0.0330 0.0412
AlphaRec (Llama2-13B) 0.0107 0.0608 0.0921 0.0329 0.0412
AlphaRec (MiniLM-L12-v2) 0.0100 0.0608 0.0930 0.0322 0.0407

Baby

# LightGCN
root: ../../data
dataset: Amazon2014Baby_550811_RAU
tasktag: Matching

embedding_dim: 64
# num_layers: 3
# num_negs: 256
# tau: 0.25

epochs: 100
batch_size: 2048
optimizer: adam
lr: 1.e-3
weight_decay: 5.e-3

monitors: [LOSS, Recall@1, Recall@10, Recall@20, NDCG@10, NDCG@20]
which4best: NDCG@20
# LightGCN + InfoNCE
root: ../../data
dataset: Amazon2014Baby_550811_RAU
tasktag: Matching

embedding_dim: 64
num_layers: 3
num_negs: 256
tau: 0.25

epochs: 500
batch_size: 2048
optimizer: adam
lr: 1.e-3
weight_decay: 1.e-3

monitors: [LOSS, Recall@1, Recall@10, Recall@20, NDCG@10, NDCG@20]
which4best: NDCG@20
# AlphaRec
root: ../../data
dataset: Amazon2014Baby_550811_RAU
tasktag: Matching

embedding_dim: 64
num_layers: 3
tfile: llama2_7b_title.pkl

epochs: 500
batch_size: 2048
optimizer: adam
lr: 5.e-4
weight_decay: 0.

tau: 0.25
num_negs: 256
projector: mlp

monitors: [LOSS, Recall@1, Recall@10, Recall@20, NDCG@10, NDCG@20]
which4best: NDCG@20
Method R@1 R@10 R@20 N@10 N@20
LightGCN 0.0037 0.0212 0.0357 0.0113 0.0151
LightGCN+InfoNCE 0.0036 0.0206 0.0344 0.0111 0.0147
AlphaRec (Llama2-7B) 0.0039 0.0243 0.0399 0.0128 0.0169
AlphaRec (Llama2-13B) 0.0037 0.0242 0.0399 0.0126 0.0167
AlphaRec (MiniLM-L12-v2) 0.0031 0.0229 0.0385 0.0117 0.0158

参考文献

  1. Sheng L., Zhang A., Zhang Y., Chen Y., Wang X., and Chua T. Language Representations Can be What Recommenders Need: Findings and Potentials ICLR, 2025. [PDF] [Code]