April 24, 2026 · Factorizing personalized Markov chains for next-basket recommendation · FPMC · Note, Recommendation, Sequential, Markov, Empirical, WWW, 2010
April 9, 2026 · Transformers Can Do Bayesian Inference · PFN: Prior-Data Fitted Networks · Note, Simulation, PFN, Seminal, ICLR, 2022
April 2, 2026 · PAR$^2$-RAG: Planned Active Retrieval and Reasoning for Multi-Hop Question Answering · A multi-hop RAG scheme with coarse recall and fine-grained retrieval · Note, LLM, Agent, RAG, 2026
March 30, 2026 · Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning · How to distill reasoning paths from graphs · Note, LLM, GraphRAG, Empirical, ICLR, 2024
March 25, 2026 · SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks? · SWE-Bench Pro · Note, Code, AI Software Engineering, Benchmark, Empirical, 2025
March 5, 2026 · DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models · GRPO: Group Relative Policy Optimization · Note, Reinforcement Learning, Seminal, 2024
March 4, 2026 · Trust Region Policy Optimization · The predecessor of PPO · Note, Reinforcement Learning, Theoretical, Seminal, ICML, 2015