2026.06.10 [논문리뷰]Do Transformers Need Three Projections? Systematic Study of QKV Variants (ICML, 2026)
2026.02.13 [논문리뷰]One token can help! learning scalable and pluggable virtual tokens for retrieval-augmented large language models (AAAI, 2025)