2026.06.10 [논문리뷰]Do Transformers Need Three Projections? Systematic Study of QKV Variants (ICML, 2026)
2026.02.13 [논문리뷰]One token can help! learning scalable and pluggable virtual tokens for retrieval-augmented large language models (AAAI, 2025)
2026.01.07 [논문리뷰]DSAS: A Universal Plug-and-Play Framework for Attention Optimization in Multi-Document Question Answering (NeurIPS, 2025)