2025.09.30 [Paper Review] Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding (ACL, 2024)
2025.09.22 [Paper Review] DAC: A Dynamic Attention-aware Approach for Task-Agnostic Prompt Compression (ACL, 2025)
2025.09.20 [Paper Review] UniICL: An Efficient Unified Framework Unifying Compression, Selection, and Generation (ACL, 2025)