2025.09.30 [논문리뷰]Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding(ACL, 2024)
2025.09.22 [논문리뷰]DAC: A Dynamic Attention-aware Approach for Task-Agnostic Prompt Compression (ACL, 2025)