Attend to Not Attended: Structure-then-Detail Token Merging for Post-training DiT Acceleration

Published in In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Conference, 2025

Recommended citation: Haipeng Fang, Sheng Tang, Juan Cao, Enshuo Zhang, Fan Tang, Tong-Yee Lee; "Attend to Not Attended: Structure-then-Detail Token Merging for Post-training DiT Acceleration." In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Conference, 2025.
Download Bibtex

Share on

Bluesky Facebook LinkedIn Mastodon X (formerly Twitter)