Understanding the Role of the Projector in Knowledge Distillation
Published in Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI-24), 2024
Explored a novel perspective of knowledge distillation through the training dynamics of the projector weights. We proposed a very simple distillation pipeline to attain a new state-of-the-art for the data efficient training of transformer models.
Recommended citation: Miles, R., & Mikolajczyk, K. (2023). Understanding the Role of the Projector in Knowledge Distillation. AAAI. https://arxiv.org/abs/2303.11098