Abstract

We present Agglomerative Token Clustering (ATC), a novel token merging method that consistently outperforms previous token merging and pruning methods across image classification, image synthesis, and object detection & segmentation tasks. ATC merges clusters through bottom-up hierarchical clustering, without the introduction of extra learnable parameters. We find that ATC achieves state-of-the-art performance across all tasks, and can even perform on par with prior state-of-the-art when applied off-the-shelf, i.e. without fine-tuning. ATC is particularly effective when applied with low keep rates, where only a small fraction of tokens are kept and retaining task performance is especially difficult.

Original languageEnglish
Title of host publicationComputer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LVII
EditorsAleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol
Number of pages19
PublisherSpringer
Publication date2024
Pages200-218
ISBN (Print)9783031729973
DOIs
Publication statusPublished - 2024
Event18th European Conference on Computer Vision, ECCV 2024 - Milan, Italy
Duration: 29 Sept 20244 Oct 2024

Conference

Conference18th European Conference on Computer Vision, ECCV 2024
Country/TerritoryItaly
CityMilan
Period29/09/202404/10/2024
SeriesLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15115 LNCS
ISSN0302-9743

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

Keywords

  • Computer Vision
  • Vision Transformers
  • Token Reduction
  • Clustering
  • Agglomerative Clustering
  • Hierarchal Clustering

Fingerprint

Dive into the research topics of 'Agglomerative Token Clustering'. Together they form a unique fingerprint.
  • Pioneer Centre for AI

    Tan, Z.-H. (CoPI), Moeslund, T. B. (CoPI) & Larsen, T. (Project Participant)

    01/07/2021 → …

    Project: Research

Cite this