COMPUTER VISION

Time-rEversed diffusioN tEnsor Transformer: A new TENET of Few-Shot Object Detection

October 22, 2022

Abstract

In this paper, we tackle the challenging problem of Few-Shot Object Detection (FSOD). Existing FSOD pipelines (i) use average-pooled representations that result in information loss; and/or (ii) discard position information that can help detect object instances. Consequently, such pipelines are sensitive to large intra-class appearance and geometric variations between support and query images. To address these drawbacks, we propose a Time-rEversed diffusioN tEnsor Transformer (TENET), which (i) forms high-order tensor representations that capture highly discriminative multi-way feature occurrences, and (ii) uses a transformer that dynamically extracts correlations between the query image and the entire support set, instead of a single average-pooled support embedding. We also propose a Transformer Relation Head (TRH), equipped with higher-order representations, which encodes correlations between query regions and the entire support set, while being sensitive to the positional variability of object instances. Our model achieves state-of-the-art results on PASCAL VOC, FSOD, and COCO.
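
To make the information-loss argument concrete, the toy sketch below contrasts average pooling with plain second-order pooling (the averaged outer product of feature vectors). It is not the TENET representation described in the paper, only an illustration of why pooled co-occurrence statistics can separate regions that average pooling cannot. The function names and the two toy regions are hypothetical and chosen for illustration.

```python
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def average_pool(features: List[Vector]) -> Vector:
    """First-order pooling: mean of the feature vectors over all locations."""
    n = len(features)
    d = len(features[0])
    return [sum(f[k] for f in features) / n for k in range(d)]


def second_order_pool(features: List[Vector]) -> Matrix:
    """Second-order pooling: mean outer product, i.e. channel co-occurrences."""
    n = len(features)
    d = len(features[0])
    return [
        [sum(f[i] * f[j] for f in features) / n for j in range(d)]
        for i in range(d)
    ]


# Two toy regions, each with two spatial locations and two feature channels.
# (Illustrative data only, not taken from the paper.)
region_a = [[1.0, 1.0], [0.0, 0.0]]  # both channels fire at the same location
region_b = [[1.0, 0.0], [0.0, 1.0]]  # channels fire at different locations

avg_a = average_pool(region_a)
avg_b = average_pool(region_b)
so_a = second_order_pool(region_a)
so_b = second_order_pool(region_b)

print(avg_a == avg_b)  # True: average pooling cannot tell the regions apart
print(so_a == so_b)    # False: the off-diagonal co-occurrence terms differ
```

The off-diagonal entries of the second-order matrix record how often two channels respond together, which is exactly the information the mean discards. Higher-order tensors extend the same idea to co-occurrences of three or more channels.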

AUTHORS

Written by

Naila Murray

Lei Wang

Piotr Koniusz

Shan Zhang

Publisher

ECCV

Research Topics

Computer Vision

Related Publications

February 13, 2024

GRAPHICS

COMPUTER VISION

IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation

Luke Melas-Kyriazi, Iro Laina, Christian Rupprecht, Natalia Neverova, Andrea Vedaldi, Oran Gafni, Filippos Kokkinos

January 25, 2024

COMPUTER VISION

LRR: Language-Driven Resamplable Continuous Representation against Adversarial Tracking Attacks

Felix Xu, Di Lin, Jianjun Zhao, Jianlang Chen, Lei Ma, Qing Guo, Wei Feng, Xuhong Ren

October 29, 2023

COMPUTER VISION

ALA: Naturalness-aware Adversarial Lightness Attack

Felix Xu, Geguang Pu, Jiayi Zhu, Jincao Feng, Liangru Sun, Qing Guo, Yang Liu, Yihao Huang

September 30, 2023

INTEGRITY

COMPUTER VISION

The Stable Signature: Rooting Watermarks in Latent Diffusion Models

Pierre Fernandez, Guillaume Couairon, Hervé Jegou, Matthijs Douze, Teddy Furon
