Contextual Translation Embedding for Visual Relationship Detection and Scene Graph Generation. 2021

Zih-Siou Hung, and Arun Mallya, and Svetlana Lazebnik

Relations amongst entities play a central role in image understanding. Due to the complexity of modeling (subject, predicate, object) relation triplets, it is crucial to develop a method that can not only recognize seen relations, but also generalize to unseen cases. Inspired by a previously proposed visual translation embedding model, or VTransE [1] , we propose a context-augmented translation embedding model that can capture both common and rare relations. The previous VTransE model maps entities and predicates into a low-dimensional embedding vector space where the predicate is interpreted as a translation vector between the embedded features of the bounding box regions of the subject and the object. Our model additionally incorporates the contextual information captured by the bounding box of the union of the subject and the object, and learns the embeddings guided by the constraint predicate ≈ union (subject, object) - subject - object. In a comprehensive evaluation on multiple challenging benchmarks, our approach outperforms previous translation-based models and comes close to or exceeds the state of the art across a range of settings, from small-scale to large-scale datasets, from common to previously unseen relations. It also achieves promising results for the recently introduced task of scene graph generation.

UI MeSH Term Description Entries

Related Publications

Zih-Siou Hung, and Arun Mallya, and Svetlana Lazebnik
June 2024, IEEE transactions on neural networks and learning systems,
Zih-Siou Hung, and Arun Mallya, and Svetlana Lazebnik
December 2023, IEEE transactions on visualization and computer graphics,
Zih-Siou Hung, and Arun Mallya, and Svetlana Lazebnik
April 2024, IEEE transactions on pattern analysis and machine intelligence,
Zih-Siou Hung, and Arun Mallya, and Svetlana Lazebnik
January 2022, IEEE transactions on neural networks and learning systems,
Zih-Siou Hung, and Arun Mallya, and Svetlana Lazebnik
July 2022, IEEE transactions on cybernetics,
Zih-Siou Hung, and Arun Mallya, and Svetlana Lazebnik
September 2023, IEEE transactions on pattern analysis and machine intelligence,
Zih-Siou Hung, and Arun Mallya, and Svetlana Lazebnik
January 2022, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society,
Zih-Siou Hung, and Arun Mallya, and Svetlana Lazebnik
August 2023, IEEE transactions on pattern analysis and machine intelligence,
Zih-Siou Hung, and Arun Mallya, and Svetlana Lazebnik
October 2023, IEEE transactions on pattern analysis and machine intelligence,
Zih-Siou Hung, and Arun Mallya, and Svetlana Lazebnik
January 2025, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society,
Copied contents to your clipboard!