Our colleagues from the Centre for Research and Technology Hellas, Information Technologies Institute (CERTH) are currently at the 30th International Conference on Multimedia Modeling (MMM2024) , taking place from January 29 to February 2 in Amsterdam, the Netherlands. MMM is a leading international conference for researchers and industry practitioners for sharing new ideas, original research results and practical development experiences from all MMM-related areas.
Paper: Exploring Multi-Modal Fusion for Image Manipulation Detection and Localization
Authors: Konstantinos Triaridis and Vasileios Mezaris
Abstract: Recent image manipulation localization and detection techniques usually leverage forensic artifacts and traces that are produced by a noise-sensitive filter, such as SRM and Bayar convolution. In this paper, we showcase that different filters commonly used in such approaches excel at unveiling different types of manipulations and provide complementary forensic traces. Thus, we explore ways of merging the outputs of such filters and aim to leverage the complementary nature of the artifacts produced to perform image manipulation localization and detection (IMLD). We propose two distinct methods: one that produces independent features from each forensic filter and then fuses them (this is referred to as late fusion) and one that performs early mixing of different modal outputs and produces early combined features (this is referred to as early fusion). We demonstrate that both approaches achieve competitive performance for both image manipulation localization and detection, outperforming state-of-the-art models across several datasets.
Download the Publication
The paper is available via the CRiTERIA publication portal.