TY - JOUR
T1 - Modality Coupling for Privacy Image Classification
AU - Liu, Yucheng
AU - Huang, Yonggang
AU - Wang, Shoujin
AU - Lu, Wenpeng
AU - Wu, Huiyan
N1 - Publisher Copyright:
© 2005-2012 IEEE.
PY - 2023
Y1 - 2023
N2 - Privacy image classification (PIC) has attracted increasing attention as it can help people make appropriate privacy decisions when sharing images. Recently, pioneering research efforts have been made to utilize multimodal information for PIC, since multiple modalities can provide richer information than a single modality. These efforts on multimodal PIC rest on the assumption that modalities are independently and identically distributed. However, connections between different modalities commonly exist in real-world cases. Taking the modalities of scene and object as an example, in the scene 'library/indoor', the object 'book jacket' appears with high probability. To this end, a novel PIC approach, called CoupledPIC, is proposed in this paper to bridge this gap by comprehensively capturing the coupling relations between different modalities. In CoupledPIC, two submodules are designed to capture the explicit and implicit coupling relations between different modalities, respectively. The explicit modality coupling is learned with a tensor-fusion-network-based submodule via the direct interaction of features. For the implicit modality coupling, a graph-convolutional-network-based submodule is proposed to learn on both the initial graphs and attention-guided graphs via information aggregation on graphs. Extensive experiments on the public benchmark PicAlert demonstrate the effectiveness of the proposed CoupledPIC, which yields significant improvement by modeling inter-modality coupling information.
AB - Privacy image classification (PIC) has attracted increasing attention as it can help people make appropriate privacy decisions when sharing images. Recently, pioneering research efforts have been made to utilize multimodal information for PIC, since multiple modalities can provide richer information than a single modality. These efforts on multimodal PIC rest on the assumption that modalities are independently and identically distributed. However, connections between different modalities commonly exist in real-world cases. Taking the modalities of scene and object as an example, in the scene 'library/indoor', the object 'book jacket' appears with high probability. To this end, a novel PIC approach, called CoupledPIC, is proposed in this paper to bridge this gap by comprehensively capturing the coupling relations between different modalities. In CoupledPIC, two submodules are designed to capture the explicit and implicit coupling relations between different modalities, respectively. The explicit modality coupling is learned with a tensor-fusion-network-based submodule via the direct interaction of features. For the implicit modality coupling, a graph-convolutional-network-based submodule is proposed to learn on both the initial graphs and attention-guided graphs via information aggregation on graphs. Extensive experiments on the public benchmark PicAlert demonstrate the effectiveness of the proposed CoupledPIC, which yields significant improvement by modeling inter-modality coupling information.
KW - Privacy image classification
KW - explicit coupling
KW - graph convolutional networks
KW - implicit coupling
KW - modality coupling
KW - tensor fusion networks
UR - http://www.scopus.com/inward/record.url?scp=85166755029&partnerID=8YFLogxK
U2 - 10.1109/TIFS.2023.3301414
DO - 10.1109/TIFS.2023.3301414
M3 - Article
AN - SCOPUS:85166755029
SN - 1556-6013
VL - 18
SP - 4843
EP - 4853
JO - IEEE Transactions on Information Forensics and Security
JF - IEEE Transactions on Information Forensics and Security
ER -