基于视觉-文本损失的开放词汇检测大模型对抗样本生成方法

Hao Shi; Shu Wang; Jianhong Han; Zhaoyi Luo; Yupei Wang

doi:10.11996/JG.j.2095-302X.2024061222

基于视觉-文本损失的开放词汇检测大模型对抗样本生成方法

Translated title of the contribution: Adversarial example generation method for open-vocabulary detection large models based on visually-textual fusion loss

Hao Shi, Shu Wang, Jianhong Han, Zhaoyi Luo, Yupei Wang^*

^*Corresponding author for this work

School of Information and Electronics

Beijing Institute of Technology

Research output: Contribution to journal › Article › peer-review

Abstract

Recently, open-vocabulary detection (OVD) has become a research focus in the field of computer vision due to its potential to recognize objects from unknown categories. As a representative approach in this domain, YOLO-World possesses powerful real-time detection capabilities; however, security issues stemming from the vulnerabilities of deep learning networks cannot be overlooked. Against this backdrop, a white-box adversarial examples generation method was proposed, targeting the YOLO-World algorithm, providing insights into identifying and quantifying vulnerabilities in large models. The method utilized gradient data generated during backpropagation in the YOLO-World network to optimize predefined perturbations, which were then added to original examples to form adversarial examples. Initially, confidence scores and bounding box information from model outputs served as a basis for preliminary optimization, resulting in adversarial examples with a certain level of attack effectiveness. This was further enhanced by a visually-textual fusion loss designed according to the RepVL-PAN structure in the YOLO-World model, to increase the destructiveness of adversarial examples against the model. Finally, perturbation magnitude loss was integrated to constrain the total amount of perturbation, generating adversarial examples with limited disturbance. The adversarial examples generated by this method were capable of achieving attack objectives such as confidence reduction and bounding box displacement according to practical needs. Experimental results demonstrated that the proposed method significantly impaired the YOLO-World model, with mean average precision dropping below 5% after testing on the LIVS dataset.

Translated title of the contribution	Adversarial example generation method for open-vocabulary detection large models based on visually-textual fusion loss
Original language	Chinese (Traditional)
Pages (from-to)	1222-1230
Number of pages	9
Journal	Journal of Graphics
Volume	45
Issue number	6
DOIs	https://doi.org/10.11996/JG.j.2095-302X.2024061222
Publication status	Published - Dec 2024

Access to Document

10.11996/JG.j.2095-302X.2024061222

Cite this

Shi, H., Wang, S., Han, J., Luo, Z., & Wang, Y. (2024). 基于视觉-文本损失的开放词汇检测大模型对抗样本生成方法. Journal of Graphics, 45(6), 1222-1230. https://doi.org/10.11996/JG.j.2095-302X.2024061222

@article{985062fc5f2d42d99f5891bd056fdb29,

title = "基于视觉-文本损失的开放词汇检测大模型对抗样本生成方法",

abstract = "Recently, open-vocabulary detection (OVD) has become a research focus in the field of computer vision due to its potential to recognize objects from unknown categories. As a representative approach in this domain, YOLO-World possesses powerful real-time detection capabilities; however, security issues stemming from the vulnerabilities of deep learning networks cannot be overlooked. Against this backdrop, a white-box adversarial examples generation method was proposed, targeting the YOLO-World algorithm, providing insights into identifying and quantifying vulnerabilities in large models. The method utilized gradient data generated during backpropagation in the YOLO-World network to optimize predefined perturbations, which were then added to original examples to form adversarial examples. Initially, confidence scores and bounding box information from model outputs served as a basis for preliminary optimization, resulting in adversarial examples with a certain level of attack effectiveness. This was further enhanced by a visually-textual fusion loss designed according to the RepVL-PAN structure in the YOLO-World model, to increase the destructiveness of adversarial examples against the model. Finally, perturbation magnitude loss was integrated to constrain the total amount of perturbation, generating adversarial examples with limited disturbance. The adversarial examples generated by this method were capable of achieving attack objectives such as confidence reduction and bounding box displacement according to practical needs. Experimental results demonstrated that the proposed method significantly impaired the YOLO-World model, with mean average precision dropping below 5% after testing on the LIVS dataset.",

keywords = "YOLO-World, adversarial examples, open vocabulary detection, sparse perturbations, visually-textual fusion loss",

author = "Hao Shi and Shu Wang and Jianhong Han and Zhaoyi Luo and Yupei Wang",

year = "2024",

month = dec,

doi = "10.11996/JG.j.2095-302X.2024061222",

language = "繁体中文",

volume = "45",

pages = "1222--1230",

journal = "Journal of Graphics",

issn = "2095-302X",

publisher = "Editorial of Board of Journal of Graphics",

number = "6",

}

TY - JOUR

T1 - 基于视觉-文本损失的开放词汇检测大模型对抗样本生成方法

AU - Shi, Hao

AU - Wang, Shu

AU - Han, Jianhong

AU - Luo, Zhaoyi

AU - Wang, Yupei

PY - 2024/12

Y1 - 2024/12

N2 - Recently, open-vocabulary detection (OVD) has become a research focus in the field of computer vision due to its potential to recognize objects from unknown categories. As a representative approach in this domain, YOLO-World possesses powerful real-time detection capabilities; however, security issues stemming from the vulnerabilities of deep learning networks cannot be overlooked. Against this backdrop, a white-box adversarial examples generation method was proposed, targeting the YOLO-World algorithm, providing insights into identifying and quantifying vulnerabilities in large models. The method utilized gradient data generated during backpropagation in the YOLO-World network to optimize predefined perturbations, which were then added to original examples to form adversarial examples. Initially, confidence scores and bounding box information from model outputs served as a basis for preliminary optimization, resulting in adversarial examples with a certain level of attack effectiveness. This was further enhanced by a visually-textual fusion loss designed according to the RepVL-PAN structure in the YOLO-World model, to increase the destructiveness of adversarial examples against the model. Finally, perturbation magnitude loss was integrated to constrain the total amount of perturbation, generating adversarial examples with limited disturbance. The adversarial examples generated by this method were capable of achieving attack objectives such as confidence reduction and bounding box displacement according to practical needs. Experimental results demonstrated that the proposed method significantly impaired the YOLO-World model, with mean average precision dropping below 5% after testing on the LIVS dataset.

AB - Recently, open-vocabulary detection (OVD) has become a research focus in the field of computer vision due to its potential to recognize objects from unknown categories. As a representative approach in this domain, YOLO-World possesses powerful real-time detection capabilities; however, security issues stemming from the vulnerabilities of deep learning networks cannot be overlooked. Against this backdrop, a white-box adversarial examples generation method was proposed, targeting the YOLO-World algorithm, providing insights into identifying and quantifying vulnerabilities in large models. The method utilized gradient data generated during backpropagation in the YOLO-World network to optimize predefined perturbations, which were then added to original examples to form adversarial examples. Initially, confidence scores and bounding box information from model outputs served as a basis for preliminary optimization, resulting in adversarial examples with a certain level of attack effectiveness. This was further enhanced by a visually-textual fusion loss designed according to the RepVL-PAN structure in the YOLO-World model, to increase the destructiveness of adversarial examples against the model. Finally, perturbation magnitude loss was integrated to constrain the total amount of perturbation, generating adversarial examples with limited disturbance. The adversarial examples generated by this method were capable of achieving attack objectives such as confidence reduction and bounding box displacement according to practical needs. Experimental results demonstrated that the proposed method significantly impaired the YOLO-World model, with mean average precision dropping below 5% after testing on the LIVS dataset.

KW - YOLO-World

KW - adversarial examples

KW - open vocabulary detection

KW - sparse perturbations

KW - visually-textual fusion loss

UR - http://www.scopus.com/inward/record.url?scp=85213889531&partnerID=8YFLogxK

U2 - 10.11996/JG.j.2095-302X.2024061222

DO - 10.11996/JG.j.2095-302X.2024061222

M3 - 文章

AN - SCOPUS:85213889531

SN - 2095-302X

VL - 45

SP - 1222

EP - 1230

JO - Journal of Graphics

JF - Journal of Graphics

IS - 6

ER -

基于视觉-文本损失的开放词汇检测大模型对抗样本生成方法

Abstract

Access to Document

Other files and links

Fingerprint

Cite this