A Secure and Disambiguating Approach for Generative Linguistic Steganography

Ruiyi Yan; Yating Yang; Tian Song

doi:10.1109/LSP.2023.3302749

Corresponding author: Yating Yang

School of Cyberspace Security

Beijing Institute of Technology

Research output: Contribution to journal › Article › Peer-reviewed

3 citations (Scopus)

Abstract

Segmentation ambiguity in generative linguistic steganography can cause decoding errors. One existing disambiguation method removes, from each candidate pool, every token whose mapped word is a prefix of another candidate's word. However, this method ignores the probability distribution of the candidates and degrades imperceptibility. To enhance steganographic security while resolving segmentation ambiguity, we propose a secure and disambiguating approach for linguistic steganography. In this letter, we focus on two questions: (1) Which candidate pools should be modified? (2) Which tokens should be retained? First, we propose a secure token-selection principle: the sum of the selected tokens' probabilities is positively correlated with statistical imperceptibility. To achieve both disambiguation and optimal security, we present a lightweight disambiguating approach that finds a maximum weight independent set (MWIS) in a candidate graph, and does so only when candidate-level ambiguity occurs. Experiments show that our approach outperforms the existing method on various security metrics, improving statistical imperceptibility by 25.7% and anti-steganalysis capacity by 11.2% on average.
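The idea in the abstract can be illustrated with a small Python sketch. It is not the authors' implementation: the function names, the toy candidate pool, and the brute-force MWIS search are all assumptions made for illustration. It assumes that "candidate-level ambiguity" means at least one candidate's word is a prefix of another's, that the candidate graph joins each such pair with an edge, and that each vertex is weighted by its token probability. The last helper assumes imperceptibility is measured by the KL divergence between the renormalized retained pool and the original pool, which equals the negative log of the retained probability mass when the original pool sums to 1.

```python
import math
from itertools import combinations


def prefix_edges(tokens):
    """Index pairs (i, j) where one token's word is a prefix of the other's."""
    return [
        (i, j)
        for i, j in combinations(range(len(tokens)), 2)
        if tokens[i].startswith(tokens[j]) or tokens[j].startswith(tokens[i])
    ]


def remove_prefix_tokens(pool):
    """Existing method as described: drop every token that is a prefix of another."""
    return {
        tok: p
        for tok, p in pool.items()
        if not any(other != tok and other.startswith(tok) for other in pool)
    }


def mwis_disambiguate(pool):
    """Keep the conflict-free subset with the largest total probability.

    Brute force over all subsets (an assumption for clarity; only
    practical for small pools). The pool is returned unchanged when
    no prefix conflict exists.
    """
    tokens = list(pool)
    edges = prefix_edges(tokens)
    if not edges:  # no candidate-level ambiguity: leave the pool alone
        return dict(pool)
    best_mask, best_weight = 0, -1.0
    for mask in range(1 << len(tokens)):
        # Skip subsets containing both endpoints of any conflict edge.
        if any((mask >> i) & 1 and (mask >> j) & 1 for i, j in edges):
            continue
        weight = sum(pool[t] for k, t in enumerate(tokens) if (mask >> k) & 1)
        if weight > best_weight:
            best_mask, best_weight = mask, weight
    return {t: pool[t] for k, t in enumerate(tokens) if (best_mask >> k) & 1}


def kl_after_truncation(kept):
    """KL(renormalized kept pool || original pool), assuming the original sums to 1."""
    return -math.log(sum(kept.values()))


# Hypothetical candidate pool: word -> probability.
pool = {"any": 0.40, "anyone": 0.05, "anything": 0.10, "some": 0.30, "body": 0.15}

baseline = remove_prefix_tokens(pool)
selected = mwis_disambiguate(pool)
print(sorted(baseline))  # ['anyone', 'anything', 'body', 'some']
print(sorted(selected))  # ['any', 'body', 'some']
print(kl_after_truncation(baseline) > kl_after_truncation(selected))  # True
```

In this toy pool the prefix-removal baseline discards "any", the most probable candidate, while the MWIS selection discards the two low-probability extensions instead, so it retains more probability mass and distorts the distribution less.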

Original language	English
Pages (from-to)	1047-1051
Number of pages	5
Journal	IEEE Signal Processing Letters
Volume	30
DOI	https://doi.org/10.1109/LSP.2023.3302749
Publication status	Published - 2023

Cite this

@article{ea8551f1f29c47f48f32f86102435332,

title = "A Secure and Disambiguating Approach for Generative Linguistic Steganography",

keywords = "Linguistic steganography, disambiguation, maximum weight independent set, segmentation ambiguity",

author = "Ruiyi Yan and Yating Yang and Tian Song",

note = "Publisher Copyright: {\textcopyright} 1994-2012 IEEE.",

year = "2023",

doi = "10.1109/LSP.2023.3302749",

language = "English",

volume = "30",

pages = "1047--1051",

journal = "IEEE Signal Processing Letters",

issn = "1070-9908",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - A Secure and Disambiguating Approach for Generative Linguistic Steganography

AU - Yan, Ruiyi

AU - Yang, Yating

AU - Song, Tian

PY - 2023

Y1 - 2023

KW - Linguistic steganography

KW - disambiguation

KW - maximum weight independent set

KW - segmentation ambiguity

UR - http://www.scopus.com/inward/record.url?scp=85167783107&partnerID=8YFLogxK

U2 - 10.1109/LSP.2023.3302749

DO - 10.1109/LSP.2023.3302749

M3 - Article

AN - SCOPUS:85167783107

SN - 1070-9908

VL - 30

SP - 1047

EP - 1051

JO - IEEE Signal Processing Letters

JF - IEEE Signal Processing Letters

ER -
