TY - JOUR
T1 - A Secure and Disambiguating Approach for Generative Linguistic Steganography
AU - Yan, Ruiyi
AU - Yang, Yating
AU - Song, Tian
N1 - Publisher Copyright:
© 1994-2012 IEEE.
PY - 2023
Y1 - 2023
N2 - Segmentation ambiguity in generative linguistic steganography could induce decoding errors. One existing disambiguating way is removing the tokens whose mapping words are the prefixes of others in each candidate pool. However, it neglects probability distribution of candidates and degrades imperceptibility. To enhance steganographic security, meanwhile addressing segmentation ambiguity, we propose a secure and disambiguating approach for linguistic steganography. In this letter, we focus on two questions: (1) Which candidate pools should be modified? (2) Which tokens should be retained? Firstly, we propose a secure token-selection principle that the sum of selected tokens' probabilities is positively correlated to statistical imperceptibility. To meet both disambiguation and optimal security, we present a lightweight disambiguating approach that is finding out a maximum weight independent set (MWIS) in one candidate graph only when candidate-level ambiguity occurs. Experiments show that our approach outperforms the existing method in various security metrics, improving 25.7% statistical imperceptibility and 11.2% anti-steganalysis capacity averagely.
AB - Segmentation ambiguity in generative linguistic steganography could induce decoding errors. One existing disambiguating way is removing the tokens whose mapping words are the prefixes of others in each candidate pool. However, it neglects probability distribution of candidates and degrades imperceptibility. To enhance steganographic security, meanwhile addressing segmentation ambiguity, we propose a secure and disambiguating approach for linguistic steganography. In this letter, we focus on two questions: (1) Which candidate pools should be modified? (2) Which tokens should be retained? Firstly, we propose a secure token-selection principle that the sum of selected tokens' probabilities is positively correlated to statistical imperceptibility. To meet both disambiguation and optimal security, we present a lightweight disambiguating approach that is finding out a maximum weight independent set (MWIS) in one candidate graph only when candidate-level ambiguity occurs. Experiments show that our approach outperforms the existing method in various security metrics, improving 25.7% statistical imperceptibility and 11.2% anti-steganalysis capacity averagely.
KW - Linguistic steganography
KW - disambiguation
KW - maximum weight independent set
KW - segmentation ambiguity
UR - http://www.scopus.com/inward/record.url?scp=85167783107&partnerID=8YFLogxK
U2 - 10.1109/LSP.2023.3302749
DO - 10.1109/LSP.2023.3302749
M3 - Article
AN - SCOPUS:85167783107
SN - 1070-9908
VL - 30
SP - 1047
EP - 1051
JO - IEEE Signal Processing Letters
JF - IEEE Signal Processing Letters
ER -