
Robust AI generated text detection through multi-grained latent feature denoising and contrastive representation learning

  • Xin Liu
  • Shuo Wang
  • Yang Li
  • Kan Li*
  • *Corresponding author for this work
  • Beijing Institute of Technology
  • Jilin University

Research output: Journal article, peer-reviewed

Abstract

As large language models (LLMs) evolve rapidly, distinguishing AI-generated text (AIGT) from human-written text (HWT) is becoming increasingly challenging. Recently, some AIGT detectors have been developed to overcome this challenge and have achieved decent accuracy. However, their brittle text representations make them highly susceptible to text perturbations, such that even minor character-level perturbations can reverse their predictions. In this work, we propose a multi-grained latent feature denoising and contrastive representation learning architecture to enhance text representations in terms of granularity, robustness, and distinguishability of features, thereby achieving robust AIGT detection. Specifically, we first extract both document-level and fine-grained segment-level features using a dual network, which captures the global and subtle local differences between AIGT and HWT. To encourage feature stability under perturbations, we inject random noise into both latent features and employ a denoising network to reconstruct the original representations. While this does not precisely simulate discrete character-level perturbations, it acts as a feature-level regularizer that suppresses non-essential variations and promotes smoother, more stable representations. Considering the similarities between AIGT and HWT, we further design a contrastive augmentation mechanism to increase the distinguishability between them. Extensive experiments demonstrate that our method not only outperforms baseline models in terms of classification accuracy but also exhibits superior robustness against various text perturbations.
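
The two representation-level objectives described in the abstract, denoising of noise-injected latent features and contrastive separation of AIGT from HWT, can be illustrated with a minimal sketch. This is not the authors' implementation: the Gaussian noise model, the mean-squared-error reconstruction loss, the cosine-similarity contrastive loss with temperature `tau`, and every function name below are assumptions chosen for illustration, and the denoiser is any callable standing in for the paper's denoising network.

```python
import math
import random


def add_gaussian_noise(z, sigma, rng):
    """Return a copy of latent vector z with N(0, sigma^2) noise on each entry."""
    return [v + rng.gauss(0.0, sigma) for v in z]


def mse(a, b):
    """Mean squared error between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def denoising_loss(denoiser, z, sigma, rng):
    """Reconstruction error of the clean feature z from its noisy copy.

    `denoiser` is a hypothetical stand-in for the denoising network.
    """
    noisy = add_gaussian_noise(z, sigma, rng)
    return mse(denoiser(noisy), z)


def cosine(a, b):
    """Cosine similarity; assumes neither vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def contrastive_loss(feats, labels, tau=0.5):
    """Supervised contrastive loss, averaged over all positive pairs.

    Features sharing a label (e.g. both AIGT) are pulled together;
    features with different labels are pushed apart.
    """
    n = len(feats)
    total, count = 0.0, 0
    for i in range(n):
        # Normaliser: similarity of anchor i to every other sample.
        denom = sum(
            math.exp(cosine(feats[i], feats[k]) / tau)
            for k in range(n) if k != i
        )
        for j in range(n):
            if j != i and labels[j] == labels[i]:
                pos = math.exp(cosine(feats[i], feats[j]) / tau)
                total += -math.log(pos / denom)
                count += 1
    return total / count if count else 0.0


if __name__ == "__main__":
    rng = random.Random(0)
    z_doc = [0.2, -0.5, 0.9]  # toy document-level latent feature

    def identity(z):
        # Trivial placeholder denoiser: returns its input unchanged.
        return z

    print("denoising loss:", denoising_loss(identity, z_doc, 0.1, rng))

    feats = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
    labels = [0, 0, 1, 1]  # 0 = HWT, 1 = AIGT (toy labels)
    print("contrastive loss:", contrastive_loss(feats, labels))
```

In a setup like the one the abstract describes, the same denoising term would presumably be applied to both the document-level and the segment-level features; how the terms are weighted against the classification loss is not stated in the abstract and is left out here.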

Original language: English
Journal: Intelligent Data Analysis
DOI
Publication status: Accepted/In press - 2026
Externally published
