跳到主要导航 跳到搜索 跳到主要内容

VORTEXPIA: Indirect Prompt Injection Attack against LLMs for Efficient Extraction of User Privacy

  • Yu Cui
  • , Sicheng Pan
  • , Yifei Liu
  • , Haibin Zhang*
  • , Cong Zuo*
  • *此作品的通讯作者
  • Beijing Institute of Technology
  • Tsinghua University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Large language models (LLMs) have been widely deployed in Conversational AIs (CAIs), while exposing privacy and security threats. Recent research shows that LLM-based CAIs can be manipulated to extract private information from human users, posing serious security threats. However, the methods proposed in that study rely on a white-box setting that adversaries can directly modify the system prompt. This condition is unlikely to hold in real-world deployments. The limitation raises a critical question: can unprivileged attackers still induce such privacy risks in practical LLM-integrated applications? To address this question, we propose VORTEXPIA, a novel indirect prompt injection attack that induces privacy extraction in LLM-integrated applications under black-box settings. By injecting token-efficient data containing false memories, VORTEXPIA misleads LLMs to actively request private information in batches. Unlike prior methods, VORTEXPIA allows attackers to flexibly define multiple categories of sensitive data. We evaluate VORTEXPIA on six LLMs, covering both traditional and reasoning LLMs, across four benchmark datasets. The results show that VORTEXPIA significantly outperforms baselines and achieves state-of-the-art (SOTA) performance. It also demonstrates efficient privacy requests, reduced token consumption, and enhanced robustness against defense mechanisms. We further validate VORTEXPIA on multiple realistic open-source LLM-integrated applications, demonstrating its practical effectiveness. Our code is available at https://github.com/cuiyu-ai/VortexPIA.

源语言英语
主期刊名19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026
出版商Association for Computational Linguistics (ACL)
587-609
页数23
ISBN(电子版)9798891763869
DOI
出版状态已出版 - 2026
活动19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026 - Rabat, 摩洛哥
期限: 24 3月 202629 3月 2026

出版系列

姓名19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026

会议

会议19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026
国家/地区摩洛哥
Rabat
时期24/03/2629/03/26

指纹

探究 'VORTEXPIA: Indirect Prompt Injection Attack against LLMs for Efficient Extraction of User Privacy' 的科研主题。它们共同构成独一无二的指纹。

引用此