Skip to main navigation Skip to search Skip to main content

VORTEXPIA: Indirect Prompt Injection Attack against LLMs for Efficient Extraction of User Privacy

  • Yu Cui
  • , Sicheng Pan
  • , Yifei Liu
  • , Haibin Zhang*
  • , Cong Zuo*
  • *Corresponding author for this work
  • Beijing Institute of Technology
  • Tsinghua University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Large language models (LLMs) have been widely deployed in Conversational AIs (CAIs), while exposing privacy and security threats. Recent research shows that LLM-based CAIs can be manipulated to extract private information from human users, posing serious security threats. However, the methods proposed in that study rely on a white-box setting that adversaries can directly modify the system prompt. This condition is unlikely to hold in real-world deployments. The limitation raises a critical question: can unprivileged attackers still induce such privacy risks in practical LLM-integrated applications? To address this question, we propose VORTEXPIA, a novel indirect prompt injection attack that induces privacy extraction in LLM-integrated applications under black-box settings. By injecting token-efficient data containing false memories, VORTEXPIA misleads LLMs to actively request private information in batches. Unlike prior methods, VORTEXPIA allows attackers to flexibly define multiple categories of sensitive data. We evaluate VORTEXPIA on six LLMs, covering both traditional and reasoning LLMs, across four benchmark datasets. The results show that VORTEXPIA significantly outperforms baselines and achieves state-of-the-art (SOTA) performance. It also demonstrates efficient privacy requests, reduced token consumption, and enhanced robustness against defense mechanisms. We further validate VORTEXPIA on multiple realistic open-source LLM-integrated applications, demonstrating its practical effectiveness. Our code is available at https://github.com/cuiyu-ai/VortexPIA.

Original languageEnglish
Title of host publication19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026
PublisherAssociation for Computational Linguistics (ACL)
Pages587-609
Number of pages23
ISBN (Electronic)9798891763869
DOIs
Publication statusPublished - 2026
Event19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026 - Rabat, Morocco
Duration: 24 Mar 202629 Mar 2026

Publication series

Name19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026

Conference

Conference19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026
Country/TerritoryMorocco
CityRabat
Period24/03/2629/03/26

Fingerprint

Dive into the research topics of 'VORTEXPIA: Indirect Prompt Injection Attack against LLMs for Efficient Extraction of User Privacy'. Together they form a unique fingerprint.

Cite this