跳到主要导航 跳到搜索 跳到主要内容

A new model stealing defense based on DNN retraining for decision boundary protection

  • Chenlong Zhang
  • , Senlin Luo
  • , Limin Pan*
  • , Dujuan Gu
  • , Jun Yuan
  • *此作品的通讯作者
  • Beijing Institute of Technology
  • Nsfocus Information Technology Co.

科研成果: 期刊稿件文章同行评审

摘要

Deep neural networks (DNNs) are vulnerable to input transformations, posing challenges in thwarting model stealing attacks. Existing methods predominantly analyze the distribution differences of attack samples; however, those based on decision boundary approximation often mimic the distributions of benign samples, thereby circumventing defenses. Furthermore, the addition of deceptive perturbations to the output posterior by complex defense processing modules external to the victim model increases both computational costs and processing latency. In response, this paper proposes a novel training technique named PDB (Protecting Decision Boundaries) that robustly counters model stealing without relying on presumptions about the distribution of attack samples. Instead, PDB secures the primary targets of these attacks— the decision boundaries. It integrates an input gradient penalty into the loss function to displace the decision boundaries away from benign samples. To further enhance protection, samples near these boundaries—referred to as transition samples—are explicitly recategorized into a new, dedicated class. This recategorization is implemented by adding a corresponding neuron to the output layer, thereby fortifying the defense mechanism. Crucially, PDB discards the requirement for complex defense processing modules by employing straightforward mechanisms such as normal prediction processes and selective label flipping for a minimal number of cases. Experimental evidence confirms that PDB surpasses leading methods and marks a pioneering advance in safeguarding decision boundaries against potential breaches.

源语言英语
文章编号133816
期刊Neurocomputing
693
DOI
出版状态已出版 - 7 9月 2026

指纹

探究 'A new model stealing defense based on DNN retraining for decision boundary protection' 的科研主题。它们共同构成独一无二的指纹。

引用此