XPROMPT: Exploring the Extreme of Prompt Tuning

Fang Ma; Chen Zhang; Lei Ren; Jingang Wang; Qifan Wang; Wei Wu; Xiaojun Quan; Dawei Song

XPROMPT: Exploring the Extreme of Prompt Tuning

Fang Ma, Chen Zhang, Lei Ren, Jingang Wang^*, Qifan Wang, Wei Wu, Xiaojun Quan, Dawei Song^*

^*Corresponding author for this work

School of Computer Science and Technology

Research output: Contribution to conference › Paper › peer-review

14 Citations (Scopus)

Abstract

Prompt tuning learns soft prompts to condition the frozen Pre-trained Language Models (PLMs) for performing downstream tasks in a parameter-efficient manner. While prompt tuning has gradually reached the performance level of fine-tuning as the model scale increases, there is still a large performance gap between prompt tuning and fine-tuning for models of moderate and small scales (typically less than 11B parameters). In this paper, we empirically show that the trained prompt tokens can have a negative impact on a downstream task and thus degrade its performance. To bridge the gap, we propose a novel PROMPT tuning model with an eXtremely small scale (XPROMPT) under the regime of lottery tickets hypothesis. Specifically, XPROMPT eliminates the negative prompt tokens at different granularity levels through a hierarchical structured pruning, yielding a more parameter-efficient prompt yet with a competitive performance. Comprehensive experiments are carried out on the SuperGLUE tasks, and the results indicate that XPROMPT is able to close the performance gap at smaller model scales.

Original language	English
Pages	11033-11047
Number of pages	15
Publication status	Published - 2022
Event	2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022 - Abu Dhabi, United Arab Emirates Duration: 7 Dec 2022 → 11 Dec 2022

Conference

Conference	2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022
Country/Territory	United Arab Emirates
City	Abu Dhabi
Period	7/12/22 → 11/12/22

Cite this

Ma, F., Zhang, C., Ren, L., Wang, J., Wang, Q., Wu, W., Quan, X., & Song, D. (2022). XPROMPT: Exploring the Extreme of Prompt Tuning. 11033-11047. Paper presented at 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022, Abu Dhabi, United Arab Emirates.

@conference{f90a3ea2107b4557b8c76104bf7afaf2,

title = "XPROMPT: Exploring the Extreme of Prompt Tuning",

abstract = "Prompt tuning learns soft prompts to condition the frozen Pre-trained Language Models (PLMs) for performing downstream tasks in a parameter-efficient manner. While prompt tuning has gradually reached the performance level of fine-tuning as the model scale increases, there is still a large performance gap between prompt tuning and fine-tuning for models of moderate and small scales (typically less than 11B parameters). In this paper, we empirically show that the trained prompt tokens can have a negative impact on a downstream task and thus degrade its performance. To bridge the gap, we propose a novel PROMPT tuning model with an eXtremely small scale (XPROMPT) under the regime of lottery tickets hypothesis. Specifically, XPROMPT eliminates the negative prompt tokens at different granularity levels through a hierarchical structured pruning, yielding a more parameter-efficient prompt yet with a competitive performance. Comprehensive experiments are carried out on the SuperGLUE tasks, and the results indicate that XPROMPT is able to close the performance gap at smaller model scales.",

author = "Fang Ma and Chen Zhang and Lei Ren and Jingang Wang and Qifan Wang and Wei Wu and Xiaojun Quan and Dawei Song",

note = "Publisher Copyright: {\textcopyright} 2022 Association for Computational Linguistics.; 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022 ; Conference date: 07-12-2022 Through 11-12-2022",

year = "2022",

language = "English",

pages = "11033--11047",

}

TY - CONF

T1 - XPROMPT

T2 - 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022

AU - Ma, Fang

AU - Zhang, Chen

AU - Ren, Lei

AU - Wang, Jingang

AU - Wang, Qifan

AU - Wu, Wei

AU - Quan, Xiaojun

AU - Song, Dawei

PY - 2022

Y1 - 2022

N2 - Prompt tuning learns soft prompts to condition the frozen Pre-trained Language Models (PLMs) for performing downstream tasks in a parameter-efficient manner. While prompt tuning has gradually reached the performance level of fine-tuning as the model scale increases, there is still a large performance gap between prompt tuning and fine-tuning for models of moderate and small scales (typically less than 11B parameters). In this paper, we empirically show that the trained prompt tokens can have a negative impact on a downstream task and thus degrade its performance. To bridge the gap, we propose a novel PROMPT tuning model with an eXtremely small scale (XPROMPT) under the regime of lottery tickets hypothesis. Specifically, XPROMPT eliminates the negative prompt tokens at different granularity levels through a hierarchical structured pruning, yielding a more parameter-efficient prompt yet with a competitive performance. Comprehensive experiments are carried out on the SuperGLUE tasks, and the results indicate that XPROMPT is able to close the performance gap at smaller model scales.

AB - Prompt tuning learns soft prompts to condition the frozen Pre-trained Language Models (PLMs) for performing downstream tasks in a parameter-efficient manner. While prompt tuning has gradually reached the performance level of fine-tuning as the model scale increases, there is still a large performance gap between prompt tuning and fine-tuning for models of moderate and small scales (typically less than 11B parameters). In this paper, we empirically show that the trained prompt tokens can have a negative impact on a downstream task and thus degrade its performance. To bridge the gap, we propose a novel PROMPT tuning model with an eXtremely small scale (XPROMPT) under the regime of lottery tickets hypothesis. Specifically, XPROMPT eliminates the negative prompt tokens at different granularity levels through a hierarchical structured pruning, yielding a more parameter-efficient prompt yet with a competitive performance. Comprehensive experiments are carried out on the SuperGLUE tasks, and the results indicate that XPROMPT is able to close the performance gap at smaller model scales.

UR - http://www.scopus.com/inward/record.url?scp=85149434600&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:85149434600

SP - 11033

EP - 11047

Y2 - 7 December 2022 through 11 December 2022

ER -

XPROMPT: Exploring the Extreme of Prompt Tuning

Abstract

Conference

Other files and links

Fingerprint

Cite this