Helping Language Models Learn More: Multi-Dimensional Task Prompt for Few-shot Tuning

Jinta Weng, Jiarui Zhang, Yue Hu*, Daidong Fa, Xiaofeng Xu, Heyan Huang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Large language models (LLMs) can be used as accessible and intelligent chatbots by constructing natural language queries and directly inputting the prompt into the large language model. However, different prompt' constructions often lead to uncertainty in the answers and thus make it hard to utilize the specific knowledge of LLMs (like ChatGPT). To alleviate this, we use an interpretable structure to explain the prompt learning principle in LLMs, which certificates that the effectiveness of language models is determined by position changes of the task's related tokens. Therefore, we propose MTPrompt, a multi-dimensional task prompt learning method consisting based on task-related object, summary, and task description information. By automatically building and searching for appropriate prompts, our proposed MTPrompt achieves the best results on few-shot samples setting and five different datasets. In addition, we demonstrate the effectiveness and stability of our method in different experimental settings and ablation experiments. In interaction with large language models, embedding more task-related information into prompts will make it easier to stimulate knowledge embedded in large language models.

源语言英语
主期刊名2023 IEEE International Conference on Systems, Man, and Cybernetics
主期刊副标题Improving the Quality of Life, SMC 2023 - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
746-752
页数7
ISBN(电子版)9798350337020
DOI
出版状态已出版 - 2023
活动2023 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2023 - Hybrid, Honolulu, 美国
期限: 1 10月 20234 10月 2023

出版系列

姓名Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
ISSN(印刷版)1062-922X

会议

会议2023 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2023
国家/地区美国
Hybrid, Honolulu
时期1/10/234/10/23

指纹

探究 'Helping Language Models Learn More: Multi-Dimensional Task Prompt for Few-shot Tuning' 的科研主题。它们共同构成独一无二的指纹。

引用此