How Proficient Are Large Language Models in Formal Languages? An In-Depth Insight for Knowledge Base Question Answering

Jinxin Liu; Shulin Cao; Jiaxin Shi; Tingjian Zhang; Lunyiu Nie; Linmei Hu; Lei Hou; Juanzi Li

How Proficient Are Large Language Models in Formal Languages? An In-Depth Insight for Knowledge Base Question Answering

Jinxin Liu, Shulin Cao, Jiaxin Shi, Tingjian Zhang, Lunyiu Nie, Linmei Hu, Lei Hou^*, Juanzi Li

^*此作品的通讯作者

计算机学院

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

摘要

Knowledge Base Question Answering (KBQA) aims to answer natural language questions based on facts in knowledge bases. A typical approach to KBQA is semantic parsing, which translates a question into an executable logical form in a formal language. Recent works leverage the capabilities of large language models (LLMs) for logical form generation to improve performance. However, although it is validated that LLMs are capable of solving some KBQA problems, there has been little discussion on the differences in LLMs' proficiency in formal languages used in semantic parsing. In this work, we propose to evaluate the understanding and generation ability of LLMs to deal with differently structured logical forms by examining the inter-conversion of natural and formal language through in-context learning of LLMs. Extensive experiments with models of different sizes show that state-of-the-art LLMs can understand formal languages as well as humans, but generating correct logical forms given a few examples remains a challenge. Most importantly, our results also indicate that LLMs exhibit considerable sensitivity. In general, the formal language with a lower formalization level, i.e., the more similar it is to natural language, is more friendly to LLMs. Code and data can be found at https://github.com/Matthewlliu/structure_probe.

源语言	英语
主期刊名	62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Proceedings of the Conference
编辑	Lun-Wei Ku, Andre Martins, Vivek Srikumar
出版商	Association for Computational Linguistics (ACL)
页	792-815
页数	24
ISBN（电子版）	9798891760998
出版状态	已出版 - 2024
活动	Findings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Hybrid, Bangkok, 泰国期限: 11 8月 2024 → 16 8月 2024

出版系列

姓名	Proceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN（印刷版）	0736-587X

会议

会议	Findings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024
国家/地区	泰国
市	Hybrid, Bangkok
时期	11/08/24 → 16/08/24

其它文件与链接

链接到 Scopus 的出版物

引用此

Liu, J., Cao, S., Shi, J., Zhang, T., Nie, L., Hu, L., Hou, L., & Li, J. (2024). How Proficient Are Large Language Models in Formal Languages? An In-Depth Insight for Knowledge Base Question Answering. 在 L.-W. Ku, A. Martins, & V. Srikumar (编辑), 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Proceedings of the Conference (页码 792-815). (Proceedings of the Annual Meeting of the Association for Computational Linguistics). Association for Computational Linguistics (ACL).

Liu, Jinxin ; Cao, Shulin ; Shi, Jiaxin 等. / How Proficient Are Large Language Models in Formal Languages? An In-Depth Insight for Knowledge Base Question Answering. 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Proceedings of the Conference. 编辑 / Lun-Wei Ku ; Andre Martins ; Vivek Srikumar. Association for Computational Linguistics (ACL), 2024. 页码 792-815 (Proceedings of the Annual Meeting of the Association for Computational Linguistics).

@inproceedings{f0713c89c09a4039a27ef4d738f6ae9b,

title = "How Proficient Are Large Language Models in Formal Languages? An In-Depth Insight for Knowledge Base Question Answering",

abstract = "Knowledge Base Question Answering (KBQA) aims to answer natural language questions based on facts in knowledge bases. A typical approach to KBQA is semantic parsing, which translates a question into an executable logical form in a formal language. Recent works leverage the capabilities of large language models (LLMs) for logical form generation to improve performance. However, although it is validated that LLMs are capable of solving some KBQA problems, there has been little discussion on the differences in LLMs' proficiency in formal languages used in semantic parsing. In this work, we propose to evaluate the understanding and generation ability of LLMs to deal with differently structured logical forms by examining the inter-conversion of natural and formal language through in-context learning of LLMs. Extensive experiments with models of different sizes show that state-of-the-art LLMs can understand formal languages as well as humans, but generating correct logical forms given a few examples remains a challenge. Most importantly, our results also indicate that LLMs exhibit considerable sensitivity. In general, the formal language with a lower formalization level, i.e., the more similar it is to natural language, is more friendly to LLMs. Code and data can be found at https://github.com/Matthewlliu/structure_probe.",

author = "Jinxin Liu and Shulin Cao and Jiaxin Shi and Tingjian Zhang and Lunyiu Nie and Linmei Hu and Lei Hou and Juanzi Li",

note = "Publisher Copyright: {\textcopyright} 2024 Association for Computational Linguistics.; Findings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 ; Conference date: 11-08-2024 Through 16-08-2024",

year = "2024",

language = "English",

series = "Proceedings of the Annual Meeting of the Association for Computational Linguistics",

publisher = "Association for Computational Linguistics (ACL)",

pages = "792--815",

editor = "Lun-Wei Ku and Andre Martins and Vivek Srikumar",

booktitle = "62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Proceedings of the Conference",

address = "United States",

}

Liu, J, Cao, S, Shi, J, Zhang, T, Nie, L, Hu, L, Hou, L & Li, J 2024, How Proficient Are Large Language Models in Formal Languages? An In-Depth Insight for Knowledge Base Question Answering. 在 L-W Ku, A Martins & V Srikumar (编辑), 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Proceedings of the Conference. Proceedings of the Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics (ACL), 页码 792-815, Findings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024, Hybrid, Bangkok, 泰国, 11/08/24.

How Proficient Are Large Language Models in Formal Languages? An In-Depth Insight for Knowledge Base Question Answering. / Liu, Jinxin; Cao, Shulin; Shi, Jiaxin 等.
62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Proceedings of the Conference. 编辑 / Lun-Wei Ku; Andre Martins; Vivek Srikumar. Association for Computational Linguistics (ACL), 2024. 页码 792-815 (Proceedings of the Annual Meeting of the Association for Computational Linguistics).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - How Proficient Are Large Language Models in Formal Languages? An In-Depth Insight for Knowledge Base Question Answering

AU - Liu, Jinxin

AU - Cao, Shulin

AU - Shi, Jiaxin

AU - Zhang, Tingjian

AU - Nie, Lunyiu

AU - Hu, Linmei

AU - Hou, Lei

AU - Li, Juanzi

PY - 2024

Y1 - 2024

N2 - Knowledge Base Question Answering (KBQA) aims to answer natural language questions based on facts in knowledge bases. A typical approach to KBQA is semantic parsing, which translates a question into an executable logical form in a formal language. Recent works leverage the capabilities of large language models (LLMs) for logical form generation to improve performance. However, although it is validated that LLMs are capable of solving some KBQA problems, there has been little discussion on the differences in LLMs' proficiency in formal languages used in semantic parsing. In this work, we propose to evaluate the understanding and generation ability of LLMs to deal with differently structured logical forms by examining the inter-conversion of natural and formal language through in-context learning of LLMs. Extensive experiments with models of different sizes show that state-of-the-art LLMs can understand formal languages as well as humans, but generating correct logical forms given a few examples remains a challenge. Most importantly, our results also indicate that LLMs exhibit considerable sensitivity. In general, the formal language with a lower formalization level, i.e., the more similar it is to natural language, is more friendly to LLMs. Code and data can be found at https://github.com/Matthewlliu/structure_probe.

AB - Knowledge Base Question Answering (KBQA) aims to answer natural language questions based on facts in knowledge bases. A typical approach to KBQA is semantic parsing, which translates a question into an executable logical form in a formal language. Recent works leverage the capabilities of large language models (LLMs) for logical form generation to improve performance. However, although it is validated that LLMs are capable of solving some KBQA problems, there has been little discussion on the differences in LLMs' proficiency in formal languages used in semantic parsing. In this work, we propose to evaluate the understanding and generation ability of LLMs to deal with differently structured logical forms by examining the inter-conversion of natural and formal language through in-context learning of LLMs. Extensive experiments with models of different sizes show that state-of-the-art LLMs can understand formal languages as well as humans, but generating correct logical forms given a few examples remains a challenge. Most importantly, our results also indicate that LLMs exhibit considerable sensitivity. In general, the formal language with a lower formalization level, i.e., the more similar it is to natural language, is more friendly to LLMs. Code and data can be found at https://github.com/Matthewlliu/structure_probe.

UR - http://www.scopus.com/inward/record.url?scp=85205317289&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85205317289

T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics

SP - 792

EP - 815

BT - 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Proceedings of the Conference

A2 - Ku, Lun-Wei

A2 - Martins, Andre

A2 - Srikumar, Vivek

PB - Association for Computational Linguistics (ACL)

T2 - Findings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024

Y2 - 11 August 2024 through 16 August 2024

ER -

Liu J, Cao S, Shi J, Zhang T, Nie L, Hu L 等. How Proficient Are Large Language Models in Formal Languages? An In-Depth Insight for Knowledge Base Question Answering. 在 Ku LW, Martins A, Srikumar V, 编辑, 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Proceedings of the Conference. Association for Computational Linguistics (ACL). 2024. 页码 792-815. (Proceedings of the Annual Meeting of the Association for Computational Linguistics).

How Proficient Are Large Language Models in Formal Languages? An In-Depth Insight for Knowledge Base Question Answering

摘要

出版系列

会议

其它文件与链接

指纹

引用此