Natural Language to SQL: State of the Art and Open Problems

  • Yuyu Luo*
  • , Guoliang Li
  • , Ju Fan
  • , Chengliang Chai
  • , Nan Tang
  • *Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

Abstract

Translating users’ natural language queries (nl) into sql queries (i.e., nl2sql) can significantly reduce barriers to accessing relational databases and support various commercial applications. The performance of nl2sql has been greatly improved with the emergence of large language models (LLMs). In this context, it is crucial to assess our current position, determine the nl2sql solutions that should be adopted for specific scenarios by practitioners, and identify the research topics that researchers should explore next. In this tutorial, we will provide a comprehensive overview of nl2sql techniques, covering every aspect of its lifecycle, from the collection and synthesis of training data, recent advancements in nl2sql translation techniques using LLMs and agents, debugging nl2sql processes, to multi-angle and scenario-based evaluation of nl2sql methods. We conclude by highlighting the research challenges and open problems in nl2sql.

Original languageEnglish
Pages (from-to)5466-5471
Number of pages6
JournalProceedings of the VLDB Endowment
Volume18
Issue number12
DOIs
Publication statusPublished - 2025
Event51st International Conference on Very Large Data Bases, VLDB 2025 - London, United Kingdom
Duration: 1 Sept 20255 Sept 2025

Fingerprint

Dive into the research topics of 'Natural Language to SQL: State of the Art and Open Problems'. Together they form a unique fingerprint.

Cite this