TY - JOUR
T1 - Leveraging LLMs for Environmental Complexity
T2 - Structured Fine-Tuning Data Sets and Deployment Strategies
AU - Chen, Chuke
AU - Li, Nan
AU - Qi, Jianchuan
AU - Chang, Huimin
AU - Shi, Wenjie
AU - Xie, Jinliang
AU - Yuan, Jiayi
AU - Yang, Hang
AU - Guo, Jing
AU - Xu, Changqing
AU - Xu, Ming
PY - 2026/1/13
Y1 - 2026/1/13
N2 - Generative artificial intelligence, especially large language models (LLMs), could accelerate environmental analysis, but deployment is hindered by two gaps: limited structured domain knowledge and unclear strategies matched to environmental decision contexts. Here, this study constructs a textbook-based, China-centered environmental knowledge data set with hierarchical organization to enable reliable fine-tuning and benchmarking. Results show a consistent trade-off that fine-tuned models achieve modest gains in precision (+1%) and response efficiency (+52%) on standardized tasks but exhibit limited adaptability when embedded in agentic workflows (-3%). In contrast, state-of-the-art generalist models consistently outperform in system-level sustainability and interdisciplinary decision tasks (+10%), benefiting from stronger cross-domain reasoning and dynamic tool integration. Together, these findings support a layered LLMs' deployment strategy for environmental intelligence. Specifically, selective fine-tuning for stable, regulatory, and verification tasks, combined with agentic workflows anchored in up-to-date generalist backbone models for dynamic, data-intensive, and interdisciplinary decision-making. This work provides both a reusable data set foundation and a practical framework for deploying LLMs as scalable and reliable decision-support tools in environmental decision.
AB - Generative artificial intelligence, especially large language models (LLMs), could accelerate environmental analysis, but deployment is hindered by two gaps: limited structured domain knowledge and unclear strategies matched to environmental decision contexts. Here, this study constructs a textbook-based, China-centered environmental knowledge data set with hierarchical organization to enable reliable fine-tuning and benchmarking. Results show a consistent trade-off that fine-tuned models achieve modest gains in precision (+1%) and response efficiency (+52%) on standardized tasks but exhibit limited adaptability when embedded in agentic workflows (-3%). In contrast, state-of-the-art generalist models consistently outperform in system-level sustainability and interdisciplinary decision tasks (+10%), benefiting from stronger cross-domain reasoning and dynamic tool integration. Together, these findings support a layered LLMs' deployment strategy for environmental intelligence. Specifically, selective fine-tuning for stable, regulatory, and verification tasks, combined with agentic workflows anchored in up-to-date generalist backbone models for dynamic, data-intensive, and interdisciplinary decision-making. This work provides both a reusable data set foundation and a practical framework for deploying LLMs as scalable and reliable decision-support tools in environmental decision.
KW - agentic workflows
KW - environmental complexity
KW - fine-tuning
KW - generative artificial intelligence
KW - large language models
UR - https://www.scopus.com/pages/publications/105027411419
U2 - 10.1021/acs.est.5c09526
DO - 10.1021/acs.est.5c09526
M3 - Article
C2 - 41476344
AN - SCOPUS:105027411419
SN - 0013-936X
VL - 60
SP - 497
EP - 509
JO - Environmental Science and Technology
JF - Environmental Science and Technology
IS - 1
ER -