TY - JOUR
T1 - Empirical validation of race-neutral normative brain morphometry models across ethnoracially diverse populations
AU - Ge, Ruiyang
AU - Yu, Yuetong
AU - New, Faye
AU - Haas, Shalaila S.
AU - Sanford, Nicole
AU - Yu, Kevin
AU - Allen, Paul
AU - Arslan, Seda
AU - Avram, Mihai
AU - Borgwardt, Stefan
AU - Crossley, Nicolas A.
AU - de la Fuente-Sandoval, Camilo
AU - Fukunaga, Masaki
AU - Gao, Jia Hong
AU - Gonzalez-Valderrama, Alfonso
AU - Hashimoto, Ryota
AU - Iasevoli, Felice
AU - Keeser, Daniel
AU - Kubat, Kader
AU - Kumari, Veena
AU - Matsumoto, Junya
AU - Mehta, Urvakhsh M.
AU - Nemoto, Kiyotaka
AU - Pontillo, Giuseppe
AU - Raabe, Florian J.
AU - Reyes-Madrigal, Francisco
AU - Roy, Neelabja
AU - Şahin-Çevik, Didenur
AU - Sahin-Ilikoglu, Tuba
AU - Toulopoulou, Timothea
AU - Wagner, Elias
AU - Yang, Guoyuan
AU - Zurita, Mariana
AU - Thompson, Paul M.
AU - Frangou, Sophia
N1 - Publisher Copyright:
Copyright © 2026 the Author(s).
PY - 2026/5/19
Y1 - 2026/5/19
N2 - Normative models of brain morphometry quantify individual deviations from typical anatomical patterns and hold promise for enhancing clinical decision-making. However, their clinical utility depends critically on demonstrating generalizability across diverse ethnoracial populations. We previously developed sex-specific, race-neutral normative models for cortical thickness, surface area, and subcortical volumes using brain scans from a large international sample of healthy individuals, as part of the CentileBrain Project, a global initiative to provide open-access, neuroimaging reference models. The primary aim of the present study was to empirically evaluate the generalizability and accuracy of these pretrained models across multiple ethnoracial groups. To this end, we tested model performance in independent samples of healthy individuals from Africa, Asia, Europe, and the Americas, with ethnoracial classification defined either by self-identification or genetic ancestry (N = 4,862). We further compared performance against normative models developed exclusively from a single-population Chinese cohort. Across all groups, as well as in the pooled sample, the pretrained CentileBrain models demonstrated consistently high accuracy, with relative mean absolute error values below 10% for subcortical volume and surface area and below 5% for cortical thickness. Model performance was highly concordant across self-identified and ancestry-defined groups. In a separate analysis, the CentileBrain models performed comparably to a population-specific model when applied to an independent ancestry-matched sample. These findings provide empirical support for the generalizability of race-neutral normative models developed on large and diverse samples and underscore their potential utility for individualized neuroimaging assessment across ethnoracially diverse populations.
AB - Normative models of brain morphometry quantify individual deviations from typical anatomical patterns and hold promise for enhancing clinical decision-making. However, their clinical utility depends critically on demonstrating generalizability across diverse ethnoracial populations. We previously developed sex-specific, race-neutral normative models for cortical thickness, surface area, and subcortical volumes using brain scans from a large international sample of healthy individuals, as part of the CentileBrain Project, a global initiative to provide open-access, neuroimaging reference models. The primary aim of the present study was to empirically evaluate the generalizability and accuracy of these pretrained models across multiple ethnoracial groups. To this end, we tested model performance in independent samples of healthy individuals from Africa, Asia, Europe, and the Americas, with ethnoracial classification defined either by self-identification or genetic ancestry (N = 4,862). We further compared performance against normative models developed exclusively from a single-population Chinese cohort. Across all groups, as well as in the pooled sample, the pretrained CentileBrain models demonstrated consistently high accuracy, with relative mean absolute error values below 10% for subcortical volume and surface area and below 5% for cortical thickness. Model performance was highly concordant across self-identified and ancestry-defined groups. In a separate analysis, the CentileBrain models performed comparably to a population-specific model when applied to an independent ancestry-matched sample. These findings provide empirical support for the generalizability of race-neutral normative models developed on large and diverse samples and underscore their potential utility for individualized neuroimaging assessment across ethnoracially diverse populations.
KW - brain morphometry
KW - human
KW - neuroimaging
KW - normative models
UR - https://www.scopus.com/pages/publications/105039054243
U2 - 10.1073/pnas.2521055123
DO - 10.1073/pnas.2521055123
M3 - Article
C2 - 42118844
AN - SCOPUS:105039054243
SN - 0027-8424
VL - 123
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 20
M1 - e2521055123
ER -