Abstract
An optimal design is usually model-dependent and is sub-optimal if the postulated model is not correctly specified. Furthermore, it is far from ideal even if it is efficient for model selection but has a poor performance for estimating parameters in the selected model. In practice, it is common that a researcher has a list of candidate models at hand and a design has to be found that is efficient for both model discrimination and parameter estimation in the (unknown) “true” model. In this article, we use a multi-armed bandits approach to balance these two competing goals in the design of experiments. We develop a sequential algorithm to provide a design that has asymptotically the same performance as an optimal design when the “true” model could be correctly specified in advance. A lower bound is established to quantify the relative efficiency between the proposed design and an optimal design for the “true” model. Some comparisons with other state-of-the-art algorithms for model discrimination and parameter estimation are discussed. The advantages of the proposed method are illustrated by several numerical examples. Supplementary materials for this article are available online, including a standardized description of the materials available for reproducing the work.
| Original language | English |
|---|---|
| Pages (from-to) | 2709-2722 |
| Number of pages | 14 |
| Journal | Journal of the American Statistical Association |
| Volume | 120 |
| Issue number | 552 |
| DOIs | |
| Publication status | Published - 2025 |
| Externally published | Yes |
Keywords
- Model discrimination
- Optimal design
- Reinforcement learning
- Sequential design
- Thompson sampling
Fingerprint
Dive into the research topics of 'A New Approach to Optimal Design under Model Uncertainty Motivated by Multi-Armed Bandits'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver