Using bandit algorithms to design response-adaptive trials can optimize participant outcomes, but poses major challenges for statistical inference. Recent attempts to address these challenges typically impose restrictions on the exploitative nature of the bandit algorithm and require large sample sizes to ensure asymptotic guarantees. However, large experiments generally follow a successful pilot study, which is tightly constrained in its size or duration. In this work, we tackle the problem of hypothesis testing in finite samples. We illustrate an innovative hypothesis testing procedure, uniquely based on the allocation probabilities of the bandit algorithm, and theoretically characterise it when applied to Thompson sampling.

Finite-Sample Inference in Response-Adaptive Designs: An Application to Thompson Sampling / Deliu, Nina; Williams, Joseph J.; Villar, Sofia S.. - (2025), pp. 393-398. (Intervento presentato al convegno 52nd Scientific Meeting of the Italian Statistical Society (SIS 2024) tenutosi a Bari; Italy) [10.1007/978-3-031-64350-7_66].

Finite-Sample Inference in Response-Adaptive Designs: An Application to Thompson Sampling

Deliu, Nina
Primo
Methodology
;
2025

Abstract

Using bandit algorithms to design response-adaptive trials can optimize participant outcomes, but poses major challenges for statistical inference. Recent attempts to address these challenges typically impose restrictions on the exploitative nature of the bandit algorithm and require large sample sizes to ensure asymptotic guarantees. However, large experiments generally follow a successful pilot study, which is tightly constrained in its size or duration. In this work, we tackle the problem of hypothesis testing in finite samples. We illustrate an innovative hypothesis testing procedure, uniquely based on the allocation probabilities of the bandit algorithm, and theoretically characterise it when applied to Thompson sampling.
2025
52nd Scientific Meeting of the Italian Statistical Society (SIS 2024)
Adaptive designs; Thompson sampling; multi-armed bandits
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Finite-Sample Inference in Response-Adaptive Designs: An Application to Thompson Sampling / Deliu, Nina; Williams, Joseph J.; Villar, Sofia S.. - (2025), pp. 393-398. (Intervento presentato al convegno 52nd Scientific Meeting of the Italian Statistical Society (SIS 2024) tenutosi a Bari; Italy) [10.1007/978-3-031-64350-7_66].
File allegati a questo prodotto
File Dimensione Formato  
Frontmatter.pdf

solo gestori archivio

Note: Front matter
Tipologia: Altro materiale allegato
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 495.18 kB
Formato Adobe PDF
495.18 kB Adobe PDF   Contatta l'autore
Deliu_Finite-sample inference_2025.pdf

solo gestori archivio

Note: Deliu_SIS24
Tipologia: Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 280.33 kB
Formato Adobe PDF
280.33 kB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1735480
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact