Catalogo dei prodotti della ricerca

Bilateral trade models the problem of intermediating between two rational agents - a seller and a buyer - both characterized by a private valuation for an item they want to trade. We study the online learning version of the problem, in which at each time step a new seller and buyer arrive and the learner has to set prices for them without any knowledge about their (adversarially generated) valuations. In this setting, known impossibility results rule out the existence of no-regret algorithms when budget balanced has to be enforced at each time step. In this paper, we introduce the notion of global budget balance, which only requires the learner to fulfill budget balance over the entire time horizon. Under this natural relaxation, we provide the first no-regret algorithms for adversarial bilateral trade under various feedback models. First, we show that in the full-feedback model, the learner can guarantee Õ(√T) regret against the best fixed prices in hindsight, and that this bound is optimal up to poly-logarithmic terms. Second, we provide a learning algorithm guaranteeing a Õ(T34) regret upper bound with one-bit feedback, which we complement with a ω(T57) lower bound that holds even in the two-bit feedback model. Finally, we introduce and analyze an alternative benchmark that is provably stronger than the best fixed prices in hindsight and is inspired by the literature on bandits with knapsacks.

No-Regret Learning in Bilateral Trade via Global Budget Balance / Bernasconi, M.; Castiglioni, M.; Celli, A.; Fusco, F.. - (2024), pp. 247-258. ( ACM Symposium on Theory of Computing Vancouver; Canada ) [10.1145/3618260.3649653].

No-Regret Learning in Bilateral Trade via Global Budget Balance

Bernasconi M.;Castiglioni M.;Celli A.;Fusco F.

2024

Abstract

Bilateral trade models the problem of intermediating between two rational agents - a seller and a buyer - both characterized by a private valuation for an item they want to trade. We study the online learning version of the problem, in which at each time step a new seller and buyer arrive and the learner has to set prices for them without any knowledge about their (adversarially generated) valuations. In this setting, known impossibility results rule out the existence of no-regret algorithms when budget balanced has to be enforced at each time step. In this paper, we introduce the notion of global budget balance, which only requires the learner to fulfill budget balance over the entire time horizon. Under this natural relaxation, we provide the first no-regret algorithms for adversarial bilateral trade under various feedback models. First, we show that in the full-feedback model, the learner can guarantee Õ(√T) regret against the best fixed prices in hindsight, and that this bound is optimal up to poly-logarithmic terms. Second, we provide a learning algorithm guaranteeing a Õ(T34) regret upper bound with one-bit feedback, which we complement with a ω(T57) lower bound that holds even in the two-bit feedback model. Finally, we introduce and analyze an alternative benchmark that is provably stronger than the best fixed prices in hindsight and is inspired by the literature on bandits with knapsacks.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2024
			
	Nome convegno
	
				ACM Symposium on Theory of Computing
			
	Parole chiave
	
				Bilateral Trade; Budget Balance; Online Learning; Partial Feedback
			
	Tipologia
	
				04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
			
	Citazione
	
				No-Regret Learning in Bilateral Trade via Global Budget Balance / Bernasconi, M.; Castiglioni, M.; Celli, A.; Fusco, F.. - (2024), pp. 247-258. ( ACM Symposium on Theory of Computing Vancouver; Canada ) [10.1145/3618260.3649653].
			
	Appartiene alla tipologia:
	
				04b Atto di convegno in volume

File allegati a questo prodotto

File	Dimensione	Formato
Bernasconi-preprint-No-regret_2024.pdf accesso aperto Note: https://doi.org/10.1145/3618260.3649653 Tipologia: Documento in Pre-print (manoscritto inviato all'editore, precedente alla peer review) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 624.41 kB Formato Adobe PDF	624.41 kB	Adobe PDF
Bernasconi-No-regret_2024.pdf accesso aperto Note: https://doi.org/10.1145/3618260.3649653 Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza: Creative commons Dimensione 304 kB Formato Adobe PDF	304 kB	Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1717194

Citazioni

ND

10

2

social impact