Optimally deceiving a learning leader in Stackelberg games / Birmpas, G.; Gan, J.; Hollender, A.; Marmolejo-Cossio, F. J.; Rajgopal, N.; Voudouris, A. A. - In: THE JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH. - ISSN 1076-9757. - 72:(2021), pp. 507-531. [10.1613/JAIR.1.12542]
Optimally deceiving a learning leader in Stackelberg games
Birmpas, G. (first author)
2021
Abstract
Recent results have shown that algorithms for learning the optimal commitment in a Stackelberg game are susceptible to manipulation by the follower. These learning algorithms operate by querying the best responses of the follower, who consequently can deceive the algorithm by using fake best responses, typically by responding according to fake payoffs that are different from the actual ones. For this strategic behavior to be successful, the main challenge faced by the follower is to pinpoint the fake payoffs that would make the learning algorithm output a commitment that benefits them the most. While this problem has been considered before, the related literature has only focused on a simple setting where the follower can only choose from a finite set of payoff matrices, thus leaving the general version of the problem unanswered. In this paper, we fill this gap by showing that it is always possible for the follower to efficiently compute (near-)optimal fake payoffs, for various scenarios of learning interaction between the leader and the follower. Our results also establish an interesting connection between the follower's deception and the leader's maximin utility: through deception, the follower can induce almost any (fake) Stackelberg equilibrium if and only if the leader obtains at least their maximin utility in this equilibrium.
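For intuition, the characterization in the abstract's final sentence can be written out formally. The notation below ($A_L$, $A_F$, $u_L$, $\Delta(\cdot)$) is a standard Stackelberg convention that we assume for illustration; it is not taken from the paper itself, and the "almost any" caveat from the abstract still applies.

```latex
% Minimal formal sketch (assumed standard notation, not the paper's own).
% The leader commits to a mixed strategy x over her action set A_L;
% the follower observes x and best-responds with a pure action j in A_F.
% The leader's maximin utility is the payoff she can guarantee even
% against an adversarial follower:
\[
  M \;=\; \max_{x \in \Delta(A_L)} \; \min_{j \in A_F} \; u_L(x, j).
\]
% The abstract's characterization, stated informally: an outcome (x, j)
% can be induced as the Stackelberg equilibrium of some fake follower
% payoffs (for almost any such outcome) if and only if
\[
  u_L(x, j) \;\ge\; M.
\]
```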
File | Size | Format
---|---|---
Birmpas_Optimally_2021.pdf (open access; publisher's version, with the publisher's layout; license: all rights reserved; note: https://dl.acm.org/doi/pdf/10.1613/jair.1.12542) | 379.58 kB | Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.