Nonstochastic Contextual Combinatorial Bandits

Zierahn, L.; Van Der Hoeven, D.; Cesa-Bianchi, N.; Neu, G.

We study a contextual version of online combinatorial optimisation with full and semi-bandit feedback. In this sequential decision-making problem, an online learner has to select an action from a combinatorial decision space after seeing a vector-valued context in each round. As a result of its action, the learner incurs a loss that is a bi-linear function of the context vector and the vector representation of the chosen action. We consider two natural versions of the problem: semi-bandit where the losses are revealed for each component appearing in the learner's combinatorial action, and full-bandit where only the total loss is observed. We design computationally efficient algorithms based on a new loss estimator that takes advantage of the special structure of the problem, and show regret bounds order √T with respect to the time horizon. The bounds demonstrate polynomial scaling with the relevant problem parameters which is shown to be nearly optimal. The theoretical results are complemented by a set of experiments on simulated data.

Nonstochastic Contextual Combinatorial Bandits / Zierahn, L.; van der Hoeven, D.; Cesa-Bianchi, N.; Neu, G.. - 206:(2023), pp. 8771-8813. ( 26th International Conference on Artificial Intelligence and Statistics, AISTATS 2023 Valencia (ESP) 25-27 April 2023).

Nonstochastic Contextual Combinatorial Bandits

Zierahn L.;van der Hoeven D.;Cesa-Bianchi N.;Neu G.

2023

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2023
			
	Titolo della Serie/Collana
	
				PROCEEDINGS OF MACHINE LEARNING RESEARCH
			
	Appare nelle tipologie
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
zierahn23a.pdf accesso aperto Tipologia: 2a Post-print versione editoriale / Version of Record Licenza: Pubblico - Tutti i diritti riservati Dimensione 1.03 MB Formato Adobe PDF Visualizza/Apri	1.03 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2982371

PORTO @ Archivio Istituzionale della Ricerca

Nonstochastic Contextual Combinatorial Bandits

Zierahn L.;van der Hoeven D.;Cesa-Bianchi N.;Neu G.

2023

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)