Improved Regret Bounds for Bandits with Expert Advice

Cesa-Bianchi, Nicolò; Eldowa, Khaled; Esposito, Emmanuel; Olkhovskaya, Julia

Computer Science > Machine Learning

arXiv:2406.16802 (cs)

[Submitted on 24 Jun 2024]

Title:Improved Regret Bounds for Bandits with Expert Advice

Authors:Nicolò Cesa-Bianchi, Khaled Eldowa, Emmanuel Esposito, Julia Olkhovskaya

View PDF HTML (experimental)

Abstract:In this research note, we revisit the bandits with expert advice problem. Under a restricted feedback model, we prove a lower bound of order $\sqrt{K T \ln(N/K)}$ for the worst-case regret, where $K$ is the number of actions, $N>K$ the number of experts, and $T$ the time horizon. This matches a previously known upper bound of the same order and improves upon the best available lower bound of $\sqrt{K T (\ln N) / (\ln K)}$. For the standard feedback model, we prove a new instance-based upper bound that depends on the agreement between the experts and provides a logarithmic improvement compared to prior results.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2406.16802 [cs.LG]
	(or arXiv:2406.16802v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.16802

Submission history

From: Khaled Eldowa [view email]
[v1] Mon, 24 Jun 2024 17:14:31 UTC (31 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2024-06

Change to browse by:

cs
stat
stat.ML

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Improved Regret Bounds for Bandits with Expert Advice

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Improved Regret Bounds for Bandits with Expert Advice

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators