On the Peril of (Even a Little) Nonstationarity in Satisficing Regret Minimization
Authors
Yixuan Zhang, Ruihao Zhu, Qiaomin Xie
Journal
No journal information available
Year
2026
Country
India
Abstract
Motivated by the principle of satisficing in decision-making, we study satisficing regret guarantees for nonstationary $K$-armed bandits. We show that in the general realizable, piecewise-stationary setting with $L$ stationary segments, the optimal regret is $\Theta(L\log T)$ as long as $L\geq 2$. This stands in sharp contrast to the case of $L=1$ (i.e., the stationary setting), where a $T$-independent $\Theta(1)$ satisficing regret is achievable under realizability. In other words, the optimal regret has to scale with $T$ even if just a little nonstationarity is present. A key ingredient in our analysis is a novel Fano-based framework tailored to nonstationary bandits via a \emph{post-interaction reference} construction. This framework strictly extends the classical Fano method for passive estimation as well as recent interactive Fano techniques for stationary bandits. As a complement, we also discuss a special regime in which constant satisficing regret is again possible.