Competition for shareable and limited resources has long been studied with strategic agents. In reality, agents often have to learn the rewards of the resources and maximize them at the same time. To design an individualized competing policy, we model the competition between agents in a novel multi-player multi-armed bandit (MPMAB) setting where players are selfish and aim to maximize their own rewards. In addition, when several players pull the same arm, we assume that these players share the arm's reward equally in expectation. Under this setting, we first analyze the Nash equilibrium when the arms' rewards are known. Subsequently, we propose a novel Selfish MPMAB with Averaging Allocation (SMAA) approach based on the equilibrium. We theoretically demonstrate that SMAA achieves a good regret guarantee for each player when all players follow the algorithm. Additionally, we establish that no single selfish player can significantly increase their rewards through deviation, nor can they detrimentally affect other players' rewards without incurring substantial losses for themselves. We finally validate the effectiveness of the method in extensive synthetic experiments.
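As a rough illustration of the averaging-allocation setting with known rewards, the sketch below assumes that when m players pull an arm with mean reward mu, each receives mu / m in expectation. It places players one at a time on the arm offering the highest per-player share and then checks that no single player gains by switching arms. The greedy procedure and the names `greedy_equilibrium` and `is_pure_nash` are assumptions made for this example; this is not the SMAA algorithm from the paper, which additionally has to learn the rewards.

```python
from typing import List


def greedy_equilibrium(means: List[float], n_players: int) -> List[int]:
    """Assign players one by one to the arm with the best per-player share.

    means[k] is the (assumed known) expected reward of arm k. Returns the
    number of players on each arm. Hypothetical helper for illustration.
    """
    counts = [0] * len(means)
    for _ in range(n_players):
        # Share a newcomer would receive on arm k: means[k] / (counts[k] + 1).
        best = max(range(len(means)), key=lambda k: means[k] / (counts[k] + 1))
        counts[best] += 1
    return counts


def is_pure_nash(means: List[float], counts: List[int], tol: float = 1e-12) -> bool:
    """Check that no player can raise their expected share by moving alone."""
    for j, m_j in enumerate(counts):
        if m_j == 0:
            continue  # nobody on this arm, so nobody can deviate from it
        current_share = means[j] / m_j
        for k, m_k in enumerate(counts):
            # Moving to arm k would make the deviator the (m_k + 1)-th player there.
            if k != j and means[k] / (m_k + 1) > current_share + tol:
                return False
    return True


if __name__ == "__main__":
    arm_means = [0.9, 0.5, 0.2]
    allocation = greedy_equilibrium(arm_means, n_players=3)
    print(allocation, is_pure_nash(arm_means, allocation))
```

In this toy instance two players end up sharing the best arm and one takes the second arm, while the weakest arm stays empty because its full reward is lower than a half share of the best arm.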
Author Information
Renzhe Xu (Tsinghua University)
Haotian Wang
Xingxuan Zhang (Tsinghua University)
Bo Li (Tsinghua University)
Peng Cui (Tsinghua University)
Peng Cui is an Associate Professor at Tsinghua University. He received his PhD degree from Tsinghua University in 2010. His research interests include causal inference and stable learning, network representation learning, and human behavioral modeling. He has published more than 100 papers in prestigious conferences and journals in data mining and multimedia. His recent research won the IEEE Multimedia Best Department Paper Award, SIGKDD 2016 Best Paper Finalist, ICDM 2015 Best Student Paper Award, SIGKDD 2014 Best Paper Finalist, IEEE ICME 2014 Best Paper Award, ACM MM12 Grand Challenge Multimodal Award, and MMM13 Best Paper Award. He is an Associate Editor of IEEE TKDE, IEEE TBD, ACM TIST, and ACM TOMM, among others. He has served as program co-chair and area chair of several major machine learning and artificial intelligence conferences, such as IJCAI, AAAI, ACM CIKM, and ACM Multimedia.
More from the Same Authors
- 2023 Poster: Propensity Matters: Measuring and Enhancing Balancing for Recommendation
  Haoxuan Li · Yanghao Xiao · Chunyuan Zheng · Peng Wu · Peng Cui
- 2023 Poster: Stable Estimation of Heterogeneous Treatment Effects
  Anpeng Wu · Kun Kuang · Ruoxuan Xiong · Bo Li · Fei Wu
- 2023 Poster: Provably Invariant Learning without Domain Information
  Xiaoyu Tan · Yong LIN · Shengyu Zhu · Chao Qu · Xihe Qiu · Xu Yinghui · Peng Cui · Yuan Qi
- 2022 Poster: Counterfactual Prediction for Outcome-Oriented Treatments
  Hao Zou · Bo Li · Jiangang Han · Shuiping Chen · Xuetao Ding · Peng Cui
- 2022 Spotlight: Counterfactual Prediction for Outcome-Oriented Treatments
  Hao Zou · Bo Li · Jiangang Han · Shuiping Chen · Xuetao Ding · Peng Cui
- 2022 Poster: A Theoretical Analysis on Independence-driven Importance Weighting for Covariate-shift Generalization
  Renzhe Xu · Xingxuan Zhang · Zheyan Shen · Tong Zhang · Peng Cui
- 2022 Poster: Instrumental Variable Regression with Confounder Balancing
  Anpeng Wu · Kun Kuang · Bo Li · Fei Wu
- 2022 Poster: Model Agnostic Sample Reweighting for Out-of-Distribution Learning
  Xiao Zhou · Yong LIN · Renjie Pi · Weizhong Zhang · Renzhe Xu · Peng Cui · Tong Zhang
- 2022 Spotlight: Instrumental Variable Regression with Confounder Balancing
  Anpeng Wu · Kun Kuang · Bo Li · Fei Wu
- 2022 Spotlight: A Theoretical Analysis on Independence-driven Importance Weighting for Covariate-shift Generalization
  Renzhe Xu · Xingxuan Zhang · Zheyan Shen · Tong Zhang · Peng Cui
- 2022 Spotlight: Model Agnostic Sample Reweighting for Out-of-Distribution Learning
  Xiao Zhou · Yong LIN · Renjie Pi · Weizhong Zhang · Renzhe Xu · Peng Cui · Tong Zhang
- 2021 Poster: Heterogeneous Risk Minimization
  Jiashuo Liu · Zheyuan Hu · Peng Cui · Bo Li · Zheyan Shen
- 2021 Spotlight: Heterogeneous Risk Minimization
  Jiashuo Liu · Zheyuan Hu · Peng Cui · Bo Li · Zheyan Shen
- 2019 Poster: Disentangled Graph Convolutional Networks
  Jianxin Ma · Peng Cui · Kun Kuang · Xin Wang · Wenwu Zhu
- 2019 Oral: Disentangled Graph Convolutional Networks
  Jianxin Ma · Peng Cui · Kun Kuang · Xin Wang · Wenwu Zhu
- 2019 Tutorial: Causal Inference and Stable Learning
  Tong Zhang · Peng Cui