73 Results
Poster | Wed 8:00 | Exploration Through Reward Biasing: Reward-Biased Maximum Likelihood Estimation for Stochastic Multi-Armed Bandits | Xi Liu · Ping-Chun Hsieh · Yu-Heng Hung · Anirban Bhattacharya · P. R. Kumar
Poster | Thu 6:00 | Adaptive Region-Based Active Learning | Corinna Cortes · Giulia DeSalvo · Claudio Gentile · Mehryar Mohri · Ningshan Zhang
Poster | Wed 10:00 | Stochastic bandits with arm-dependent delays | Anne Gael Manegueu · Claire Vernade · Alexandra Carpentier · Michal Valko
Poster | Tue 10:00 | A simpler approach to accelerated optimization: iterative averaging meets optimism | Pooria Joulani · Anant Raj · András György · Csaba Szepesvari
Poster | Thu 6:00 | Online Learning with Dependent Stochastic Feedback Graphs | Corinna Cortes · Giulia DeSalvo · Claudio Gentile · Mehryar Mohri · Ningshan Zhang
Poster | Thu 14:00 | Linear bandits with Stochastic Delayed Feedback | Claire Vernade · Alexandra Carpentier · Tor Lattimore · Giovanni Zappella · Beyza Ermis · Michael Brueckner
Poster | Thu 9:00 | Active World Model Learning in Agent-rich Environments with Progress Curiosity | Kuno Kim · Megumi Sano · Julian De Freitas · Nick Haber · Daniel Yamins
Poster | Wed 8:00 | Online mirror descent and dual averaging: keeping pace in the dynamic case | Huang Fang · Nick Harvey · Victor Sanches Portella · Michael Friedlander
Poster | Tue 7:00 | Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions | Prashanth L.A. · Krishna Jagannathan · Ravi Kolla
Poster | Wed 5:00 | Don't Waste Your Bits! Squeeze Activations and Gradients for Deep Neural Networks via TinyScript | Fangcheng Fu · Yuzheng Hu · Yihan He · Jiawei Jiang · Yingxia Shao · Ce Zhang · Bin Cui
Poster | Tue 11:00 | Near-linear time Gaussian process optimization with adaptive batching and resparsification | Daniele Calandriello · Luigi Carratino · Alessandro Lazaric · Michal Valko · Lorenzo Rosasco
Poster | Thu 12:00 | Information Particle Filter Tree: An Online Algorithm for POMDPs with Belief-Based Rewards on Continuous Domains | Johannes Fischer · Ömer Sahin Tas