Session: Optimization (Combinatorial) 4
Competitive Caching with Machine Learned Advice
Thodoris Lykouris · Sergei Vassilvitskii
We develop a framework for augmenting online algorithms with a machine learned oracle to achieve competitive ratios that provably improve upon unconditional worst-case lower bounds when the oracle has low error. Our approach treats the oracle as a complete black box, and is not dependent on its inner workings or the exact distribution of its errors. We apply this framework to the traditional caching problem: creating an eviction strategy for a cache of size k. We demonstrate that naively following the oracle's recommendations may lead to very poor performance, even when the average error is quite low. Instead, we show how to modify the Marker algorithm to take into account the oracle's predictions, and prove that this combined approach achieves a competitive ratio that both (i) decreases as the oracle's error decreases, and (ii) is always capped by O(log k), which can be achieved without any oracle input. We complement our results with an empirical evaluation of our algorithm on real-world datasets, and show that it performs well empirically even using simple off-the-shelf predictions.
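The core modification is to use the oracle's predicted next-arrival times when choosing which unmarked page to evict. Below is a minimal sketch of that idea, assuming a hypothetical predictor `predict_next(page, now)`; the paper's actual algorithm additionally falls back to random eviction among unmarked pages when predictions prove unreliable, which is what caps the competitive ratio at O(log k).

```python
class PredictiveMarker:
    """Sketch of a marker-style cache that consults a next-arrival predictor."""

    def __init__(self, k, predict_next):
        self.k = k                        # cache size
        self.predict_next = predict_next  # hypothetical oracle: (page, now) -> predicted next request time
        self.cache = set()
        self.marked = set()

    def request(self, page, now):
        if page in self.cache:            # hit: mark the page for this phase
            self.marked.add(page)
            return True
        if len(self.cache) >= self.k:     # miss with a full cache: evict
            unmarked = self.cache - self.marked
            if not unmarked:              # every page is marked: a new phase begins
                self.marked.clear()
                unmarked = set(self.cache)
            # Evict the unmarked page predicted to return furthest in the future.
            victim = max(unmarked, key=lambda p: self.predict_next(p, now))
            self.cache.remove(victim)
        self.cache.add(page)
        self.marked.add(page)
        return False
```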
Distributed Clustering via LSH Based Data Partitioning
Aditya Bhaskara · Pruthuvi Maheshakya Wijewardena
Given the importance of clustering in the analysis of large-scale data, distributed algorithms for formulations such as k-means, k-median, etc. have been extensively studied. A successful approach here has been the “reduce and merge” paradigm, in which each machine reduces its input size to Õ(k), and this data reduction continues (possibly iteratively) until all the data fits on one machine, at which point the problem is solved locally. This approach has the intrinsic bottleneck that each machine must solve a problem of size ≥ k, and needs to communicate at least Ω(k) points to the other machines. We propose a novel data-partitioning idea to overcome this bottleneck, and in effect, have different machines focus on “finding different clusters”. Under the assumption that we know the optimum value of the objective up to a poly(n) factor (arbitrary polynomial), we establish worst-case approximation guarantees for our method. We see that our algorithm results in lower communication as well as a near-optimal number of ‘rounds’ of computation (in the popular MapReduce framework).
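The key idea is to hash points so that each machine becomes responsible for a different region of the space, and in effect for different clusters. A toy sketch under our own assumptions (random-hyperplane hashing, local k-means via scikit-learn), not the paper's actual partitioning scheme or its guarantees:

```python
import numpy as np
from sklearn.cluster import KMeans

def lsh_partition_and_cluster(points, num_machines, k_local, seed=0):
    """Route points to machines with a random-hyperplane LSH so nearby points
    tend to land on the same machine, then cluster each partition locally."""
    rng = np.random.default_rng(seed)
    n_bits = max(1, int(np.ceil(np.log2(num_machines))))
    planes = rng.normal(size=(points.shape[1], n_bits))
    bits = (points @ planes > 0).astype(int)                 # LSH signature per point
    machine = (bits @ (1 << np.arange(n_bits))) % num_machines

    centers = []
    for m in range(num_machines):
        part = points[machine == m]
        if len(part) >= k_local:                             # skip near-empty partitions
            km = KMeans(n_clusters=k_local, n_init=10).fit(part)
            centers.append(km.cluster_centers_)
    return np.vstack(centers)                                # candidate centers from all machines
```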
Learning to Branch
Nina Balcan · Travis Dick · Tuomas Sandholm · Ellen Vitercik
Tree search algorithms, such as branch-and-bound, are the most widely used tools for solving combinatorial problems. These algorithms recursively partition the search space to find an optimal solution. To keep the tree small, it is crucial to carefully decide, when expanding a tree node, which variable to branch on at that node to partition the remaining space. Many partitioning techniques have been proposed, but no theory describes which is optimal. We show how to use machine learning to determine an optimal weighting of any set of partitioning procedures for the instance distribution at hand using samples. Via theory and experiments, we show that learning to branch is both practical and hugely beneficial.
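At a high level, the learned object is a weighting that mixes existing variable-scoring rules, chosen to minimize average tree size on sample instances from the distribution at hand. A hedged sketch for the two-rule case, with `scores_a`, `scores_b`, and `tree_size` as hypothetical stand-ins rather than anything from the paper's code:

```python
def choose_branching_variable(scores_a, scores_b, mu):
    """Branch on the variable maximizing a convex combination of two
    existing scoring rules (e.g., two pseudocost-style scores)."""
    combined = {v: mu * scores_a[v] + (1 - mu) * scores_b[v] for v in scores_a}
    return max(combined, key=combined.get)

def learn_mu(sample_instances, tree_size, grid=21):
    """Empirical-risk-minimization sketch: try candidate weightings on sample
    instances and keep the one yielding the smallest total search tree.
    `tree_size(instance, mu)` is a hypothetical routine that runs
    branch-and-bound with the mixed scoring rule and returns the number
    of nodes explored."""
    candidates = [i / (grid - 1) for i in range(grid)]
    return min(candidates,
               key=lambda mu: sum(tree_size(inst, mu) for inst in sample_instances))
```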
In online optimization, the goal is to iteratively choose solutions from a decision space so as to minimize the average cost over time. When this decision space is described by combinatorial constraints, the problem is generally intractable. In this paper, we consider the paradigm of compiling the set of combinatorial constraints into a deterministic and Decomposable Negation Normal Form (dDNNF) circuit, for which the tasks of linear optimization and solution sampling take linear time. Based on this framework, we provide efficient characterizations of existing combinatorial prediction strategies, with particular attention to mirror descent techniques. These strategies are compared on several real-world benchmarks for which the set of Boolean constraints is preliminarily compiled into a dDNNF circuit.
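The reason optimization becomes easy after compilation is that a linear objective propagates through a dDNNF in one bottom-up pass: AND nodes (decomposable) add their children's values, OR nodes (deterministic) take the best branch. A minimal sketch for cost minimization with nonnegative costs, using node classes that are our own illustration rather than any specific circuit library:

```python
from dataclasses import dataclass
from typing import List, Union

@dataclass
class Lit:
    var: str
    positive: bool

@dataclass
class And:
    children: List["Node"]   # decomposable: children mention disjoint variables

@dataclass
class Or:
    children: List["Node"]   # deterministic: children are mutually exclusive

Node = Union[Lit, And, Or]

def min_cost(root, cost):
    """Minimum linear cost of a satisfying assignment, where cost[var] >= 0 is
    the cost of setting var to 1. Each node is evaluated once, so the pass is
    linear in the circuit size."""
    memo = {}
    def go(node):
        key = id(node)
        if key not in memo:
            if isinstance(node, Lit):
                memo[key] = cost[node.var] if node.positive else 0.0
            elif isinstance(node, And):   # sum over variable-disjoint children
                memo[key] = sum(go(c) for c in node.children)
            else:                         # Or: take the cheaper branch
                memo[key] = min(go(c) for c in node.children)
        return memo[key]
    return go(root)
```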
Approximation Algorithms for Cascading Prediction Models
Matthew Streeter
We present an approximation algorithm that takes a pool of pre-trained models as input and produces from it a cascaded model with similar accuracy but lower average-case cost. Applied to state-of-the-art ImageNet classification models, this yields up to a 2x reduction in floating point multiplications, and up to a 6x reduction in average-case memory I/O. The auto-generated cascades exhibit intuitive properties, such as using lower-resolution input for easier images and requiring higher prediction confidence when using a computationally cheaper model.
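The cascade mechanism itself is simple to illustrate: a cheap model answers when it is confident, and a more expensive model handles the rest. A sketch under our own assumptions (two models returning class-probability vectors, one fixed confidence threshold), not the paper's construction, which searches over cascades built from a pool of pre-trained models to trade accuracy against average-case cost:

```python
import numpy as np

def cascade_predict(x, cheap_model, expensive_model, threshold=0.9):
    """Two-model cascade sketch: accept the cheap model's answer when its top
    probability clears the confidence threshold, otherwise pay for the
    expensive model. The callables and the fixed threshold are illustrative
    assumptions, not the paper's API."""
    probs = np.asarray(cheap_model(x))            # class-probability vector
    if probs.max() >= threshold:
        return int(probs.argmax())                # easy example: cheap path suffices
    return int(np.asarray(expensive_model(x)).argmax())
```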