Structured predictors require solving a combinatorial optimization problem over a large number of structures, such as dependency trees or alignments. When embedded as structured hidden layers in a neural net, argmin differentiation and efficient gradient computation are further required. Recently, SparseMAP has been proposed as a differentiable, sparse alternative to maximum a posteriori (MAP) and marginal inference. SparseMAP returns an interpretable combination of a small number of structures, and this sparsity is the key to efficient optimization. However, SparseMAP requires access to an exact MAP oracle in the structured model, which excludes, e.g., loopy graphical models or logic constraints, as these generally require approximate inference. In this paper, we introduce LP-SparseMAP, an extension of SparseMAP addressing this limitation via a local polytope relaxation. LP-SparseMAP uses the flexible and powerful language of factor graphs to define expressive hidden structures, supporting coarse decompositions, hard logic constraints, and higher-order correlations. We derive the forward and backward algorithms needed for using LP-SparseMAP as a structured hidden or output layer. Experiments in three structured tasks show benefits versus SparseMAP and Structured SVM.
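As a rough illustration of the kind of sparsity the abstract refers to, the sketch below implements sparsemax, the Euclidean projection of a score vector onto the probability simplex. This is the unstructured special case that SparseMAP generalizes, not LP-SparseMAP itself and not the authors' implementation. The function name `sparsemax` and the example scores are assumptions made for illustration.

```python
def sparsemax(scores):
    """Euclidean projection of `scores` onto the probability simplex.

    Illustrative sketch only: the unstructured analogue of SparseMAP,
    where each "structure" is a single choice among len(scores) options.
    """
    z = sorted(scores, reverse=True)
    cumsum = 0.0
    tau = 0.0
    for k, z_k in enumerate(z, start=1):
        cumsum += z_k
        # The k-th largest score is in the support while
        # 1 + k * z_k exceeds the sum of the top-k scores.
        if 1.0 + k * z_k > cumsum:
            tau = (cumsum - 1.0) / k
    # Shift by the threshold and clip: low scores get exactly zero weight.
    return [max(s - tau, 0.0) for s in scores]


# Hypothetical scores for three candidate choices.
weights = sparsemax([1.0, 0.8, 0.1])
print(weights)  # the lowest-scoring choice receives exactly zero weight
```

Unlike softmax, which assigns nonzero probability to every option, the output here is a convex combination of only a few options, which is the property the abstract attributes to SparseMAP over structures.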
Author Information
Vlad Niculae (Instituto de Telecomunicações)
Andre Filipe Torres Martins (Instituto de Telecomunicações)
More from the Same Authors
- 2022 Poster: Modeling Structure with Undirected Neural Networks
  Tsvetomila Mihaylova · Vlad Niculae · Andre Filipe Torres Martins
- 2022 Spotlight: Modeling Structure with Undirected Neural Networks
  Tsvetomila Mihaylova · Vlad Niculae · Andre Filipe Torres Martins
- 2021 Poster: Learning Binary Decision Trees by Argmin Differentiation
  Valentina Zantedeschi · Matt J. Kusner · Vlad Niculae
- 2021 Spotlight: Learning Binary Decision Trees by Argmin Differentiation
  Valentina Zantedeschi · Matt J. Kusner · Vlad Niculae
- 2018 Poster: SparseMAP: Differentiable Sparse Structured Inference
  Vlad Niculae · Andre Filipe Torres Martins · Mathieu Blondel · Claire Cardie
- 2018 Oral: SparseMAP: Differentiable Sparse Structured Inference
  Vlad Niculae · Andre Filipe Torres Martins · Mathieu Blondel · Claire Cardie