We present a neural encoder-decoder model to convert images into presentational markup based on a scalable coarse-to-fine attention mechanism. Our method is evaluated in the context of image-to-LaTeX generation, and we introduce a new dataset of real-world rendered mathematical expressions paired with LaTeX markup. We show that, unlike neural OCR techniques using CTC-based models, attention-based approaches can tackle this non-standard OCR task. Our approach outperforms classical mathematical OCR systems by a large margin on in-domain rendered data and, with pretraining, also performs well on out-of-domain handwritten data. To reduce the inference complexity associated with attention-based approaches, we introduce a new coarse-to-fine attention layer that selects a support region before applying attention.
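The idea of selecting a support region before applying attention can be illustrated with a minimal sketch. This is not the authors' implementation: the dot-product scoring, the argmax selection of a single coarse cell, the fixed `cell_size` layout, and every function and parameter name below are assumptions made for illustration only. The point it shows is that fine attention weights are computed over one region rather than over every fine feature.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def coarse_to_fine_attention(query, coarse_feats, fine_feats, cell_size):
    """Attend over fine features inside one selected coarse cell.

    Assumed layout (hypothetical): coarse cell i owns the slice
    fine_feats[i * cell_size : (i + 1) * cell_size].
    """
    # Step 1: score every coarse cell and pick one support region.
    # Argmax is an illustrative stand-in for the selection step.
    coarse_scores = [dot(query, c) for c in coarse_feats]
    best = max(range(len(coarse_scores)), key=coarse_scores.__getitem__)

    # Step 2: apply ordinary soft attention, but only inside that region.
    start = best * cell_size
    region = fine_feats[start:start + cell_size]
    weights = softmax([dot(query, f) for f in region])

    # Context vector: attention-weighted sum of the region's features.
    context = [
        sum(w * f[d] for w, f in zip(weights, region))
        for d in range(len(query))
    ]
    return best, weights, context
```

With this layout, each decoding step scores the coarse cells plus `cell_size` fine features instead of all fine features, which is where the reduction in attention cost would come from under these assumptions.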
Author Information
Yuntian Deng (Harvard University)
Anssi Kanervisto (University of Eastern Finland)
Jeffrey Ling (Harvard University)
Alexander Rush (Harvard University)
Related Events (a corresponding poster, oral, or spotlight)
- 2017 Poster: Image-to-Markup Generation with Coarse-to-Fine Attention
  Wed. Aug 9th 08:30 AM -- 12:00 PM Room Gallery #45
More from the Same Authors
- 2019 Poster: Latent Normalizing Flows for Discrete Sequences
  Zachary Ziegler · Alexander Rush
- 2019 Oral: Latent Normalizing Flows for Discrete Sequences
  Zachary Ziegler · Alexander Rush
- 2019 Poster: Tensor Variable Elimination for Plated Factor Graphs
  Fritz Obermeyer · Elias Bingham · Martin Jankowiak · Neeraj Pradhan · Justin Chiu · Alexander Rush · Noah Goodman
- 2019 Oral: Tensor Variable Elimination for Plated Factor Graphs
  Fritz Obermeyer · Elias Bingham · Martin Jankowiak · Neeraj Pradhan · Justin Chiu · Alexander Rush · Noah Goodman
- 2018 Poster: Semi-Amortized Variational Autoencoders
  Yoon Kim · Sam Wiseman · Andrew Miller · David Sontag · Alexander Rush
- 2018 Poster: Weightless: Lossy weight encoding for deep neural network compression
  Brandon Reagen · Udit Gupta · Bob Adolf · Michael Mitzenmacher · Alexander Rush · Gu-Yeon Wei · David Brooks
- 2018 Poster: Adversarially Regularized Autoencoders
  Jake Zhao · Yoon Kim · Kelly Zhang · Alexander Rush · Yann LeCun
- 2018 Oral: Semi-Amortized Variational Autoencoders
  Yoon Kim · Sam Wiseman · Andrew Miller · David Sontag · Alexander Rush
- 2018 Oral: Weightless: Lossy weight encoding for deep neural network compression
  Brandon Reagen · Udit Gupta · Bob Adolf · Michael Mitzenmacher · Alexander Rush · Gu-Yeon Wei · David Brooks
- 2018 Oral: Adversarially Regularized Autoencoders
  Jake Zhao · Yoon Kim · Kelly Zhang · Alexander Rush · Yann LeCun
- 2017 Poster: Learning Latent Space Models with Angular Constraints
  Pengtao Xie · Yuntian Deng · Yi Zhou · Abhimanu Kumar · Yaoliang Yu · James Zou · Eric Xing
- 2017 Talk: Learning Latent Space Models with Angular Constraints
  Pengtao Xie · Yuntian Deng · Yi Zhou · Abhimanu Kumar · Yaoliang Yu · James Zou · Eric Xing