Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Matthias Springstein
  • Eric Müller-Budack
  • Ralph Ewerth

Research Organisations

External Research Organisations

  • German National Library of Science and Technology (TIB)
View graph of relations

Details

Original languageEnglish
Title of host publicationMMPT 2021
Subtitle of host publicationProceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding
Pages46-54
Number of pages9
ISBN (electronic)9781450385305
Publication statusPublished - 27 Aug 2021
Event1st International Joint Workshop on Multi-Modal Pre-Training for Multimedia Understanding, MMPT 2021 - Taipei, Taiwan
Duration: 21 Aug 2021 → …

Publication series

NameMMPT 2021 - Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding

Abstract

The recognition of handwritten mathematical expressions in images and video frames is a difficult and unsolved problem yet. Deep convectional neural networks are basically a promising approach, but typically require a large amount of labeled training data. However, such a large training dataset does not exist for the task of handwritten formula recognition. In this paper, we introduce a system that creates a large set of synthesized training examples of mathematical expressions which are derived from LaTeX documents. For this purpose, we propose a novel attention-based generative adversarial network to translate rendered equations to handwritten formulas. The datasets generated by this approach contain hundreds of thousands of formulas, making it ideal for pretraining or the design of more complex models. We evaluate our synthesized dataset and the recognition approach on the CROHME 2014 benchmark dataset. Experimental results demonstrate the feasibility of the approach.

Keywords

    datasets, formula recognition, generative adversarial network

ASJC Scopus subject areas

Cite this

Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention. / Springstein, Matthias; Müller-Budack, Eric; Ewerth, Ralph.
MMPT 2021 : Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding. 2021. p. 46-54 (MMPT 2021 - Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Springstein, M, Müller-Budack, E & Ewerth, R 2021, Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention. in MMPT 2021 : Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding. MMPT 2021 - Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding, pp. 46-54, 1st International Joint Workshop on Multi-Modal Pre-Training for Multimedia Understanding, MMPT 2021, Taipei, Taiwan, 21 Aug 2021. https://doi.org/10.48550/arXiv.2106.09432, https://doi.org/10.1145/3463945.3469059
Springstein, M., Müller-Budack, E., & Ewerth, R. (2021). Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention. In MMPT 2021 : Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding (pp. 46-54). (MMPT 2021 - Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding). https://doi.org/10.48550/arXiv.2106.09432, https://doi.org/10.1145/3463945.3469059
Springstein M, Müller-Budack E, Ewerth R. Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention. In MMPT 2021 : Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding. 2021. p. 46-54. (MMPT 2021 - Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding). doi: 10.48550/arXiv.2106.09432, 10.1145/3463945.3469059
Springstein, Matthias ; Müller-Budack, Eric ; Ewerth, Ralph. / Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention. MMPT 2021 : Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding. 2021. pp. 46-54 (MMPT 2021 - Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding).
Download
@inproceedings{92e79591b6934694992747f0ebe587c6,
title = "Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention",
abstract = "The recognition of handwritten mathematical expressions in images and video frames is a difficult and unsolved problem yet. Deep convectional neural networks are basically a promising approach, but typically require a large amount of labeled training data. However, such a large training dataset does not exist for the task of handwritten formula recognition. In this paper, we introduce a system that creates a large set of synthesized training examples of mathematical expressions which are derived from LaTeX documents. For this purpose, we propose a novel attention-based generative adversarial network to translate rendered equations to handwritten formulas. The datasets generated by this approach contain hundreds of thousands of formulas, making it ideal for pretraining or the design of more complex models. We evaluate our synthesized dataset and the recognition approach on the CROHME 2014 benchmark dataset. Experimental results demonstrate the feasibility of the approach.",
keywords = "datasets, formula recognition, generative adversarial network",
author = "Matthias Springstein and Eric M{\"u}ller-Budack and Ralph Ewerth",
year = "2021",
month = aug,
day = "27",
doi = "10.48550/arXiv.2106.09432",
language = "English",
series = "MMPT 2021 - Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding",
pages = "46--54",
booktitle = "MMPT 2021",
note = "1st International Joint Workshop on Multi-Modal Pre-Training for Multimedia Understanding, MMPT 2021 ; Conference date: 21-08-2021",

}

Download

TY - GEN

T1 - Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention

AU - Springstein, Matthias

AU - Müller-Budack, Eric

AU - Ewerth, Ralph

PY - 2021/8/27

Y1 - 2021/8/27

N2 - The recognition of handwritten mathematical expressions in images and video frames is a difficult and unsolved problem yet. Deep convectional neural networks are basically a promising approach, but typically require a large amount of labeled training data. However, such a large training dataset does not exist for the task of handwritten formula recognition. In this paper, we introduce a system that creates a large set of synthesized training examples of mathematical expressions which are derived from LaTeX documents. For this purpose, we propose a novel attention-based generative adversarial network to translate rendered equations to handwritten formulas. The datasets generated by this approach contain hundreds of thousands of formulas, making it ideal for pretraining or the design of more complex models. We evaluate our synthesized dataset and the recognition approach on the CROHME 2014 benchmark dataset. Experimental results demonstrate the feasibility of the approach.

AB - The recognition of handwritten mathematical expressions in images and video frames is a difficult and unsolved problem yet. Deep convectional neural networks are basically a promising approach, but typically require a large amount of labeled training data. However, such a large training dataset does not exist for the task of handwritten formula recognition. In this paper, we introduce a system that creates a large set of synthesized training examples of mathematical expressions which are derived from LaTeX documents. For this purpose, we propose a novel attention-based generative adversarial network to translate rendered equations to handwritten formulas. The datasets generated by this approach contain hundreds of thousands of formulas, making it ideal for pretraining or the design of more complex models. We evaluate our synthesized dataset and the recognition approach on the CROHME 2014 benchmark dataset. Experimental results demonstrate the feasibility of the approach.

KW - datasets

KW - formula recognition

KW - generative adversarial network

UR - http://www.scopus.com/inward/record.url?scp=85114808291&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2106.09432

DO - 10.48550/arXiv.2106.09432

M3 - Conference contribution

AN - SCOPUS:85114808291

T3 - MMPT 2021 - Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding

SP - 46

EP - 54

BT - MMPT 2021

T2 - 1st International Joint Workshop on Multi-Modal Pre-Training for Multimedia Understanding, MMPT 2021

Y2 - 21 August 2021

ER -