Classification of Visualization Types and Perspectives in Patents

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Junaid Ahmed Ghauri
  • Eric Müller-Budack
  • Ralph Ewerth

Research Organisations

External Research Organisations

  • German National Library of Science and Technology (TIB)
View graph of relations

Details

Original languageEnglish
Title of host publicationLinking Theory and Practice of Digital Libraries
EditorsOmar Alonso, Helena Cousijn, Gianmaria Silvello, Stefano Marchesin, Mónica Marrero, Carla Teixeira Lopes
Pages182-191
Number of pages10
ISBN (electronic)978-3-031-43849-3
Publication statusPublished - 2023

Publication series

NameLecture Notes in Computer Sciences
Volume14241
ISSN (Print)0302-9743
ISSN (electronic)1611-3349

Abstract

Due to the swift growth of patent applications each year, information and multimedia retrieval approaches that facilitate patent exploration and retrieval are of utmost importance. Different types of visualizations (e.g., graphs, technical drawings) and perspectives (e.g., side view, perspective) are used to visualize details of innovations in patents. The classification of these images enables a more efficient search in digital libraries and allows for further analysis. So far, datasets for image type classification miss some important visualization types for patents. Furthermore, related work does not make use of recent deep learning approaches including transformers. In this paper, we adopt state-of-the-art deep learning methods for the classification of visualization types and perspectives in patent images. We extend the CLEF-IP dataset for image type classification in patents to ten classes and provide manual ground truth annotations. In addition, we derive a set of hierarchical classes from a dataset that provides weakly-labeled data for image perspectives. Experimental results have demonstrated the feasibility of the proposed approaches. Source code, models, and datasets are publicly available (https://github.com/TIBHannover/PatentImageClassification ).

Keywords

    deep learning, digital libraries, patent image classification

ASJC Scopus subject areas

Cite this

Classification of Visualization Types and Perspectives in Patents. / Ghauri, Junaid Ahmed; Müller-Budack, Eric; Ewerth, Ralph.
Linking Theory and Practice of Digital Libraries. ed. / Omar Alonso; Helena Cousijn; Gianmaria Silvello; Stefano Marchesin; Mónica Marrero; Carla Teixeira Lopes. 2023. p. 182-191 (Lecture Notes in Computer Sciences; Vol. 14241).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Ghauri, JA, Müller-Budack, E & Ewerth, R 2023, Classification of Visualization Types and Perspectives in Patents. in O Alonso, H Cousijn, G Silvello, S Marchesin, M Marrero & C Teixeira Lopes (eds), Linking Theory and Practice of Digital Libraries. Lecture Notes in Computer Sciences, vol. 14241, pp. 182-191. https://doi.org/10.48550/arXiv.2307.10471, https://doi.org/10.1007/978-3-031-43849-3_16
Ghauri, J. A., Müller-Budack, E., & Ewerth, R. (2023). Classification of Visualization Types and Perspectives in Patents. In O. Alonso, H. Cousijn, G. Silvello, S. Marchesin, M. Marrero, & C. Teixeira Lopes (Eds.), Linking Theory and Practice of Digital Libraries (pp. 182-191). (Lecture Notes in Computer Sciences; Vol. 14241). https://doi.org/10.48550/arXiv.2307.10471, https://doi.org/10.1007/978-3-031-43849-3_16
Ghauri JA, Müller-Budack E, Ewerth R. Classification of Visualization Types and Perspectives in Patents. In Alonso O, Cousijn H, Silvello G, Marchesin S, Marrero M, Teixeira Lopes C, editors, Linking Theory and Practice of Digital Libraries. 2023. p. 182-191. (Lecture Notes in Computer Sciences). Epub 2023 Sept 22. doi: 10.48550/arXiv.2307.10471, 10.1007/978-3-031-43849-3_16
Ghauri, Junaid Ahmed ; Müller-Budack, Eric ; Ewerth, Ralph. / Classification of Visualization Types and Perspectives in Patents. Linking Theory and Practice of Digital Libraries. editor / Omar Alonso ; Helena Cousijn ; Gianmaria Silvello ; Stefano Marchesin ; Mónica Marrero ; Carla Teixeira Lopes. 2023. pp. 182-191 (Lecture Notes in Computer Sciences).
Download
@inproceedings{b0cf63e8665e4a3282204a426179ba94,
title = "Classification of Visualization Types and Perspectives in Patents",
abstract = "Due to the swift growth of patent applications each year, information and multimedia retrieval approaches that facilitate patent exploration and retrieval are of utmost importance. Different types of visualizations (e.g., graphs, technical drawings) and perspectives (e.g., side view, perspective) are used to visualize details of innovations in patents. The classification of these images enables a more efficient search in digital libraries and allows for further analysis. So far, datasets for image type classification miss some important visualization types for patents. Furthermore, related work does not make use of recent deep learning approaches including transformers. In this paper, we adopt state-of-the-art deep learning methods for the classification of visualization types and perspectives in patent images. We extend the CLEF-IP dataset for image type classification in patents to ten classes and provide manual ground truth annotations. In addition, we derive a set of hierarchical classes from a dataset that provides weakly-labeled data for image perspectives. Experimental results have demonstrated the feasibility of the proposed approaches. Source code, models, and datasets are publicly available (https://github.com/TIBHannover/PatentImageClassification ).",
keywords = "deep learning, digital libraries, patent image classification",
author = "Ghauri, {Junaid Ahmed} and Eric M{\"u}ller-Budack and Ralph Ewerth",
note = "Funding Information: Acknowledgment. Part of this work is financially supported by the BMBF (Federal Ministry of Education and Research, Germany) project “ExpResViP” (project no: 01IO2004A). We also like to thank our colleague Sushil Awale (TIB) for his valuable feedback.",
year = "2023",
doi = "10.48550/arXiv.2307.10471",
language = "English",
isbn = "978-3-031-43848-6",
series = "Lecture Notes in Computer Sciences",
pages = "182--191",
editor = "Omar Alonso and Helena Cousijn and Gianmaria Silvello and Stefano Marchesin and M{\'o}nica Marrero and {Teixeira Lopes}, Carla",
booktitle = "Linking Theory and Practice of Digital Libraries",

}

Download

TY - GEN

T1 - Classification of Visualization Types and Perspectives in Patents

AU - Ghauri, Junaid Ahmed

AU - Müller-Budack, Eric

AU - Ewerth, Ralph

N1 - Funding Information: Acknowledgment. Part of this work is financially supported by the BMBF (Federal Ministry of Education and Research, Germany) project “ExpResViP” (project no: 01IO2004A). We also like to thank our colleague Sushil Awale (TIB) for his valuable feedback.

PY - 2023

Y1 - 2023

N2 - Due to the swift growth of patent applications each year, information and multimedia retrieval approaches that facilitate patent exploration and retrieval are of utmost importance. Different types of visualizations (e.g., graphs, technical drawings) and perspectives (e.g., side view, perspective) are used to visualize details of innovations in patents. The classification of these images enables a more efficient search in digital libraries and allows for further analysis. So far, datasets for image type classification miss some important visualization types for patents. Furthermore, related work does not make use of recent deep learning approaches including transformers. In this paper, we adopt state-of-the-art deep learning methods for the classification of visualization types and perspectives in patent images. We extend the CLEF-IP dataset for image type classification in patents to ten classes and provide manual ground truth annotations. In addition, we derive a set of hierarchical classes from a dataset that provides weakly-labeled data for image perspectives. Experimental results have demonstrated the feasibility of the proposed approaches. Source code, models, and datasets are publicly available (https://github.com/TIBHannover/PatentImageClassification ).

AB - Due to the swift growth of patent applications each year, information and multimedia retrieval approaches that facilitate patent exploration and retrieval are of utmost importance. Different types of visualizations (e.g., graphs, technical drawings) and perspectives (e.g., side view, perspective) are used to visualize details of innovations in patents. The classification of these images enables a more efficient search in digital libraries and allows for further analysis. So far, datasets for image type classification miss some important visualization types for patents. Furthermore, related work does not make use of recent deep learning approaches including transformers. In this paper, we adopt state-of-the-art deep learning methods for the classification of visualization types and perspectives in patent images. We extend the CLEF-IP dataset for image type classification in patents to ten classes and provide manual ground truth annotations. In addition, we derive a set of hierarchical classes from a dataset that provides weakly-labeled data for image perspectives. Experimental results have demonstrated the feasibility of the proposed approaches. Source code, models, and datasets are publicly available (https://github.com/TIBHannover/PatentImageClassification ).

KW - deep learning

KW - digital libraries

KW - patent image classification

UR - http://www.scopus.com/inward/record.url?scp=85174608481&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2307.10471

DO - 10.48550/arXiv.2307.10471

M3 - Conference contribution

SN - 978-3-031-43848-6

T3 - Lecture Notes in Computer Sciences

SP - 182

EP - 191

BT - Linking Theory and Practice of Digital Libraries

A2 - Alonso, Omar

A2 - Cousijn, Helena

A2 - Silvello, Gianmaria

A2 - Marchesin, Stefano

A2 - Marrero, Mónica

A2 - Teixeira Lopes, Carla

ER -