Details
Original language | English |
---|---|
Title of host publication | Towards Open and Trustworthy Digital Societies |
Subtitle | 23rd International Conference on Asia-Pacific Digital Libraries, ICADL 2021, Proceedings |
Editors | Hao-Ren Ke, Chei Sian Lee, Kazunari Sugiyama |
Place of publication | Cham |
Publisher | Springer Nature Switzerland AG |
Pages | 401-410 |
Number of pages | 10 |
ISBN (electronic) | 978-3-030-91669-5 |
ISBN (print) | 9783030916688 |
Publication status | Published - 2021 |
Event | 23rd International Conference on Asia-Pacific Digital Libraries, ICADL 2021 - Virtual, Online. Duration: 1 Dec 2021 → 3 Dec 2021 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 13133 |
ISSN (print) | 0302-9743 |
ISSN (electronic) | 1611-3349 |
Abstract
We describe a rule-based approach for the automatic acquisition of salient scientific entities from Computational Linguistics (CL) scholarly article titles. Two observations motivated the approach: (i) noting salient aspects of an article’s contribution in its title; and (ii) pattern regularities capturing the salient terms that could be expressed in a set of rules. Only those lexico-syntactic patterns were selected that were easily recognizable, occurred frequently, and positionally indicated a scientific entity type. The rules were developed on a collection of 50,237 CL titles covering all articles in the ACL Anthology. In total, 19,799 research problems, 18,111 solutions, 20,033 resources, 1,059 languages, 6,878 tools, and 21,687 methods were extracted at an average precision of 75%.
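The abstract does not list the rules themselves, so the following is only a minimal sketch of what matching lexico-syntactic patterns against titles could look like. The three regular expressions, the `RULES` list, and the `extract_entities` helper are hypothetical illustrations, not the rule set developed in the paper, and the example titles are invented.

```python
import re

# Hypothetical toy rules, NOT the rules from the paper. Each rule is a
# regular expression over the whole title; its named groups stand for the
# scientific entity type that the position in the pattern suggests.
RULES = [
    # "<solution>: <rest of title>", e.g. a system name before a colon
    re.compile(r"^(?P<solution>[^:]+):\s+.+$"),
    # "<research_problem> using <method>"
    re.compile(r"^(?P<research_problem>.+?)\s+using\s+(?P<method>.+)$",
               re.IGNORECASE),
    # "<method> for <research_problem>"
    re.compile(r"^(?P<method>.+?)\s+for\s+(?P<research_problem>.+)$",
               re.IGNORECASE),
]


def extract_entities(title):
    """Return {entity_type: phrase} from the first rule that matches."""
    for rule in RULES:
        match = rule.match(title.strip())
        if match:
            return {name: text.strip()
                    for name, text in match.groupdict().items()}
    return {}  # no rule applies to this title


if __name__ == "__main__":
    for title in [
        "Named Entity Recognition using Conditional Random Fields",
        "Neural Networks for Machine Translation",
        "FooParser: A Fast Dependency Parser",
    ]:
        print(title, "->", extract_entities(title))
```

Rules are tried in order and the first match wins, which keeps the sketch simple; a realistic rule set would need many more patterns and a way to resolve titles that fit several of them.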
ASJC Scopus subject areas
- Mathematics (all)
- Theoretical Computer Science
- Computer Science (all)
- General Computer Science
Cite this
Pattern-Based Acquisition of Scientific Entities from Scholarly Article Titles. / D’Souza, Jennifer; Auer, Sören. Towards Open and Trustworthy Digital Societies: 23rd International Conference on Asia-Pacific Digital Libraries, ICADL 2021, Proceedings. Ed. / Hao-Ren Ke; Chei Sian Lee; Kazunari Sugiyama. Cham: Springer Nature Switzerland AG, 2021. pp. 401-410 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13133).
Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › Peer-reviewed
TY - GEN
T1 - Pattern-Based Acquisition of Scientific Entities from Scholarly Article Titles
AU - D’Souza, Jennifer
AU - Auer, Sören
N1 - Funding Information: Supported by TIB Leibniz Information Centre for Science and Technology, the EU H2020 ERC project ScienceGRaph (GA ID: 819536).
PY - 2021
Y1 - 2021
AB - We describe a rule-based approach for the automatic acquisition of salient scientific entities from Computational Linguistics (CL) scholarly article titles. Two observations motivated the approach: (i) noting salient aspects of an article’s contribution in its title; and (ii) pattern regularities capturing the salient terms that could be expressed in a set of rules. Only those lexico-syntactic patterns were selected that were easily recognizable, occurred frequently, and positionally indicated a scientific entity type. The rules were developed on a collection of 50,237 CL titles covering all articles in the ACL Anthology. In total, 19,799 research problems, 18,111 solutions, 20,033 resources, 1,059 languages, 6,878 tools, and 21,687 methods were extracted at an average precision of 75%.
KW - Natural language processing
KW - Rule-based system
KW - Scholarly knowledge graphs
KW - Semantic publishing
KW - Terminology extraction
UR - http://www.scopus.com/inward/record.url?scp=85121912565&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-91669-5_31
DO - 10.1007/978-3-030-91669-5_31
M3 - Conference contribution
AN - SCOPUS:85121912565
SN - 9783030916688
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 401
EP - 410
BT - Towards Open and Trustworthy Digital Societies
A2 - Ke, Hao-Ren
A2 - Lee, Chei Sian
A2 - Sugiyama, Kazunari
PB - Springer Nature Switzerland AG
CY - Cham
T2 - 23rd International Conference on Asia-Pacific Digital Libraries, ICADL 2021
Y2 - 1 December 2021 through 3 December 2021
ER -