Details
Original language | English |
---|---|
Title of host publication | Journal Track at ISWC 2019 |
Subtitle of host publication | Proceedings of the Journal Track co-located with the 18th International Semantic Web Conference (ISWC 2019) |
Publication status | Published - 2019 |
Event | 18th International Semantic Web Conference, ISWC 2019 - Auckland, New Zealand Duration: 26 Oct 2019 → 30 Oct 2019 |
Publication series
Name | CEUR Workshop Proceedings |
---|---|
Publisher | CEUR Workshop Proceedings |
Volume | 2576 |
ISSN (Print) | 1613-0073 |
Abstract
Knowledge bases are in widespread use for aiding tasks such as information extraction and information retrieval. However, knowledge bases are known to be inherently incomplete. As a complimentary data source, embedded entity markup based on Microdata, RDFa, and Microformats have become prevalent on the Web. RDF statements extracted from markup are fundamentally different from traditional knowledge graphs: entity descriptions are flat, facts are highly redundant and of varied quality, and, explicit links are missing despite a vast amount of coreferences. Therefore, data fusion is required in order to facilitate the use of markup data for KBA. We present a novel data fusion approach which addresses these issues. We perform a thorough evaluation on a subset of the Web Data Commons dataset and show significant potential for augmenting existing knowledge bases. A comparison with existing data fusion baselines demonstrates superior performance of our approach when applied to Web markup data.
ASJC Scopus subject areas
Cite this
- Standard
- Harvard
- Apa
- Vancouver
- BibTeX
- RIS
Journal Track at ISWC 2019: Proceedings of the Journal Track co-located with the 18th International Semantic Web Conference (ISWC 2019). 2019. (CEUR Workshop Proceedings; Vol. 2576).
Research output: Chapter in book/report/conference proceeding › Conference contribution › Research
}
TY - GEN
T1 - Reflections on: KnowMore - Knowledge Base Augmentation with StructuredWeb Markup
AU - Yu, Ran
AU - Gadiraju, Ujwal
AU - Fetahu, Besnik
AU - Lehmberg, Oliver
AU - Ritze, Dominique
AU - Dietze, Stefan
PY - 2019
Y1 - 2019
N2 - Knowledge bases are in widespread use for aiding tasks such as information extraction and information retrieval. However, knowledge bases are known to be inherently incomplete. As a complimentary data source, embedded entity markup based on Microdata, RDFa, and Microformats have become prevalent on the Web. RDF statements extracted from markup are fundamentally different from traditional knowledge graphs: entity descriptions are flat, facts are highly redundant and of varied quality, and, explicit links are missing despite a vast amount of coreferences. Therefore, data fusion is required in order to facilitate the use of markup data for KBA. We present a novel data fusion approach which addresses these issues. We perform a thorough evaluation on a subset of the Web Data Commons dataset and show significant potential for augmenting existing knowledge bases. A comparison with existing data fusion baselines demonstrates superior performance of our approach when applied to Web markup data.
AB - Knowledge bases are in widespread use for aiding tasks such as information extraction and information retrieval. However, knowledge bases are known to be inherently incomplete. As a complimentary data source, embedded entity markup based on Microdata, RDFa, and Microformats have become prevalent on the Web. RDF statements extracted from markup are fundamentally different from traditional knowledge graphs: entity descriptions are flat, facts are highly redundant and of varied quality, and, explicit links are missing despite a vast amount of coreferences. Therefore, data fusion is required in order to facilitate the use of markup data for KBA. We present a novel data fusion approach which addresses these issues. We perform a thorough evaluation on a subset of the Web Data Commons dataset and show significant potential for augmenting existing knowledge bases. A comparison with existing data fusion baselines demonstrates superior performance of our approach when applied to Web markup data.
UR - http://www.scopus.com/inward/record.url?scp=85082436907&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85082436907
T3 - CEUR Workshop Proceedings
BT - Journal Track at ISWC 2019
T2 - 18th International Semantic Web Conference, ISWC 2019
Y2 - 26 October 2019 through 30 October 2019
ER -