SimMatching: Adaptable Road Network Matching for Efficient and Scalable Spatial Data Integration

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Autorschaft

  • Michael Schäfers
  • Udo W. Lipeck
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Titel des SammelwerksSIGSPATIAL PhD '14
Untertitel Proceedings of the 1st ACM SIGSPATIAL PhD Workshop
Herausgeber/-innenUgur Demiryurek, Mohamed Sarwat
Seitenumfang5
ISBN (elektronisch)9781450331586
PublikationsstatusVeröffentlicht - 4 Nov. 2014
Veranstaltung2014 1st ACM SIGPATIAL PhD Workshop, SIGSPATIAL PhD 2014 - In Conjunction with 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL 2014 - Dallas, USA / Vereinigte Staaten
Dauer: 4 Nov. 20147 Nov. 2014

Abstract

Spatial data integration is a challenging task due to the high degree of diversity between different geodata sources, the inherent complexity of objects, and the large size of datasets. To avoid duplicates in an integrated dataset, input sources have to be linked on the instance level. By matching spatial objects, multiple representations of the same real-world entity shall be identified based on similarity computation. In this paper, we present an approach for similarity-based spatial matching of road networks. Our SimMatching algorithm adapts to a variety of input data characteristics by using weighted similarity measures. Geometric and semantic attributes are considered as well as the dataset topology to enhance similarity computations with relational measures. We use a greedy approach and optimizations to keep the number of match candidates minimal all the time. This allows very low runtimes while giving high quality matching results. Supported by a partitioning framework and parallel processing, it also guarantees scalability to large datasets.

ASJC Scopus Sachgebiete

Zitieren

SimMatching: Adaptable Road Network Matching for Efficient and Scalable Spatial Data Integration. / Schäfers, Michael; Lipeck, Udo W.
SIGSPATIAL PhD '14: Proceedings of the 1st ACM SIGSPATIAL PhD Workshop. Hrsg. / Ugur Demiryurek; Mohamed Sarwat. 2014. 2694866.

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Schäfers, M & Lipeck, UW 2014, SimMatching: Adaptable Road Network Matching for Efficient and Scalable Spatial Data Integration. in U Demiryurek & M Sarwat (Hrsg.), SIGSPATIAL PhD '14: Proceedings of the 1st ACM SIGSPATIAL PhD Workshop., 2694866, 2014 1st ACM SIGPATIAL PhD Workshop, SIGSPATIAL PhD 2014 - In Conjunction with 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL 2014, Dallas, Texas, USA / Vereinigte Staaten, 4 Nov. 2014. https://doi.org/10.1145/2694859.2694866
Schäfers, M., & Lipeck, U. W. (2014). SimMatching: Adaptable Road Network Matching for Efficient and Scalable Spatial Data Integration. In U. Demiryurek, & M. Sarwat (Hrsg.), SIGSPATIAL PhD '14: Proceedings of the 1st ACM SIGSPATIAL PhD Workshop Artikel 2694866 https://doi.org/10.1145/2694859.2694866
Schäfers M, Lipeck UW. SimMatching: Adaptable Road Network Matching for Efficient and Scalable Spatial Data Integration. in Demiryurek U, Sarwat M, Hrsg., SIGSPATIAL PhD '14: Proceedings of the 1st ACM SIGSPATIAL PhD Workshop. 2014. 2694866 doi: 10.1145/2694859.2694866
Schäfers, Michael ; Lipeck, Udo W. / SimMatching : Adaptable Road Network Matching for Efficient and Scalable Spatial Data Integration. SIGSPATIAL PhD '14: Proceedings of the 1st ACM SIGSPATIAL PhD Workshop. Hrsg. / Ugur Demiryurek ; Mohamed Sarwat. 2014.
Download
@inproceedings{fde8ba0f2176431e886b4280275baa11,
title = "SimMatching: Adaptable Road Network Matching for Efficient and Scalable Spatial Data Integration",
abstract = "Spatial data integration is a challenging task due to the high degree of diversity between different geodata sources, the inherent complexity of objects, and the large size of datasets. To avoid duplicates in an integrated dataset, input sources have to be linked on the instance level. By matching spatial objects, multiple representations of the same real-world entity shall be identified based on similarity computation. In this paper, we present an approach for similarity-based spatial matching of road networks. Our SimMatching algorithm adapts to a variety of input data characteristics by using weighted similarity measures. Geometric and semantic attributes are considered as well as the dataset topology to enhance similarity computations with relational measures. We use a greedy approach and optimizations to keep the number of match candidates minimal all the time. This allows very low runtimes while giving high quality matching results. Supported by a partitioning framework and parallel processing, it also guarantees scalability to large datasets.",
keywords = "Data Matching, Road Networks, Scalability, Similarity, Spatial Data Integration, Spatial Databases",
author = "Michael Sch{\"a}fers and Lipeck, {Udo W.}",
year = "2014",
month = nov,
day = "4",
doi = "10.1145/2694859.2694866",
language = "English",
editor = "Ugur Demiryurek and Mohamed Sarwat",
booktitle = "SIGSPATIAL PhD '14",
note = "2014 1st ACM SIGPATIAL PhD Workshop, SIGSPATIAL PhD 2014 - In Conjunction with 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL 2014 ; Conference date: 04-11-2014 Through 07-11-2014",

}

Download

TY - GEN

T1 - SimMatching

T2 - 2014 1st ACM SIGPATIAL PhD Workshop, SIGSPATIAL PhD 2014 - In Conjunction with 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL 2014

AU - Schäfers, Michael

AU - Lipeck, Udo W.

PY - 2014/11/4

Y1 - 2014/11/4

N2 - Spatial data integration is a challenging task due to the high degree of diversity between different geodata sources, the inherent complexity of objects, and the large size of datasets. To avoid duplicates in an integrated dataset, input sources have to be linked on the instance level. By matching spatial objects, multiple representations of the same real-world entity shall be identified based on similarity computation. In this paper, we present an approach for similarity-based spatial matching of road networks. Our SimMatching algorithm adapts to a variety of input data characteristics by using weighted similarity measures. Geometric and semantic attributes are considered as well as the dataset topology to enhance similarity computations with relational measures. We use a greedy approach and optimizations to keep the number of match candidates minimal all the time. This allows very low runtimes while giving high quality matching results. Supported by a partitioning framework and parallel processing, it also guarantees scalability to large datasets.

AB - Spatial data integration is a challenging task due to the high degree of diversity between different geodata sources, the inherent complexity of objects, and the large size of datasets. To avoid duplicates in an integrated dataset, input sources have to be linked on the instance level. By matching spatial objects, multiple representations of the same real-world entity shall be identified based on similarity computation. In this paper, we present an approach for similarity-based spatial matching of road networks. Our SimMatching algorithm adapts to a variety of input data characteristics by using weighted similarity measures. Geometric and semantic attributes are considered as well as the dataset topology to enhance similarity computations with relational measures. We use a greedy approach and optimizations to keep the number of match candidates minimal all the time. This allows very low runtimes while giving high quality matching results. Supported by a partitioning framework and parallel processing, it also guarantees scalability to large datasets.

KW - Data Matching

KW - Road Networks

KW - Scalability

KW - Similarity

KW - Spatial Data Integration

KW - Spatial Databases

UR - http://www.scopus.com/inward/record.url?scp=84928024782&partnerID=8YFLogxK

U2 - 10.1145/2694859.2694866

DO - 10.1145/2694859.2694866

M3 - Conference contribution

AN - SCOPUS:84928024782

BT - SIGSPATIAL PhD '14

A2 - Demiryurek, Ugur

A2 - Sarwat, Mohamed

Y2 - 4 November 2014 through 7 November 2014

ER -