Details
Originalsprache | Englisch |
---|---|
Titel des Sammelwerks | 16th International World Wide Web Conference, WWW2007 |
Herausgeber (Verlag) | Association for Computing Machinery (ACM) |
Seiten | 1297-1298 |
Seitenumfang | 2 |
ISBN (Print) | 1595936548, 9781595936547 |
Publikationsstatus | Veröffentlicht - 8 Mai 2007 |
Veranstaltung | 16th International World Wide Web Conference, WWW2007 - Banff, AB, Kanada Dauer: 8 Mai 2007 → 12 Mai 2007 |
Publikationsreihe
Name | 16th International World Wide Web Conference, WWW2007 |
---|
Abstract
Mirroring Web sites is a well-known technique commonly used in the Web community. A mirror site should be updated frequently to ensure that it reflects the content of the original site. Existing mirroring tools apply page-level strategies to check each page of a site, which is inefficient and expensive. In this paper, we propose a novel site-level mirror maintenance strategy. Our approach studies the evolution of Web directorystructures and mines association rules between ancestor-descendant Web directories. Discovered rules indicate the evolution correlations between Web directories. Thus, when maintaining the mirror of a Web site (directory), we can optimally skipsubdirectories which are negatively correlated with it in undergoing significant changes. The preliminary experimental results show that our approach improves the efficiency of the mirror maintenance process significantly while sacrificing slightly in keeping the "freshness" of the mirrors.
ASJC Scopus Sachgebiete
- Informatik (insg.)
- Computernetzwerke und -kommunikation
- Informatik (insg.)
- Software
Zitieren
- Standard
- Harvard
- Apa
- Vancouver
- BibTex
- RIS
16th International World Wide Web Conference, WWW2007. Association for Computing Machinery (ACM), 2007. S. 1297-1298 (16th International World Wide Web Conference, WWW2007).
Publikation: Beitrag in Buch/Bericht/Sammelwerk/Konferenzband › Aufsatz in Konferenzband › Forschung › Peer-Review
}
TY - GEN
T1 - Mirror site maintenance based on evolution associations of web directories
AU - Chen, Ling
AU - Bhowmick, Sourav
AU - Nejdl, Wolfgang
PY - 2007/5/8
Y1 - 2007/5/8
N2 - Mirroring Web sites is a well-known technique commonly used in the Web community. A mirror site should be updated frequently to ensure that it reflects the content of the original site. Existing mirroring tools apply page-level strategies to check each page of a site, which is inefficient and expensive. In this paper, we propose a novel site-level mirror maintenance strategy. Our approach studies the evolution of Web directorystructures and mines association rules between ancestor-descendant Web directories. Discovered rules indicate the evolution correlations between Web directories. Thus, when maintaining the mirror of a Web site (directory), we can optimally skipsubdirectories which are negatively correlated with it in undergoing significant changes. The preliminary experimental results show that our approach improves the efficiency of the mirror maintenance process significantly while sacrificing slightly in keeping the "freshness" of the mirrors.
AB - Mirroring Web sites is a well-known technique commonly used in the Web community. A mirror site should be updated frequently to ensure that it reflects the content of the original site. Existing mirroring tools apply page-level strategies to check each page of a site, which is inefficient and expensive. In this paper, we propose a novel site-level mirror maintenance strategy. Our approach studies the evolution of Web directorystructures and mines association rules between ancestor-descendant Web directories. Discovered rules indicate the evolution correlations between Web directories. Thus, when maintaining the mirror of a Web site (directory), we can optimally skipsubdirectories which are negatively correlated with it in undergoing significant changes. The preliminary experimental results show that our approach improves the efficiency of the mirror maintenance process significantly while sacrificing slightly in keeping the "freshness" of the mirrors.
KW - Evolution correlation
KW - Mirror maintenance
KW - Web evolution
UR - http://www.scopus.com/inward/record.url?scp=35348821234&partnerID=8YFLogxK
U2 - 10.1145/1242572.1242817
DO - 10.1145/1242572.1242817
M3 - Conference contribution
AN - SCOPUS:35348821234
SN - 1595936548
SN - 9781595936547
T3 - 16th International World Wide Web Conference, WWW2007
SP - 1297
EP - 1298
BT - 16th International World Wide Web Conference, WWW2007
PB - Association for Computing Machinery (ACM)
T2 - 16th International World Wide Web Conference, WWW2007
Y2 - 8 May 2007 through 12 May 2007
ER -