Details
Originalsprache | Englisch |
---|---|
Titel des Sammelwerks | The Past Web |
Untertitel | Exploring Web Archives |
Erscheinungsort | Cham |
Herausgeber (Verlag) | Springer International Publishing AG |
Seiten | 57-67 |
Seitenumfang | 11 |
ISBN (elektronisch) | 9783030632915 |
ISBN (Print) | 9783030632908 |
Publikationsstatus | Veröffentlicht - 1 Juli 2021 |
Abstract
Web archives are an essential information source for research on historical events. However, the large scale and heterogeneity of web archives make it difficult for researchers to access relevant event-specific materials. In this chapter, we discuss methods for creating event-centric collections from large-scale web archives. These methods are manifold and may require manual curation, adopt search or deploy focused crawling. In this chapter, we focus on the crawl-based methods that identify relevant documents in and across web archives and include link networks as context in the resulting collections.
ASJC Scopus Sachgebiete
- Informatik (insg.)
- Allgemeine Computerwissenschaft
- Geisteswissenschaftliche Fächer (insg.)
- Allgemeine Kunst und Geisteswissenschaften
- Sozialwissenschaften (insg.)
- Allgemeine Sozialwissenschaften
Zitieren
- Standard
- Harvard
- Apa
- Vancouver
- BibTex
- RIS
The Past Web: Exploring Web Archives. Cham: Springer International Publishing AG, 2021. S. 57-67.
Publikation: Beitrag in Buch/Bericht/Sammelwerk/Konferenzband › Beitrag in Buch/Sammelwerk › Forschung › Peer-Review
}
TY - CHAP
T1 - Creating Event-Centric Collections from Web Archives
AU - Demidova, Elena
AU - Risse, Thomas
PY - 2021/7/1
Y1 - 2021/7/1
N2 - Web archives are an essential information source for research on historical events. However, the large scale and heterogeneity of web archives make it difficult for researchers to access relevant event-specific materials. In this chapter, we discuss methods for creating event-centric collections from large-scale web archives. These methods are manifold and may require manual curation, adopt search or deploy focused crawling. In this chapter, we focus on the crawl-based methods that identify relevant documents in and across web archives and include link networks as context in the resulting collections.
AB - Web archives are an essential information source for research on historical events. However, the large scale and heterogeneity of web archives make it difficult for researchers to access relevant event-specific materials. In this chapter, we discuss methods for creating event-centric collections from large-scale web archives. These methods are manifold and may require manual curation, adopt search or deploy focused crawling. In this chapter, we focus on the crawl-based methods that identify relevant documents in and across web archives and include link networks as context in the resulting collections.
UR - http://www.scopus.com/inward/record.url?scp=85150075966&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-63291-5_6
DO - 10.1007/978-3-030-63291-5_6
M3 - Contribution to book/anthology
AN - SCOPUS:85150075966
SN - 9783030632908
SP - 57
EP - 67
BT - The Past Web
PB - Springer International Publishing AG
CY - Cham
ER -