Details
Originalsprache | Englisch |
---|---|
Seiten (von - bis) | 2294-2312 |
Seitenumfang | 19 |
Fachzeitschrift | Journal of the American Society for Information Science and Technology |
Jahrgang | 63 |
Ausgabenummer | 11 |
Publikationsstatus | Veröffentlicht - 16 Okt. 2012 |
Abstract
Search engines are essential tools for web users today. They rely on a large number of features to compute the rank of search results for each given query. The estimated reputation of pages is among the effective features available for search engine designers, probably being adopted by most current commercial search engines. Page reputation is estimated by analyzing the linkage relationships between pages. This information is used by link analysis algorithms as a query-independent feature, to be taken into account when computing the rank of the results. Unfortunately, several types of links found on the web may damage the estimated page reputation and thus cause a negative effect on the quality of search results. This work studies alternatives to reduce the negative impact of such noisy links. More specifically, the authors propose and evaluate new methods that deal with noisy links, considering scenarios where the reputation of pages is computed using the PageRank algorithm. They show, through experiments with real web content, that their methods achieve significant improvements when compared to previous solutions proposed in the literature.
ASJC Scopus Sachgebiete
- Informatik (insg.)
- Software
- Informatik (insg.)
- Information systems
- Informatik (insg.)
- Mensch-Maschine-Interaktion
- Informatik (insg.)
- Computernetzwerke und -kommunikation
- Informatik (insg.)
- Artificial intelligence
Zitieren
- Standard
- Harvard
- Apa
- Vancouver
- BibTex
- RIS
in: Journal of the American Society for Information Science and Technology, Jahrgang 63, Nr. 11, 16.10.2012, S. 2294-2312.
Publikation: Beitrag in Fachzeitschrift › Artikel › Forschung › Peer-Review
}
TY - JOUR
T1 - Using Site-Level Connections to Estimate Link Confidence
AU - Souza, Jucimar
AU - Carvalho, André
AU - Cristo, Marco
AU - Moura, Edleno
AU - Calado, Pavel
AU - Chirita, Paul Alexandru
AU - Nejdl, Wolfgang
PY - 2012/10/16
Y1 - 2012/10/16
N2 - Search engines are essential tools for web users today. They rely on a large number of features to compute the rank of search results for each given query. The estimated reputation of pages is among the effective features available for search engine designers, probably being adopted by most current commercial search engines. Page reputation is estimated by analyzing the linkage relationships between pages. This information is used by link analysis algorithms as a query-independent feature, to be taken into account when computing the rank of the results. Unfortunately, several types of links found on the web may damage the estimated page reputation and thus cause a negative effect on the quality of search results. This work studies alternatives to reduce the negative impact of such noisy links. More specifically, the authors propose and evaluate new methods that deal with noisy links, considering scenarios where the reputation of pages is computed using the PageRank algorithm. They show, through experiments with real web content, that their methods achieve significant improvements when compared to previous solutions proposed in the literature.
AB - Search engines are essential tools for web users today. They rely on a large number of features to compute the rank of search results for each given query. The estimated reputation of pages is among the effective features available for search engine designers, probably being adopted by most current commercial search engines. Page reputation is estimated by analyzing the linkage relationships between pages. This information is used by link analysis algorithms as a query-independent feature, to be taken into account when computing the rank of the results. Unfortunately, several types of links found on the web may damage the estimated page reputation and thus cause a negative effect on the quality of search results. This work studies alternatives to reduce the negative impact of such noisy links. More specifically, the authors propose and evaluate new methods that deal with noisy links, considering scenarios where the reputation of pages is computed using the PageRank algorithm. They show, through experiments with real web content, that their methods achieve significant improvements when compared to previous solutions proposed in the literature.
KW - information retrieval software
KW - information storage and retrieval systems
KW - search engines
UR - http://www.scopus.com/inward/record.url?scp=84868203478&partnerID=8YFLogxK
U2 - 10.1002/asi.22729
DO - 10.1002/asi.22729
M3 - Article
AN - SCOPUS:84868203478
VL - 63
SP - 2294
EP - 2312
JO - Journal of the American Society for Information Science and Technology
JF - Journal of the American Society for Information Science and Technology
SN - 1532-2882
IS - 11
ER -