Unsupervised strategies for shilling detection and robust collaborative filtering

Bhaskar Mehta; Wolfgang Nejdl

doi:10.1007/s11257-008-9050-4

Details

Original language	English
Pages (from - to)	65-97
Number of pages	33
Journal	User Modeling and User-Adapted Interaction
Volume	19
Issue number	1-2 SPEC. ISS.
Early online date	18 July 2008
Publication status	Published - Feb. 2009

Abstract

Collaborative filtering systems are essentially social systems which base their recommendations on the judgment of a large number of people. However, like other social systems, they are also vulnerable to manipulation by malicious social elements. Lies and propaganda may be spread by a malicious user who may have an interest in promoting an item, or downplaying the popularity of another one. By doing this systematically, either with multiple identities or by involving more people, malicious user votes and profiles can be injected into a collaborative recommender system. This can significantly affect the robustness of a system or algorithm, as has been studied in previous work. While current detection algorithms are able to use certain characteristics of shilling profiles to detect them, they suffer from low precision and require a large amount of training data. In this work, we provide an in-depth analysis of shilling profiles and describe new approaches to detect malicious collaborative filtering profiles. In particular, we exploit the similarity structure in shilling user profiles to separate them from normal user profiles using unsupervised dimensionality reduction. We present two detection algorithms: one based on PCA, the other on PLSA. Experimental results show a much improved detection precision over existing methods without the additional training time that supervised approaches require. Finally, we present a novel and highly effective robust collaborative filtering algorithm which uses ideas presented in the detection algorithms using principal component analysis.
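
The abstract does not spell out the PCA-based detector, so the following is only a minimal sketch of one plausible reading: treat users as the PCA variables, and flag the users that contribute least to the leading principal components. The function names, the dense-matrix input, the per-user z-scoring, and the toy data generator are all assumptions made for illustration, not the authors' implementation.

```python
import numpy as np


def low_loading_users(ratings, n_components, n_flag):
    """Flag the n_flag users that load least on the top principal components.

    Assumption: `ratings` is a dense (n_users, n_items) array with no
    missing values; imputation is out of scope for this sketch.
    """
    ratings = np.asarray(ratings, dtype=float)

    # Standardize each user's ratings (z-scores per row).
    mean = ratings.mean(axis=1, keepdims=True)
    std = ratings.std(axis=1, keepdims=True)
    std[std == 0] = 1.0  # avoid division by zero for constant profiles
    z = (ratings - mean) / std

    # User-by-user covariance: users play the role of PCA variables.
    cov = z @ z.T / z.shape[1]

    # eigh returns eigenvalues in ascending order, so the last columns
    # are the eigenvectors belonging to the largest eigenvalues.
    _, eigvecs = np.linalg.eigh(cov)
    top = eigvecs[:, -n_components:]

    # A user's score is the sum of squared loadings on those components.
    score = (top ** 2).sum(axis=1)
    return np.argsort(score)[:n_flag]


def toy_ratings(seed=0, n_items=400, n_normal=50, n_shill=10):
    """Hypothetical toy data: structured genuine users followed by shills."""
    rng = np.random.default_rng(seed)
    # Genuine users mix two latent taste factors at a random angle.
    angle = rng.uniform(0.0, 2.0 * np.pi, size=n_normal)
    mix = np.column_stack([np.cos(angle), np.sin(angle)])
    factors = rng.normal(size=(2, n_items))
    normal = mix @ factors + 0.05 * rng.normal(size=(n_normal, n_items))
    # Shill profiles: random filler ratings plus a pushed target item 0.
    shill = rng.normal(size=(n_shill, n_items))
    shill[:, 0] = 3.0
    return np.vstack([normal, shill])


if __name__ == "__main__":
    flagged = low_loading_users(toy_ratings(), n_components=2, n_flag=10)
    print(sorted(flagged.tolist()))
```

In this toy setup the genuine users share two latent factors, so `n_components=2` matches the dominant structure. Asking for more components than the genuine structure supports can let the injected profiles themselves define a component, which would give them high loadings and defeat the heuristic.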

ASJC Scopus subject areas

Social Sciences (all): Education
Computer Science (all): Human-Computer Interaction
Computer Science (all): Computer Science Applications

Cite this

Unsupervised strategies for shilling detection and robust collaborative filtering. / Mehta, Bhaskar; Nejdl, Wolfgang.
In: User Modeling and User-Adapted Interaction, Vol. 19, No. 1-2 SPEC. ISS., 02.2009, p. 65-97.

Publication: Contribution to journal › Article › Research › Peer-reviewed

Mehta, B & Nejdl, W 2009, 'Unsupervised strategies for shilling detection and robust collaborative filtering', User Modeling and User-Adapted Interaction, vol. 19, no. 1-2 SPEC. ISS., pp. 65-97. https://doi.org/10.1007/s11257-008-9050-4

Mehta, B., & Nejdl, W. (2009). Unsupervised strategies for shilling detection and robust collaborative filtering. User Modeling and User-Adapted Interaction, 19(1-2 SPEC. ISS.), 65-97. https://doi.org/10.1007/s11257-008-9050-4

Mehta B, Nejdl W. Unsupervised strategies for shilling detection and robust collaborative filtering. User Modeling and User-Adapted Interaction. 2009 Feb;19(1-2 SPEC. ISS.):65-97. Epub 2008 Jul 18. doi: 10.1007/s11257-008-9050-4

Mehta, Bhaskar ; Nejdl, Wolfgang. / Unsupervised strategies for shilling detection and robust collaborative filtering. In: User Modeling and User-Adapted Interaction. 2009 ; Vol. 19, No. 1-2 SPEC. ISS. pp. 65-97.

BibTeX

@article{22bf3004e1b9421db9d290926ffa6841,
  title = "Unsupervised strategies for shilling detection and robust collaborative filtering",
  abstract = "Collaborative filtering systems are essentially social systems which base their recommendation on the judgment of a large number of people. However, like other social systems, they are also vulnerable to manipulation by malicious social elements. Lies and Propaganda may be spread by a malicious user who may have an interest in promoting an item, or downplaying the popularity of another one. By doing this systematically, with either multiple identities, or by involving more people, malicious user votes and profiles can be injected into a collaborative recommender system. This can significantly affect the robustness of a system or algorithm, as has been studied in previous work. While current detection algorithms are able to use certain characteristics of shilling profiles to detect them, they suffer from low precision, and require a large amount of training data. In this work, we provide an in-depth analysis of shilling profiles and describe new approaches to detect malicious collaborative filtering profiles. In particular, we exploit the similarity structure in shilling user profiles to separate them from normal user profiles using unsupervised dimensionality reduction. We present two detection algorithms; one based on PCA, while the other uses PLSA. Experimental results show a much improved detection precision over existing methods without the usage of additional training time required for supervised approaches. Finally, we present a novel and highly effective robust collaborative filtering algorithm which uses ideas presented in the detection algorithms using principal component analysis.",
  keywords = "Collaborative filtering, Dimensionality reduction, PCA, PLSA, Robust statistics, Shilling",
  author = "Bhaskar Mehta and Wolfgang Nejdl",
  year = "2009",
  month = feb,
  doi = "10.1007/s11257-008-9050-4",
  language = "English",
  volume = "19",
  pages = "65--97",
  journal = "User Modeling and User-Adapted Interaction",
  issn = "0924-1868",
  publisher = "Springer Netherlands",
  number = "1-2 SPEC. ISS.",
}

RIS

TY  - JOUR
T1  - Unsupervised strategies for shilling detection and robust collaborative filtering
AU  - Mehta, Bhaskar
AU  - Nejdl, Wolfgang
PY  - 2009/2
Y1  - 2009/2
N2  - Collaborative filtering systems are essentially social systems which base their recommendation on the judgment of a large number of people. However, like other social systems, they are also vulnerable to manipulation by malicious social elements. Lies and Propaganda may be spread by a malicious user who may have an interest in promoting an item, or downplaying the popularity of another one. By doing this systematically, with either multiple identities, or by involving more people, malicious user votes and profiles can be injected into a collaborative recommender system. This can significantly affect the robustness of a system or algorithm, as has been studied in previous work. While current detection algorithms are able to use certain characteristics of shilling profiles to detect them, they suffer from low precision, and require a large amount of training data. In this work, we provide an in-depth analysis of shilling profiles and describe new approaches to detect malicious collaborative filtering profiles. In particular, we exploit the similarity structure in shilling user profiles to separate them from normal user profiles using unsupervised dimensionality reduction. We present two detection algorithms; one based on PCA, while the other uses PLSA. Experimental results show a much improved detection precision over existing methods without the usage of additional training time required for supervised approaches. Finally, we present a novel and highly effective robust collaborative filtering algorithm which uses ideas presented in the detection algorithms using principal component analysis.
AB  - Collaborative filtering systems are essentially social systems which base their recommendation on the judgment of a large number of people. However, like other social systems, they are also vulnerable to manipulation by malicious social elements. Lies and Propaganda may be spread by a malicious user who may have an interest in promoting an item, or downplaying the popularity of another one. By doing this systematically, with either multiple identities, or by involving more people, malicious user votes and profiles can be injected into a collaborative recommender system. This can significantly affect the robustness of a system or algorithm, as has been studied in previous work. While current detection algorithms are able to use certain characteristics of shilling profiles to detect them, they suffer from low precision, and require a large amount of training data. In this work, we provide an in-depth analysis of shilling profiles and describe new approaches to detect malicious collaborative filtering profiles. In particular, we exploit the similarity structure in shilling user profiles to separate them from normal user profiles using unsupervised dimensionality reduction. We present two detection algorithms; one based on PCA, while the other uses PLSA. Experimental results show a much improved detection precision over existing methods without the usage of additional training time required for supervised approaches. Finally, we present a novel and highly effective robust collaborative filtering algorithm which uses ideas presented in the detection algorithms using principal component analysis.
KW  - Collaborative filtering
KW  - Dimensionality reduction
KW  - PCA
KW  - PLSA
KW  - Robust statistics
KW  - Shilling
UR  - http://www.scopus.com/inward/record.url?scp=58849149412&partnerID=8YFLogxK
U2  - 10.1007/s11257-008-9050-4
DO  - 10.1007/s11257-008-9050-4
M3  - Article
AN  - SCOPUS:58849149412
VL  - 19
SP  - 65
EP  - 97
JO  - User Modeling and User-Adapted Interaction
JF  - User Modeling and User-Adapted Interaction
SN  - 0924-1868
IS  - 1-2 SPEC. ISS.
ER  - 

