Title: 植基於分群演算法的混合式合作過濾推薦系統
A Hybrid Collaborative Filtering Recommender System Based on Clustering Algorithm
Author: 程閎廉
Cheng, Hung-Lien
Keywords: Recommender System
Collaborative Filtering
Clustering Techniques
Content-based Filtering
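Abstract: Collaborative filtering is currently one of the most widely used recommendation techniques. A traditional collaborative filtering recommender system takes the rating matrix of preference scores that users have given to the items they have used, computes the similarity between users, and then uses the rating records of like-minded users to recommend items that the target user is likely to enjoy or find interesting. However, rating records alone provide rather limited information as a basis for similarity computation. Taking into account the features of the items and the usage preferences of the users should improve recommendation accuracy. Using the MovieLens dataset as an example, this study proposes a hybrid collaborative filtering recommender system based on a clustering algorithm. Movies are first clustered according to both their genre features and the ratings given by users, which produces groups of highly homogeneous movies. Likewise, the similarity between users is computed from both the movie genres they prefer to watch and the rating matrix. For rating prediction we use a hybrid method: in addition to the prediction scheme of traditional collaborative filtering, it also takes the homogeneous movie clusters into account. When the sparsity of the dataset leaves too little information for traditional methods to recommend effectively, the proposed method still maintains stable and relatively accurate recommendations.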
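As an illustration of the traditional user-based collaborative filtering baseline described above, the following is a minimal sketch and not the thesis's implementation. It assumes Pearson correlation as the similarity measure and a mean-centred weighted average as the prediction rule; the abstract names neither. The function names `pearson` and `predict` and the toy `ratings` data are hypothetical.

```python
from math import sqrt


def pearson(ra, rb):
    """Pearson correlation over the items both users have rated (assumed measure)."""
    common = [i for i in ra if i in rb]
    if len(common) < 2:
        return 0.0
    mean_a = sum(ra[i] for i in common) / len(common)
    mean_b = sum(rb[i] for i in common) / len(common)
    num = sum((ra[i] - mean_a) * (rb[i] - mean_b) for i in common)
    den = sqrt(sum((ra[i] - mean_a) ** 2 for i in common)) * sqrt(
        sum((rb[i] - mean_b) ** 2 for i in common)
    )
    return num / den if den else 0.0


def predict(ratings, user, item):
    """Predict a rating from positively correlated users who rated the item.

    Falls back to the user's own mean rating when no such neighbour exists,
    which is the situation a sparse rating matrix tends to produce.
    """
    user_mean = sum(ratings[user].values()) / len(ratings[user])
    num = den = 0.0
    for other, other_ratings in ratings.items():
        if other == user or item not in other_ratings:
            continue
        sim = pearson(ratings[user], other_ratings)
        if sim <= 0:
            continue  # ignore dissimilar users
        other_mean = sum(other_ratings.values()) / len(other_ratings)
        num += sim * (other_ratings[item] - other_mean)
        den += sim
    return user_mean + num / den if den else user_mean


# Hypothetical toy ratings on a 1-5 scale (user -> {movie: rating}).
ratings = {
    "u1": {"m1": 5, "m2": 3, "m3": 4},
    "u2": {"m1": 4, "m2": 2, "m3": 3, "m4": 4},
    "u3": {"m1": 1, "m2": 5, "m4": 2},
}
print(predict(ratings, "u1", "m4"))
```

In this toy data only `u2` correlates positively with `u1`, so the prediction for `m4` rests on a single neighbour. With even fewer ratings the function falls back to the user's mean, which illustrates why ratings alone can be too little information.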
Collaborative filtering is one of the most popular recommendation techniques. The traditional collaborative filtering approach mainly uses a matrix of users' ratings of items to calculate the similarity between users. If the data set provides features of users or items in addition to the rating data, those features can be used to improve the quality of recommendations. In this thesis, we propose a hybrid recommender system based on clustering and collaborative filtering techniques. In the proposed system, items are clustered based on item features and the user-item rating matrix. Similarly, users are clustered based on their preferred item categories and the user-item rating matrix. A hybrid method that combines content-based and collaborative filtering is then used to predict the rating of an item for a given user. The experimental results show that the proposed method achieves a lower mean absolute error, and hence higher accuracy, than the user-based collaborative filtering approach, the item-based collaborative filtering approach, Clustering Items for Collaborative Filtering (CICF), and the User Profile Clustering (UPC) method. In particular, when the data set is sparse, the proposed method is more accurate and more stable than the other methods.
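The evaluation metric and the idea of combining a collaborative estimate with item-cluster information can be sketched as follows. This is an illustrative sketch under stated assumptions, not the method of the thesis. The linear blend, the weight `alpha`, the fallback rule, and the names `cluster_estimate`, `hybrid_predict`, and `item_cluster` are all hypothetical. Cluster assignments are taken as given; the clustering step itself is not shown.

```python
def mean_absolute_error(actual, predicted):
    """MAE: average absolute difference between true and predicted ratings."""
    if not actual or len(actual) != len(predicted):
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def cluster_estimate(user_ratings, item, item_cluster):
    """Mean of the user's ratings on other items in the target item's cluster.

    Returns None when the item has no cluster or the user rated nothing in it.
    """
    target = item_cluster.get(item)
    if target is None:
        return None
    same = [
        r
        for i, r in user_ratings.items()
        if i != item and item_cluster.get(i) == target
    ]
    return sum(same) / len(same) if same else None


def hybrid_predict(cf_estimate, cluster_est, alpha=0.5):
    """Blend the two estimates (assumed rule); use whichever exists if one is missing."""
    if cf_estimate is None:
        return cluster_est
    if cluster_est is None:
        return cf_estimate
    return alpha * cf_estimate + (1 - alpha) * cluster_est


# Hypothetical cluster labels for four movies and one user's ratings.
item_cluster = {"m1": 0, "m2": 1, "m3": 0, "m4": 0}
user_ratings = {"m1": 5, "m2": 3, "m3": 4}

est = cluster_estimate(user_ratings, "m4", item_cluster)
print(hybrid_predict(None, est))      # no usable neighbours: cluster estimate only
print(hybrid_predict(4.0, est, 0.5))  # both sources available: blended
print(mean_absolute_error([4, 3, 5], [4.5, 3, 4]))
```

The fallback branch mirrors the sparse-data case in the abstract: when no collaborative estimate is available, a prediction can still be made from the user's ratings on similar items.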
