電子學位論文服務

§ 瀏覽學位論文書目資料

本論文電子全文於2020-07-27起於校外公開使用
本論文紙本於2020-07-27起公開使用

系統識別號	U0002-2507202018510400
DOI	10.6846/TKU.2020.00739
論文名稱(中文)	比較遷移學習與推薦系統於電影資料上之應用
論文名稱(英文)	A Comparison between Transfer Learning and Recommended System with Application in Movie Data
第三語言論文名稱
校院名稱	淡江大學
系所名稱(中文)	大數據分析與商業智慧碩士學位學程
系所名稱(英文)	Master's Program In Big Data Analytics and Business Intelligence
外國學位學校名稱
外國學位學院名稱
外國學位研究所名稱
學年度	108
學期	2
出版年	109
研究生(中文)	王鼎堯
研究生(英文)	Ting-Yao Wang
學號	607890125
學位類別	碩士
語言別	繁體中文
第二語言別
口試日期	2020-07-01
論文頁數	43頁
口試委員	指導教授 - 陳景祥共同指導教授 - 陳怡如委員 - 鄧文舜委員 - 何宗武
關鍵字(中)	iMDb資料庫 Movielens資料庫推薦系統遷移學習交叉驗證
關鍵字(英)	Movielens Database iMDb Database Recommended System Transfer Learning Cross Validation
第三語言關鍵字
學科別分類
中文摘要	本研究主要利用推薦系統演算法之中的基於使用者協同過濾演算法(user-based collaborative filtering)，以及遷移學習演算法(transfer learning)來預測使用者對電影的評分，基於這個評分來決定是否要推薦此電影給使用者。在使用推薦系統演算法常會遇到冷啟動問題(cold start)，導致模型預測及推薦效果不佳，遷移學習演算法的主要目的為解決數據標記不易或資料取得困難。在遷移學習訓練中，本研究加入電影的標籤基因組(tag genome)來當作變數，本研究嘗試利用遷移學習演算法來改善冷啟動的問題，提升推薦系統模型的效能。　　本研究模型為預測一連續型數值變數，因此評估模型表現優異之準則使用均方根誤差(Root Mean Square Error, RMSE)、平均絕對誤差(Mean Absolute Error, MAE)以及對稱平均絕對百分比誤差(Symmetric mean absolute percentage error, sMAPE)。本研究實證結果顯示，利用iMDb的平均評分來做遷移學習預測Movielens的評分，此種模型效果不佳，因此本研究建議先使用協同過濾演算法，如需要遷移學習演算法，須尋找其他資料來作為預訓練資料。
英文摘要	To predict the rating of movies by users, this study applies two techniques, which are User-based Collaborative Filtering and Transfer Learning. Recommendation of movies depends on its rating. Cold start problem usually happens when applying recommended system and it will affect the result of model predictions and recommendations. Nonetheless,Transfer Learning could tackle this problem and it also help to solve the difficulty of data labeling and the difficulty of collect data. For the training of transfer learning, this paper uses “tag genome” as variables. This study tries to use transfer learning to improve the problems with cold start so that it can upgrade the efficiency of transfer learning. 　　We use Root Mean Square Error, Mean Absolute Error, and Symmetric Mean Absolute Percentage Error to evaluate the continuous target variable. The outcome of this study reveals that the performance is poor when we use transfer learning with average rating of iMDb database. Hence, this study recommends applying User-based Collaborative Filtering as priority. If Transfer Learning must be used, it needs to find more data other than target data for better pretraining.
第三語言摘要
論文目次	目錄目錄 I 表目錄 III 圖目錄 IV 第一章緒論 1 1.1　研究背景 1 1.2　研究動機與目的 2 1.3　研究流程 3 第二章文獻回顧 4 2.1　推薦系統(RECOMMENDER SYSTEM) 4 2.1.1　基於人口使用的推薦演算法 4 2.1.2　基於內容的推薦演算法 4 2.1.3　協同過濾推薦演算法 5 2.1.4　混和推薦 6 2.2　推薦系統常見問題 7 2.2.1　資料稀疏性： 7 2.2.2　可擴展性： 7 2.2.3　冷啟動： 7 2.3　遷移學習(TRANSFER LEARNING) 8 2.4　交叉驗證(CROSS-VALIDATION) 9 2.4.1　截留確認法： 9 2.4.2　K折交叉驗證法： 10 2.4.3　留一驗證法： 10 2.5　標籤基因(TAG-GENOME) 10 2.6　對稱平均絕對百分比誤差(SMAPE) 11 第三章研究方法 12 3.1　資料預處理 12 3.2　推薦系統演算法 16 3.3　遷移學習應用於推薦系統 18 3.4　模型評估指標 22 3.4.1　均方根誤差(RMSE) 22 3.4.2　平均絕對誤差(MAE) 22 3.4.3　對稱平均絕對百分比誤差(sMAPE) 22 3.5　交叉驗證 23 第四章實證結果與分析 25 4.1　分析環境 25 4.2　資料敘述 26 4.3　實驗結果 30 4.3.1　基於使用者的協同過濾推薦系統模型 30 4.3.2　遷移學習推薦系統模型 32 4.3.3　模型比較 38 第五章結論 39 5.1　研究結果 39 5.2　研究限制 40 5.3　建議和未來展望 40 參考文獻 41 表目錄表1　使用者及商品矩陣 6 表2　電影資料集 13 表3　TOY STORY電影類型 13 表4　電影基因標籤 14 表5　電影基因分數 14 表6　整理過後的電影基因資料集 14 表7　預訓練資料 15 表8　目標訓練資料 15 表9　電影評分 16 表10　電影評分矩陣 16 表11　協同過濾示意圖 17 表12　分析環境 25 表13　電影評分摘要 27 表14　基於使用者協同過濾推薦系統模之評估 30 表15　基於使用者協同過濾推薦系統模型之交叉驗證評估 31 表16　預訓練模型訓練過程迭代次數 33 表17　預訓練模型模型之交叉驗證評估 34 表18　類神經網路參數 35 表19　遷移模型訓練過程迭代次數 36 表20　遷移模型推薦系統模型之交叉驗證評估 37 表21　兩種推薦系統模型之評估指標 38 圖目錄圖1　研究流程圖 3 圖2　KNN中K=3示意圖 18 圖3　類神經網路示意圖 19 圖4　遷移學習示意圖 21 圖5　交叉驗證示意圖 24 圖6　電影類別數量圖 29
參考文獻	參考文獻中文文獻方菲，2014。开放网络学习环境中的推荐系统研究，安徽大学。洪崇富，2008。基於使用者行為之網頁推薦系統，淡江大學資訊工程學系碩士在職專班碩士論文。郭珉辰，2019。資料探勘技術在信用卡不平衡資料上之應用，淡江大學大數據分析與商業智慧碩士學位學程碩士論文。廖瑋婕，2018。串流資料演算法於推薦系統的應用，淡江大學統計學系應用統計學碩士班碩士論文。英文文獻 Adomavicius G., and Tuzhilin A., 2005. Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions, IEEE Transactions on Knowledge and Data Engineering, 17(6), pp. 734-749. Biadsy N., Rokach L., and Shmilovici A., 2013. Transfer learning for content-based recommender systems using tree matching, Availability Reliability and Security in Information Systems and HCI, Berlin, Germany:Springer, pp. 387-399. Goldberg D., Nichols D., Oki BM., and Terry D., 1992. Using collaborative filtering to weave an information tapestry, Communications of the ACM, 35(12), pp. 61-70. Hahsler M., 2011. Recommenderlab: A framework for developing and testing recommendation algorithms. Herlocker J. L., Konstan J. A., and Riedl J., 2000. Explaining cooaborative filtering recommendations, CSCW ’00 Proceedings of the 2000 ACM conference on Computer supported cooperative work, pp. 241-250. Linden G., Smith B., and York J., 2003. Amazon.com recommendations: Item-to-Item collaborative filtering, IEEE Internet Computing, 7(1), pp. 76-80. Meade N., and Armstrong J.S., 1986. Long Range Forecasting: From Crystal Ball to Computer (2nd Edition), The Journal of the Operational Research Society, 37, pp. 533. Pan S., and Yang Q., 2010. A survey on transfer learning, IEEE Transactions on Knowledge and Data Engineering, 22(10), pp. 1345-1359. Perlich C., Dalessandro B., Raeder T., Stitelman O., and Provost F., 2014. Machine learning for targeted display advertising: Transfer learning in action, Machine Learn, 95(1), pp. 103-127. Resnick P., Iacovou N., Suchak M., Bergstrom P., and Riedl J., 1994. GroupLens: an open architecture for collaborative filtering of netnews, Proceedings of the Conference on CSCW '94, pp. 175-186. Schafer J. B., Konstan J., Riedl J., 1999. Recommender Systems in E-Commerce, EC ‘99: Proceedings of the First ACM Conference on Electronic Commerce, Denver, CO, pp. 158-166. Shin H., Roth H., Gao M., Lu L., Xu Z., Nogues I., Yao J., Mollura D., and Summers R., 2016. Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning, IEEE Transactions on Medical Imaging, 35(5), pp. 1285-1298. Vig J., Sen S., and Riedl J., 2012. The tag genome: Encoding community knowledge to support novel interaction, ACM Transactions on Interactive Intelligent Systems, 2(3), pp. 13:1-13:44. Voumard A., and Beydoun G., 2019. Transfer learning in credit risk, ECML PKDD, pp. 1-16.
論文全文使用權限	校內：校內紙本論文立即公開同意電子論文全文授權校園內公開校內電子論文立即公開校外：同意授權校外電子論文立即公開

返回頁首

如有問題，歡迎洽詢！
圖書館數位資訊組　(02)2621-5656 轉 2487 或來信