§ 瀏覽學位論文書目資料
  
系統識別號 U0002-2807201801544500
DOI 10.6846/TKU.2018.00921
論文名稱(中文) 中文意見探勘系統-關鍵字搜尋優化的研究
論文名稱(英文) Chinese Opinion Mining System-Research on keyword search optimization
第三語言論文名稱
校院名稱 淡江大學
系所名稱(中文) 資訊工程學系碩士在職專班
系所名稱(英文) Department of Computer Science and Information Engineering
外國學位學校名稱
外國學位學院名稱
外國學位研究所名稱
學年度 106
學期 2
出版年 107
研究生(中文) 江珮儀
研究生(英文) Pei-Yi Chiang
學號 703410166
學位類別 碩士
語言別 繁體中文
第二語言別
口試日期 2018-06-25
論文頁數 70頁
口試委員 指導教授 - 蔣璿東
委員 - 葛煥昭
委員 - 王鄭慈
關鍵字(中) 中文意見探勘系統
資料探勘
關鍵字搜尋
關鍵字(英) Chinese Opinion Mining System
Data Mining
Keyword Search
第三語言關鍵字
學科別分類
中文摘要
現在上網發文、爬文,以及查找資料等行為,儼然而成現代人的生活習慣。搜尋結果往往不考慮與使用者需求契合的程度,只要關鍵字曾出現在標題或內文,就會被當作相關資料回覆給使用者,故關鍵字搜尋的結果可能含有無相關的資料。
鑑於一般關鍵字搜尋的方式,其回覆結果可能摻雜不相關的資料,有礙編輯人員的工作效率及產出比,故本研究提出改善並優化關鍵字搜尋的方式,以減少不相關的文章產生,並針對特定主題或領域優化關鍵字的搜尋方式。
英文摘要
Nowadays, we use keywords to perform searches when sending texts, crawling the Net and finding data on the Internet, but the search engine does not consider the degree of matching with the user's needs. In other words, the general keyword search method works in a way that as long as the keyword appears in the titles or texts of articles, they will appear on the screen as relevant data. Therefore, the results of the keyword search may contain no relevant data. As a result, this study proposes a way to improve and optimize keyword search to reduce irrelevant articles and optimize keyword search for specific subjects or domains.
第三語言摘要
論文目次
目錄
第1章 緒論 1
1.1 研究動機與目的 1
1.2 研究架構 6
第2章 文獻探討 7
2.1 中文意見探勘系統相關研究 7
2.2 本研究室開發的中文意見探勘系統簡介 11
第3章 研究方法 15
3.1 問題陳述 15
3.1.1 新聞網站 15
3.1.2 論壇 20
3.2 關鍵字搜尋演算法 22
3.2.1 排除關鍵字類型 22
3.2.2 新聞網站處理方式 26
3.2.3 論壇處理方式 28
第4章 分析實例 31
4.1 系統架構與環境 31
4.2 資料來源 34
4.3 實驗結果 36
4.3.1 關鍵字搜尋優化實例-以新聞為例 36
4.3.2 關鍵字搜尋優化實例-以論壇為例 42
第5章 結論 44
參考文獻	45
附錄 51

圖目錄
圖1 中文意見探勘系統架構圖 12
圖2 文章下載與配置流程圖 13
圖3 中文意見探勘系統文章分析流程圖 14
圖4 中嘉寬頻與其他不相關的新聞 範例3-1 17
圖5 新竹社會處與其他不相關的新聞 範例3-2 18
圖6 資料來源之不相關文章圖例 19
圖7 新增排除句關鍵字介面 24
圖8 排除句關鍵字 設定 25
圖9 新聞網站關鍵字搜尋演算法 26
圖10 論壇關鍵字搜尋演算法 28
圖11 系統架構圖 32
圖12 系統執行流程圖 33
圖13 新聞頻道爬文收集新聞筆數比例圖 36
圖14 2018/04/01至2018/04/07之關鍵字訓練與篩選相關文章筆數關係折線圖	38
圖15 2018/04/08至2018/04/14之關鍵字訓練與篩選相關文章筆數關係折線圖	39
圖16 2018/04/15至2018/04/21之關鍵字訓練與篩選相關文章筆數關係折線圖	39
圖17 2018/04/22至2018/04/28之關鍵字訓練與篩選相關文章筆數關係折線圖	40
圖18 2018/04/29至2018/05/05之關鍵字訓練與篩選相關文章筆數關係折線圖	40
圖19 2018/05/06至2018/05/12之關鍵字訓練與篩選相關文章筆數關係折線圖	41
圖20 2018/05/13至2018/05/19之關鍵字訓練與篩選相關文章筆數關係折線圖	41
圖21 2018/05/20至2018/05/26之關鍵字訓練與篩選相關文章筆數關係折線圖	42
圖22 2018/05/27至2018/06/02之關鍵字訓練與篩選相關文章筆數關係折線圖	42
圖23 不相關文章被收集之案例 43

表目錄
表1 關鍵字與標題對照表 21
參考文獻
[1] 	Z. Li, M. Zhang, S. Ma, B. Zhou, and Y. Sun, "Automatic Extraction for Product Feature Words from Comments on the Web Information Retrieval Technology." vol. 5839, G. Lee, D. Song, C.-Y. Lin, A. Aizawa, K. Kuriyama, M. Yoshioka, et al., Eds., ed: Springer Berlin / Heidelberg, 2009, pp. 112-123.
[2] 	C. Zhang, D. Zeng, J. Li, F.-Y. Wang, and W. Zuo, "Sentiment analysis of Chinese documents: From sentence to document level," J. Am. Soc. Inf. Sci. Technol., vol. 60, pp. 2474-2487, 2009.
[3] 	M. Chen and T. Yao, "Combining dependency parsing with shallow semantic analysis for Chinese opinion-element relation identification," in Universal Communication Symposium (IUCS), 2010 4th International (pp. 299-305), 2010, pp. 299-305.
[4] 	P. Ting-Chun and S. Chia-Chun, "Using Chinese part-of-speech patterns for sentiment phrase identification and opinion extraction in user generated reviews," in Digital Information Management (ICDIM), 2010 Fifth International Conference on (pp. 120-127), 2010, pp. 120-127.
[5] 	V. Hatzivassiloglou and K. R. McKeown, "Predicting the semantic orientation of adjectives," 1997, pp. 174-181.
[6] 	G. Qiu, B. Liu, J. Bu, and C. Chen, "Expanding domain sentiment lexicon through double propagation," presented at the Proceedings of the 21st international jont conference on Artifical intelligence (pp.1199-1204), Pasadena, California, USA, 2009.
[7] 	M. Hu and B. Liu, "Mining and summarizing customer reviews," presented at the Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 168-177), Seattle, WA, USA, 2004.
[8] 	S.-M. Kim and E. Hovy, "Determining the sentiment of opinions," presented at the Proceedings of the 20th international conference on Computational Linguistics (pp. 1367), Geneva, Switzerland, 2004.
[9] 	S. Bin and C. Kuiyu, "Mining Chinese Reviews," in Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on (pp. 585-589), 2006, pp. 585-589.
[10] 	Y. Qiang, S. Wen, and L. Yijun, "Sentiment Classification for Movie Reviews in Chinese by Improved Semantic Oriented Approach," in System Sciences, 2006. HICSS '06. Proceedings of the 39th Annual Hawaii International Conference on (pp. 53b-53b), 2006, pp. 53b-53b.
[11] 	L. Zhuang, F. Jing, and X.-Y. Zhu, "Movie review mining and summarization," presented at the Proceedings of the 15th ACM international conference on Information and knowledge management, Arlington, Virginia, USA, 2006.
[12] 	X. Ding, B. Liu, and P. S. Yu, "A holistic lexicon-based approach to opinion mining," presented at the Proceedings of the 2008 International Conference on Web Search and Data Mining (pp. 231-240), Palo Alto, California, USA, 2008.
[13] 	L. W. Ku, I. C. Liu, C. Y. Lee, K. Chen, and H. H. Chen, "Sentence-Level Opinion Analysis by CopeOpi in NTCIR-7," in Proceedings of the 7th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, Question Answering, and Cross-Lingual Information Access (pp. 260-267), Tokyo, Japan, 2008, pp. 260-267.
[14] 	S. Morinaga, K. Yamanishi, K. Tateishi, and T. Fukushima, "Mining product reputations on the Web," presented at the Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 341-349), Edmonton, Alberta, Canada, 2002.
[15] 	G. A. Miller. (1980). WordNet. Available: http://wordnet.princeton.edu/
[16] 	P. J. Stone, D. C. Dunphy, and M. S. Smith, "The General Inquirer: A Computer Approach to Content Analysis," 1966.
[17] 	Esuli and F. Sebastiani, "Sentiwordnet: A publicly available lexical resource for opinion mining," 2006, pp. 417-422.
[18] 	Ohana and B. Tierney, "Sentiment classification of reviews using SentiWordNet," in 9th. IT & T Conference, 2009, p. 13.
[19] 	G. Qiu, B. Liu, J. Bu, and C. Chen, "Opinion Word Expansion and Target Extraction through Double Propagation," Computational Linguistics, vol. 37, pp. 9-27, 2011/03/01 2011.
[20] 	L.-W. Ku, H.-W. Ho, and H.-H. Chen, "Opinion mining and relationship discovery using CopeOpi opinion analysis system," Journal of the American Society for Information Science and Technology, vol. 60, pp. 1486-1503, 2009.
[21] 	L. Chien-Liang, H. Wen-Hoar, L. Chia-Hoang, L. Gen-Chi, and E. Jou, "Movie Rating and Review Summarization in Mobile Environment," Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on, vol. 42, pp. 397-407, 2012.
[22] 	蔡伊玲, "使用上下文關聯性改善中文意見探勘系統的效能," 淡江大學資訊工程學系資訊網路與多媒體碩士班碩士論文, 2017.
[23] 	劉珍惠, "應用於中文意見探勘系統之日報功能設計與實作," 淡江大學資訊工程學系資訊工程學系碩士班碩士論文, 2018.
論文全文使用權限
校內
紙本論文於授權書繳交後5年公開
同意電子論文全文授權校園內公開
校內電子論文於授權書繳交後5年公開
校外
同意授權
校外電子論文於授權書繳交後5年公開

如有問題,歡迎洽詢!
圖書館數位資訊組 (02)2621-5656 轉 2487 或 來信