§ 瀏覽學位論文書目資料
  
系統識別號 U0002-2807201815010300
DOI 10.6846/TKU.2018.00922
論文名稱(中文) 使用關聯式分類器於中文意見探勘系統之應用
論文名稱(英文) Application of Chinese Opinion exploration system with Associative Classification
第三語言論文名稱
校院名稱 淡江大學
系所名稱(中文) 資訊工程學系碩士在職專班
系所名稱(英文) Department of Computer Science and Information Engineering
外國學位學校名稱
外國學位學院名稱
外國學位研究所名稱
學年度 106
學期 2
出版年 107
研究生(中文) 劉鴻郁
研究生(英文) Hong-Yu Liu
學號 705410172
學位類別 碩士
語言別 繁體中文
第二語言別 英文
口試日期 2018-06-25
論文頁數 53頁
口試委員 指導教授 - 蔣璿東
委員 - 葛煥昭
委員 - 王鄭慈
關鍵字(中) 關聯規則
關聯式分類器
資料探勘
關鍵字(英) Association Rules
Associative Classification
Data mining
第三語言關鍵字
學科別分類
中文摘要
相關研究指出,爭取一個新客戶的成本,是維繫一個舊客戶成本的五倍,而網友常會在網路上表達更換或勸人更換原業者的文章。所以本研究將以網際網路服務供應商(Internet Service Provider,ISP)為研究對象,冀望能盡快找出這些文章,爭取時效來處理以減少顧客流失。所以本研究將利用中文意見探勘系統找出可能在表達更換或勸人更換原業者的完整句(opinion sentences),再使用AC分類器對這些完整句做進一步的分類。但由於原意見探勘系統是針對口碑分析所設計,無法準確表達這類完整句,導致分類效能不佳。本研究在不影響原意見探勘系統功能的前提下,(1)修改系統原完整句表達方式:將含動詞的完整句由四個意見元素的組合改成由七個意見元素的組合而成;(2) 新增回推功能以填補某些完整句所缺的面相資訊。經實驗證明,強化完整句表達的精準度確實可以大幅改善分類的結果。
英文摘要
Studies show that gaining one customer costs five times as much as keeping one customer, and web users often use the Internet to express a desire to change providers or encourage others to switch.  This study uses Internet Service Providers (ISPs) as the study target, hoping to find these writings as fast as possible in an effort to reduce the loss of custom-ers.  This study will make use of Chnese-language opinion mining systems to search for complete sen-tences (opinion sentences) that might be expressing a desire to change providers or encouraging others to switch.  It will also use AC classifiers to perform further classification on these sentences.  But be-cause the opinion mining systems were designed for analyzing word of mouth reputations, current com-plete sentences cannot accurately this type of textual opinions, resulting in poor classification.  On the premise of not affecting the opinion mining system, this study will: 
1.	.revise the system's sentence conveyance method, changing from four opinion ele-ments to seven opinion elements for sen-tences containing verbs; 
2.	.introduce a pushback function to fill in exterior information that is lacking in some sentences.  
As evidenced by experiments, strengthening  the accuracy of complete sentence expression could significantly improve classification results.
第三語言摘要
論文目次
第一章	緒論	1
1-1 研究動機與目的	1
1-2 論文架構	3
第二章	文獻探討	5
2-1 中文意見探勘系統相關研究	5
2-2 關聯規則 (Associative Rule)	10
2-3 關聯式分類 (Associative Classification)	12
第三章	研究方法	15
3-1 問題陳述	15
3-2 研究流程圖	18
3-2-1 新完整句表達方式的介紹	20
3-2-2 回推功能的介紹	21
3-2-3 人工標記分類	22
3-2-4 資料預處理	24
3-2-5 AC分類器的建立	26
第四章	實驗結果	27
4-1 資料來源	27
4-2 新完整句表達方式對AC分類器的影響分析	30
4-3 推功能對關聯分類器的影響分析	33
第五章	結論與未來展望	37
參考資料	38


 
圖目錄
圖 1支持度及信賴度定義	12
圖 2一般AC分類器規則排序方法	14
圖 3研究流程圖	18

 
表目錄
表 1意見元素定義表	10
表 2新完整句的表達方式	20
表 3回推後資料範例	22
表 4分類類別為Yes的資料	23
表 5衍生性屬性定義表	24
表 6實驗資料欄位	25
表 7頻道列表	28
表 8完整句統計	29
表 9篇數統計	30
表10 a原完整句表達方式-完整句統計	32
表10 b新完整句表達方式-完整句統計	32
表11 a原完整句表達方式-篇數統計	32
表11 b新完整句表達方式-篇數統計	32
表12 a回推前RC-AC分類器-完整句統計	34
表12 b回推後RC-AC分類器-完整句統計	34
表13 a回推前RC-AC分類器-篇數統計	34
表13 b回推後RC-AC分類器-篇數統	34
表14 1規則差異	35
表15  a回推前YN-AC分類器-完整句統計	36
表15  b回推後YN-AC分類器-完整句統計	36
表16  a回推前YN-AC分類器-篇數統計	36
表16  b回推後YN-AC分類器-篇數統計	36
參考文獻
1.	Z. Li, M. Zhang, S. Ma, B. Zhou, and Y. Sun, "Automatic Extraction for Product Feature Words from Comments on the Web Information Retrieval Technology." vol. 5839, G. Lee, D. Song, C.-Y. Lin, A. Aizawa, K. Kuriyama, M. Yoshioka, et al., Eds., ed: Springer Berlin / Heidelberg, 2009, pp. 112-123.
2.	C. Zhang, D. Zeng, J. Li, F.-Y. Wang, and W. Zuo, "Sentiment analysis of Chinese documents: From sentence to document level," J. Am. Soc. Inf. Sci. Technol., vol. 60, pp. 2474-2487, 2009.
3.	P. Ting-Chun and S. Chia-Chun, "Using Chinese part-of-speech patterns for sentiment phrase identification and opinion extraction in user generated reviews," in Digital Information Management (ICDIM), 2010 Fifth International Conference on (pp. 120-127), 2010, pp. 120-127.
4.	G. Qiu, B. Liu, J. Bu, and C. Chen, "Expanding domain sentiment lexicon through double propagation," presented at the Proceedings of the 21st international jont conference on Artifical intelligence (pp.1199-1204), Pasadena, California, USA, 2009.
5.	M. Hu and B. Liu, "Mining and summarizing customer reviews," presented at the Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 168-177), Seattle, WA, USA, 2004.
6.	S.-M. Kim and E. Hovy, "Determining the sentiment of opinions," presented at the Proceedings of the 20th international conference on Computational Linguistics (pp. 1367), Geneva, Switzerland, 2004.
7.	] S. Bin and C. Kuiyu, "Mining Chinese Reviews," in Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on (pp. 585-589), 2006, pp. 585-589.
8.	Y. Qiang, S. Wen, and L. Yijun, "Sentiment Classification for Movie Reviews in Chinese by Improved Semantic Oriented Approach," in System Sciences, 2006. HICSS '06. Proceedings of the 39th Annual Hawaii International Conference on (pp. 53b-53b), 2006, pp. 53b-53b.
9.	L. Zhuang, F. Jing, and X.-Y. Zhu, "Movie review mining and summarization," presented at the Proceedings of the 15th ACM international conference on Information and knowledge management, Arlington, Virginia, USA, 2006.
10.	X. Ding, B. Liu, and P. S. Yu, "A holistic lexicon-based approach to opinion mining," presented at the Proceedings of the 2008 International Conference on Web Search and Data Mining (pp. 231-240), Palo Alto, California, USA, 2008.
11.	L. W. Ku, I. C. Liu, C. Y. Lee, K. Chen, and H. H. Chen, "Sentence-Level Opinion Analysis by CopeOpi in NTCIR-7," in Proceedings of the 7th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, Question Answering, and Cross-Lingual Information Access (pp. 260-267), Tokyo, Japan, 2008, pp. 260-267.
12.	S. Morinaga, K. Yamanishi, K. Tateishi, and T. Fukushima, "Mining product reputations on the Web," presented at the Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 341-349), Edmonton, Alberta, Canada, 2002.
13.	G. A. Miller. (1980). WordNet. Available: http://wordnet.princeton.edu/
14.	P. J. Stone, D. C. Dunphy, and M. S. Smith, "The General Inquirer: A Computer
15.	Approach to Content Analysis," 1966.
16.	A. Esuli and F. Sebastiani, "Sentiwordnet: A publicly available lexical resource for opinion mining," 2006, pp. 417-422.
17.	B. Ohana and B. Tierney, "Sentiment classification of reviews using SentiWordNet," in 9th. IT & T Conference, 2009, p. 13.
18.	L.-W. Ku, H.-W. Ho, and H.-H. Chen, "Opinion mining and relationship discovery using CopeOpi opinion analysis system," Journal of the American Society for Information Science and Technology, vol. 60, pp. 1486-1503, 2009.
19.	] L. Chien-Liang, H. Wen-Hoar, L. Chia-Hoang, L. Gen-Chi, and E. Jou, "Movie Rating and Review Summarization in Mobile Environment," Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on, vol. 42, pp. 397-407, 2012.
20.	G. Qiu, B. Liu, J. Bu, and C. Chen, "Opinion Word Expansion and Target Extraction through Double Propagation," Computational Linguistics, vol. 37, pp. 9-27, 2011/03/01 2011.

21.	D. Peppers, M. Rogers, and S. Sengupta, “The one to one future,” International Business Review, vol. 4, no. 4, pp. 541-543, //, 1995. 
22.	J. Heskett, C. Hart, and W. E. Sasser Jr., Service Breakthroughs: Changing the Rules of the Game. : NY: Free Press, 1990. 
23.	Dr. Rui-Dong Chiang  and Yi-Ling Cai "Improving the Performance of Chinese Opinion-Mining System by Context Dependent "
24.	L. W. Ku, I. C. Liu, C. Y. Lee, K. Chen, and H. H. Chen, "Sentence-Level Opinion Analysis by CopeOpi in NTCIR-7," in Proceedings of the 7th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, Question Answering, and Cross-Lingual Information Access (pp. 260-267), Tokyo, Japan, 2008, pp. 260-267. 
25.	A Esuli and F. Sebastiani, "Sentiwordnet: A publicly available lexical resource for opinion mining," 2006, pp. 417-422. 
26.	R. Agrawal, T. Imielinski, A. Swami, “Mining Association Rules between Sets of Items in Large Databases,” Proceedings of the 1993 ACM SIGMOD Conferencen on Management of Data, Washington D.C., pp. 207-216, May 1993. 
27.	F. Thabtah, “A review of associative classification mining,” The Knowledge Engineering Review, vol. 22, no. 01, pp. 37-65, 2007. 
28.	B. L. W. H. Y. Ma, "Integrating classification and association rule mining." 
29.	W. Li, J. Han, and J. Pei, "CMAR: Accurate and efficient classification based on multiple class-association rules." pp. 369-376.
30.	E. Baralis, and P. Garza, "A lazy approach to pruning classification rules." pp. 35-42. 
31.	K. Wang, Y. He, and D. W. Cheung, "Mining confident rules without support requirement." pp. 89-96. 
32.	K. Wang, S. Zhou, and Y. He, "Growing decision trees on support-less association rules." pp. 265-269. 
33.	C.-H. Chen, R.-D. Chiang, C.-M. Lee, and C.-Y. Chen, “Improving the performance of association 
34.	classifiers by rule prioritization,” Knowledge-Based Systems, vol. 36, pp. 59-67, 2012
35.	MMAC: A New Multi-class, Multi-label Associative Classification Approach 
36.	Jeng-Chi Yang"Associative Classifier with Multi-Ranking"
論文全文使用權限
校內
紙本論文於授權書繳交後5年公開
同意電子論文全文授權校園內公開
校內電子論文於授權書繳交後5年公開
校外
同意授權
校外電子論文於授權書繳交後5年公開

如有問題,歡迎洽詢!
圖書館數位資訊組 (02)2621-5656 轉 2487 或 來信