電子學位論文服務

§ 瀏覽學位論文書目資料

本論文電子全文於2007-07-30起於校外公開使用
本論文紙本於2007-07-30起公開使用

系統識別號	U0002-2407200720212300
DOI	10.6846/TKU.2007.00745
論文名稱(中文)	在STPN網頁架構模型建立與應用網頁衡量基準
論文名稱(英文)	Construct and Apply the Web Metrics for STPN Web Structure Model
第三語言論文名稱
校院名稱	淡江大學
系所名稱(中文)	資訊工程學系碩士在職專班
系所名稱(英文)	Department of Computer Science and Information Engineering
外國學位學校名稱
外國學位學院名稱
外國學位研究所名稱
學年度	95
學期	2
出版年	96
研究生(中文)	劉宛青
研究生(英文)	Wan-Ching Liu
學號	794190099
學位類別	碩士
語言別	繁體中文
第二語言別
口試日期	2007-06-15
論文頁數	63頁
口試委員	指導教授 - 陳伯榮委員 - 趙景明委員 - 徐郁輝
關鍵字(中)	網頁使用者習性探勘隨機過程時間派翠網路網頁結構特性衡量網站的基準
關鍵字(英)	web usage mining Stochastic Timed Petri Nets Web graph properties Web metrics
第三語言關鍵字
學科別分類
中文摘要	網頁探勘(Web Mining)是資料探勘(Data Mining)中的一個領域，他將全球資訊網中相關原始資料進一步整理並運用資料探勘的方法，以得到有用的資訊。運用隨機過程時間派翠網路來建構STPN網頁架構模型可以強化網頁使用者習性探勘。本篇論文則應用網頁結構特性（Web graph properties）中的向心度（Centrality）、整體衡量基準（Global Metrics）及局部衡量基準（Local Metrics）以及網頁相似性（Web page similarity）中的使用習性相似性（Usage-Based Similarity）來作為衡量網站的基準（Web metrics），我們在STPN網頁架構模型中加入調整網頁結構的子系統來分析網頁結構特性，提供網頁管理者是否要調整網頁結構的依據，以便增進網頁使用者擷取資訊。面對網頁結構經常修改的問題，我們也探討如何透過漸進的方式來調整網頁結構。
英文摘要	Web Mining is a domain of Data Mining. It is a method to get some helpful information by processing the World Wide Web source data and using the data mining methods. Using Stochastic Timed Petri Nets to construct the web structure model can enhance web usage mining. In this paper, we apply three web graph properties: Centrality, Global Metrics and Local Metrics, and Usage-Based Similarity in Web page similarity to be the web metrics. In STPN web structure model, we add the subsystem that adjust the web structure to analyze the web graph properties. Whether the web administrator adjust the web structure, we provide some helpful information to web administrator for improving web information access. Face the problem of often modifying the web structure, we also discuss how adjusting the web structure by using progressive method.
第三語言摘要
論文目次	目錄Ⅰ 圖目錄Ⅲ 表目錄Ⅳ 第一章緒論1 1.1研究背景與動機1 1.2相關研究1 1.3研究目標2 第二章背景知識3 2.1派翠網路定義3 2.2隨機過程時間派翠網路定義6 2.3使用STPN建構網頁結構模型7 第三章在STPN架構下加強網頁結構特性之分析12 3.1網頁結構衡量基準 12 3.2向心度 13 3.3整體衡量基準 17 3.4局部衡量基準 20 第四章在STPN架構下處理網頁經常修改問題23 4.1以漸進的方法來調整網頁結構23 4.2連結的新增24 4.3網頁的新增26 4.4連結的刪除28 4.5網頁的刪除31 第五章在STPN架構下分析網頁使用記錄34 5.1藉由分析網頁使用記錄建立索引網頁34 5.2案例說明 36 第六章結論與未來研究方向43 參考文獻 45 附錄一計算由A網頁出發至各點之間路徑長度為1~8之總和機率程式碼列表49 附錄二英文論文51 圖目錄圖一 PN Place的表示圖 3 圖二 PN Transition的表示圖3 圖三網頁結構 11 圖四調整後的交叉連結樹狀結構 17 圖五調整後的交叉連結樹狀結構及深度向量、子節點向量22 圖六網站架構圖37 圖七衡量網站基準子系統 44 表目錄表一網站的主要內容8 表二位置名稱與網頁名稱對應表9 表三轉移動作名稱與網頁標籤名稱對應表 10 表四關聯矩陣[Aij]7x8 10 表五代表網頁結構的[Cij]7x7初值15 表六改變後距離矩陣[Cij]7x7及COD、CID、ROC、RIC 15 表七加入g->e後的關聯矩陣[Aij]7x916 表八調整後的改變後距離矩陣[Cij]7x7及COD、CID、ROC、RIC 16 表九調整後的距離矩陣[Dij]7x7及status、contrastatus、absolute prestige19 表十加入g->e之轉移動作名稱與網頁標籤名稱對應表 25 表十一加入g->e後的關聯矩陣[Aij]7x9 25 表十二增加網頁h之位置名稱與網頁名稱對應表26 表十三加入h->e之轉移動作名稱與網頁標籤名稱對應表 27 表十四加入網頁h及h->e後的關聯矩陣 27 表十五刪除e->a後的轉移動作名稱與網頁標籤名稱對應表 30 表十六刪除e->a後的關聯矩陣 30 表十七刪除網頁位置d後的位置名稱與網頁名稱對應表32 表十八刪除e->d及d->a後的轉移動作名稱與網頁標籤名稱對應33 表十九刪除e->d、d->a及網頁d後的關聯矩陣 33 表二十經過前置處理的網頁使用者記錄片斷36 表二十一連結次數矩陣38 表二十二連結機率矩陣38 表二十三任二個網頁間可能路徑之總和機率矩陣40 表二十四經過處理之各網頁間同時引用機率矩陣41 表二十五取完門檻值後之相似度矩陣41
參考文獻	[1] Jaideep Srivastava, Robert Cooley, Mukund Deshpande, and Pan-Ning Tan, “Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data”, SIGKDD Explorations, Vol.1, Issue 2, pp12-23, Jan. 2000. [2] Federico Michele Facca and Pier Luca Lanzi “Recent Development in Web Usage Mining”, Lecture Notes in Computer Science 2727, pp.140-150, 2003. [3] Robert Cooley “The Use of Web Structure and Content to Identify Subjectively Interesting Web Usage Patterns”, ACM Transactions on Internet Technoloey, Vol.3, No.2, ppP.93-116, May 2003. [4] A. Buchner, M. Mulvenna, “Discovering Internet Marketing Intelligence through Online Analytical Web Usage Mining”, SIGMOD Record, Vol.27, No.4, pp.54-61, Dec.1998. [5] Robert Cooley, Pang-Ning Tan, Jaideep Srivastava, ”Discovery of Interesting Usage Patterns from Web Data”, Lecture Notes in Computer Science, 2000. [6] Peter Pirolli, James Pitkow, Ramana Rao , “Silk from a Sow’s Ear:Extracting Usable Structures from the Web”, Conference on Human Factors in Computing Systems, CHI-96, 1996. [7] Myra Spiliopoulou, Carsten Pohle, Lukas C. Faulstich, “Improving the effectiveness of a web site with web usage mining”, WEBKDD, 1999. [8] Jeffrey Heer, Ed H. Chi, “Identification of Web User Traffic Composition using Multi-Modal Clustering and Information “, In Proceedings of the 1st SIAM International Conference on Data Mining Workshop on Web Mining, pp.51-58, 2001. [9] Ellen Spertus, “Parasite: Mining structural information on the Web”, Computer Networks and ISDN Systems: The International Journal of Computer and Telecommunications Networking, pp.1205-1215, April 1997. [10] David Gibson, Jom Kleinberg, and Parabhakar Raschid, ” Inferring web communities from link topology” In Proceedings of the Conference on Hypertext and Hypermedia, pp.225-234, 1998. [11] Marko Balabanovic and Yoav Shoham , “Learning information retrieval agents : Experiments with automated web browsing” In Proceedings of the AAAI Spring Symposium on Information Gathering from Heterogenous, Distributed Environments, 1995. [12] Mark Craven, Dan DiPasquo, Dayne Freitag, Andrew McCallum, Tom Mitchell, Kamal Nigam, and Se an Slattery “Learning to extract symbolic knowledge from the world wide web” In Proceedings of AAAI-98, 15th Conference of the American Association for Artificial Intelligence, pp.509-516, 1998. [13] Jerome Moore, Eui-Hong Han, Daniel Boley, Maria Gini, Robert Gross, Kyle Hastings, George Karypis, Vipin Kumar, and Bamshad Mobashe, “Web page categorization and feature selection using association rule and principal component clustering”, In 7th Workshop on Information Technologies and Systems, Dec. 1997. [14] Raymond Kosala and Hendrik Blockeel, “Web mining research: A survey”, SIGKDD Explorations, Vol.2, Issue 1, pp1-15, July 2000. [15] W. Reisig, “Correctness Proofs of Distributed Algorithms”, Lecture Notes in Computer Science, Vol. 938 : Theory and Practice in Distributed Systems, pp. 164-177, 1995. [16] Devanshu Dhyani, Wee Keong Ng, and Sourav S. Bhowmick, “A Survey of Web Metrics”, ACM Computing Surveys, Vol. 34, No. 4, pp. 469-503, December 2002 [17] Tadao Murata, “Petri Nets: Properties, Analysis and Applications”, Proceedings of the IEEE, Vol. 77, No. 4, 1989. [18] 陳伯榮、楊士央、何仁中，”應用隨機過程時間派翠網路來強化網頁使用者習性探勘” ，二００四數位生活與網際網路科技研討會。Session7C-3，June 24-26，2004，NSC 92-2213-E-032-024. [19] 陳伯榮、楊士央、季振忠、陳清祥、孫初豪，”應用一般隨機過程派翠網路來協助網頁使用者習性探勘中的前置處理”，二００五數位生活與網際網路科技研討會。Session9D-1，June 2-3，2005，NSC 93-2213-E-032-016. [20] Botafogo R., Rivlin E., and Shneiderman B., “Structural analysis of hypertexts: Identifying hierarchies and useful metrics.”, ACM Transaction on Information System, 10, Apr., pp.142-180, 1992. [21] Peterkowitz M., and Etzioni O., “Adaptive web sites: Automatically synthesizing web pages.” In Proceedings of the 15th National Conference on Artificial Intelligence, 1998. [22] Peterkowitz M., and Etzioni O., “Towards adaptive web sites: Conceptual framework and case study.” In Proceedings of the 8th World Wide Web Conference, 1999.
論文全文使用權限	校內：校內紙本論文立即公開同意電子論文全文授權校園內公開校內電子論文立即公開校外：同意授權校外電子論文立即公開

返回頁首

如有問題，歡迎洽詢！
圖書館數位資訊組　(02)2621-5656 轉 2487 或來信