淡江大學覺生紀念圖書館 (TKU Library)
進階搜尋


下載電子全文限經由淡江IP使用) 
系統識別號 U0002-2406200610465900
中文論文名稱 分散式垃圾郵件過濾系統之設計與實作
英文論文名稱 A Design and Implementation of Distributed SPAM Mail Filter System
校院名稱 淡江大學
系所名稱(中) 資訊管理學系碩士班
系所名稱(英) Department of Information Management
學年度 94
學期 2
出版年 95
研究生中文姓名 廖明禮
研究生英文姓名 Ming-Li Liau
電子信箱 minli@mail.im.tku.edu.tw
學號 691520273
學位類別 碩士
語文別 中文
口試日期 2006-05-20
論文頁數 48頁
口試委員 指導教授-梁德昭
委員-廖賀田
委員-陳彥良
委員-尹邦嚴
中文關鍵字 分散式  垃圾郵件  SPAM  郵件過濾 
英文關鍵字 SPAM  mail filter  junk mail  distributed system 
學科別分類 學科別社會科學管理學
學科別社會科學資訊科學
中文摘要 電子郵件的普及使用,已成為E化不可或缺的溝通管道。然而,藉由網路的低成本及無遠弗屆,卻被有心人士利用來散發數不盡的廣告信。近年來,由於廣告信的遽增,許多學者紛紛提出郵件過濾之研究。在IT產業中,也逐漸發展出不同解決方案的垃圾郵件過濾系統。
在過濾系統中,郵件處理量必然會影響速度。當處理量大時,郵件過濾系統將會成效不彰,並失去其郵件的時效性。在目前各個過濾系統中,針對單一郵件網域的超高負載情況,尚無解決之道。
本研究目的,將以分散式架構來規劃垃圾郵件過濾系統,並取其以往系統之優點,改良缺點而加以實作,解決現有垃圾郵件過濾系統在超高負載時的瓶頸。然而,透由網頁介面,提供使用者回饋機制,並將所誤判之郵件加以修正及學習,減少再次誤判的機會。
英文摘要 To use E-mail is an indispensable for the way of communication in the E-generation. However, someone make capital of it to send countless advertisement letters by the advantage of network that has low cost and endless scope. Due to advertisement letters violently increase, a lot of scholars bring up the investigations into mails of percolation in the several years. People progressively develop various solutions of SPAM mail filter system.

When handle a large number of mails that can effect the speed inevitably. SPAM mail filter system will inefficiency and that can loss the mails timeliness when the system deal with a great quantity of mails. There are no way of solutions aim at an overload situation by single mail domain in every SPAM mail filter system presently.

The purpose of this study, we use distributed architecture to design the spam mail filter system. We take some advantage and ameliorate some drawback from old system to construct IM-SPAM system. According to experimental result, the IM-SPAM system solved the bottleneck in system is overloaded. Although we also provided a web-based interface for administrative and user can do the system configuration and set up their filtering policies and feed back spam mail information. This system will analyse feedback information and mistrial fixed. That will reduce wrong decision again.
論文目次 IV
中文摘要............................................. I
英文摘要............................................ II
目錄............................................... IV
圖目錄..............................................VII
表目錄.............................................. IX
1. 緒論............................................ 1
1.1. 研究背景..................................... 1
1.2. 研究動機..................................... 1
1.3. 研究目的..................................... 2
1.4. 論文架構..................................... 2
2. 相關研究......................................... 4
2.1. 郵件防堵技術分析: ............................ 4
2.1.1. 黑名單分享(RBL) .......................... 4
2.1.2. 貝氏過濾法(Bayesian Filtering) .......... 4
2.1.3. 貝氏過濾搭配啟發式過濾.................... 5
2.1.4. SPF(Sender Policy Framework) .............. 5
2.1.5. DNS 反查................................. 6
3. 系統架構......................................... 7
V
3.1. IM-SPAM 組成元件.............................. 8
3.1.1. DNS System ............................... 8
3.1.2. Database System .......................... 8
3.1.3. Core System.............................. 9
3.1.4. Analysis/Report System.................... 9
3.2. Core System 架構.............................. 9
3.3. Alalysis/Report System ....................... 13
3.4. IM-SPAM 運作模式............................. 13
4. 系統實作........................................ 17
4.1. IM-SPAM Gateway 開發平台與工具................ 18
4.2. IM-SPAM Gateway 規劃圖........................ 19
4.3. Database System .............................. 19
4.4. DNS System................................... 21
4.5. Core System.................................. 24
4.6. Analysis/Report System ....................... 28
4.7. IM-SPAM Gateway 運作流程...................... 32
5. 系統安裝與實測.................................. 35
6. 結論與未來方向.................................. 39
6.1. 結論與貢獻................................... 39
VI
6.2. 未來方向.................................... 39
參考文獻............................................ 40
附錄1............................................... 44

圖目錄
圖3-1. 組成IM-SPAM 的四大系統元件...................... 8
圖3-2. IM-SPAM Gateway 運作模式....................... 16
圖4-1. IM-SPAM 內系統元件及子元件..................... 17
圖 4-2. 分散式IM-SPAM Gateway 架構圖.................. 19
圖 4-3. Master Database Config 檔...................... 20
圖 4-4.Slave Database Config 檔....................... 21
圖 4-5. /etc/named.conf 設定檔中ACL 與VIEW 的設定....... 22
圖 4-6. ./spam/im.tku.edu.tw 設定檔................... 23
圖4-7. ./external/im.tku.edu.tw 設定檔................ 23
圖4-8. 透由ACL 及VIEW 設定後,DNS 運作方式............. 24
圖 4-9. Analysis/Report System 運作流程................ 29
圖4-10. 使用者郵件攔截清單........................... 32
圖4-11 IM-SPAM 整體運作流程........................... 33
圖5-1. IM-SPAM 實測時,CPU 一天的負載圖................ 37
圖 5-2. 經過IM-SPAM 郵件過濾後的Mail Header 資訊....... 37
圖 5-3. 使用者完整郵件清單(包含正常郵件與垃圾郵件) ..... 38

表目錄
表 5-1. 95/5/1~95/5/10 總信件、垃圾郵件及正常郵件數量.. 36
參考文獻 [1] 張世敏(2001), qmail 快速入門,台北:博碩
[2] 陳瑞順、胡駿彥,“以代理人為基礎之客服電子郵件自動回覆
系統”,國立交通大學資訊管理研究所,碩士論文,民國91
[3] Yoshida, K, Adachi, F, Washio, T, Motoda, H, Homma,T,
Nakashima, A, Fujikawa, H & Yamazaki, K
2004,'Density-based spam detector', in Proceedings of
the2004 ACM SIGKDD international conference onKnowledge
discovery and data mining, ACM Press,Seattle, WA, USA, pp.
[4] N. Borenstein, N. Freed, “MIME (Multipurpose Internet
Mail Extensions): Mechanisms for Specifying and
Describing the Format of Internet Message Bodies”, June
[5] G. Lindberg. RFC2505: Anti-Spam Recommendations for SMTP
[6] Lyon, J & Wong, M 2004, Work in Progress, Internet-Draft:
Sender ID: Authenticating E-Mail, InternetEngineering
41
Task Force, viewed May 6 2005,
http://download.microsoft.com/download/6/c/5/6c53077f-
013e-480c-a19d-787850d84861/senderid_spec1.pdf.
[7] Trend Micro RBL+ Service, 2005, Trend MicroIncorporated,
viewed Aug 17 2005,
http://www.trendmicro.com/en/products/nrs/rbl/evaluate
/overview.htm
[8] The FreeBSD Project “FreeBSD handbook”
http://www.freebsd.org/doc/en_US.ISO8859-1/books/handb
ook/
[9] MySQL AB “MySQL Documentation”
http://dev.mysql.com/doc/refman/5.1/en/index.html
[10] The PHP Group “PHP Documentation”
http://www.php.net
[11] Michal Trojnara “Stunnel Documentation”
http://stunnel.mirt.net
[12] kojm, nervoso “Clam AntiVirus Documentation”
http://www.clamav.net/doc/0.88.2/clamdoc.pdf
[13] Theo Van Dinter “SpamAssassin Documentation”
42
http://spamassassin.apache.org/doc.html
[14] Frank Law, iBASE Technologies Limited“Qmail 安裝”
http://www.chinese.qmail.org
[15] Tobias Oetiker “rrdtool”
http://oss.oetiker.ch/rrdtool/doc/rrdtool.en.html
[16] Tobias Oetiker “MRTG Documentation”
http://oss.oetiker.ch/mrtg
[17] Jonathan B. Postel,“SIMPLE MAIL TRANSFER PROTOCOL”
ftp://ftp.rfc-editor.org/in-notes/pdfrfc/rfc821.txt.pd
f.
[18] M. Wong, W. Schlitt,“Sender Policy Framework (SPF) for
Authorizing Use of Domains in E-Mail, Version 1”
http://www.ietf.org/rfc/rfc4408.txt.
[19] The ORDB Group“Open Relay Database - FAQ”
http://www.ordb.org/.
[20] Internet Engineering Steering Group,“How Does SPF
Work?” http://www.openspf.org/howworks.html.
[21] Perl Group , ”The Perl Directory: Online Documentation”
http://www.perl.org/docs.html
[22] Nabeken ,” bsfilter / bayesian spam filter”
http://bsfilter.org/index-e.html
論文使用權限
  • 同意紙本無償授權給館內讀者為學術之目的重製使用,於2006-08-28公開。
  • 同意授權瀏覽/列印電子全文服務,於2006-08-28起公開。


  • 若您有任何疑問,請與我們聯絡!
    圖書館: 請來電 (02)2621-5656 轉 2281 或 來信