電子學位論文服務

§ 瀏覽學位論文書目資料

本論文電子全文於2021-08-11起於校外公開使用
本論文紙本於2021-08-11起公開使用

系統識別號	U0002-0808201614214000
DOI	10.6846/TKU.2016.00251
論文名稱(中文)	基於多尺度分類之強健型即時視覺追蹤演算法的設計與實現
論文名稱(英文)	Design and Implementation of a Robust Real-Time Visual Tracking Algorithm Based on Multi-scale Classification
第三語言論文名稱
校院名稱	淡江大學
系所名稱(中文)	電機工程學系碩士班
系所名稱(英文)	Department of Electrical and Computer Engineering
外國學位學校名稱
外國學位學院名稱
外國學位研究所名稱
學年度	104
學期	2
出版年	105
研究生(中文)	馮彥彰
研究生(英文)	Yen-Chang Feng
學號	603470179
學位類別	碩士
語言別	繁體中文
第二語言別
口試日期	2016-07-13
論文頁數	71頁
口試委員	指導教授 - 蔡奇謚(walilile@gmail.com) 委員 - 郭景明委員 - 李世安委員 - 蔡奇謚
關鍵字(中)	強健型視覺追蹤多尺度分類平行加速演算法加速壓縮追蹤演算法
關鍵字(英)	Robust visual tracking multi-scale classification parallel acceleration algorithmic acceleration compressive tracking
第三語言關鍵字
學科別分類
中文摘要	視覺追蹤在目前電腦視覺的領域中為一重要的角色，其能應用於機器人上，幫助機器人達成許多任務。本論文以現有的壓縮追蹤演算法為基礎，提出一個多尺度分類之視覺追蹤方法，其依靠多個特徵分類器，在不同尺度空間條件下進行分類器訓練，使得追蹤效果更加強建。再者，在分類器初始化時，可透過隨機生成不同種類特徵的方式，產生較為強健之特徵分類器，獲得較精準的追蹤結果。為了提升追蹤系統的即時效能，在本論文中也針對所提出的視覺追蹤演算法進行各部份動作的執行時間分析，並針對執行時間較高的部分，進行CPU的平行加速。另外，也針對分類器公式，進行數值討論與公式化簡，以解決增加多尺度分類器後，演算法運算複雜度增加的影響。經由上述平行加速及運算簡化後，所實現的演算法在處理642x352影像時，可以達到每秒約45張影像(45 fps)的追蹤處理速度。而在實驗結果上，與四種現有的視覺追蹤演算法相比，不但在追蹤成功率有明顯提升外，在追蹤精確度上也較為優異。
英文摘要	Visual tracking acts an important role in fields of computer vision as it can apply on robots to accomplish many tasks. In this thesis, a robust visual tracking algorithm is proposed based on the existing compressive tracking method. The proposed algorithm adopts multiple Bayes feature classifiers, each of them was trained under a different scale condition, to realize online multi-scale classification, which can greatly improve the robustness of tracking system. Furthermore, each feature classifier was initialized by randomly generating different types of Haar-like features. By doing so, the robustness of feature classification can be improved to obtain more accurate tracking results. In order to enhance real-time performance of visual tracking system, the processing time of each stage of the proposed algorithm was recorded. Then, the processing speed of the proposed algorithm is accelerated by adopting CPU-based parallel processing on the computationally expensive stages. Also, the formula of the Bayes feature classifier is simplified by numerical analysis to speedup the processing speed of multi-scale feature classification. After accelerating by parallel computation and formula simplification, the proposed visual tracking algorithm reaches about 45 images per second (45 fps) tracking performance in dealing with images of 642x352 pixels. Experimental results show that the proposed algorithm outperforms four state-of-the-art visual tracking methods in terms of success rate, tracking accuracy, and visual comparison.
第三語言摘要
論文目次	目錄中文摘要 I 英文摘要 II 目錄 III 圖目錄 VI 表目錄 VIII 第一章序論 1 1.1研究背景 1 1.2研究動機與目的 8 1.3論文架構 10 第二章相關研究 11 2.1 壓縮追蹤演算法 11 2.1.1 特徵值計算與降維方式 14 2.1.2 分類器的生成與更新 17 2.2 快速壓縮追蹤演算法 19 第三章多尺度分類器追蹤演算法 23 3.1 多尺度樣本 24 3.2 分類器更新 25 3.3 最佳樣本與尺度的取得 29 第四章追蹤演算法加速與實現 31 4.1加速搜尋樣本特徵值計算部分 32 4.2加速搜尋追蹤目標位置部分 33 4.2.1搜尋追蹤目標位置部分平行化 33 4.2.2搜尋追蹤目標位置部分公式減化加速 35 4.3加速樣本搜尋動作 40 第五章實驗結果與分析 43 5.1軟硬體介紹 43 5.2演算法參數設定 44 5.3演算法評估標準 45 5.4追蹤結果 53 5.4.1追蹤影片David 54 5.4.2追蹤影片Shacking 56 5.4.3追蹤影片Sylvester 57 5.4.4追蹤影片Animal 59 5.4.5追蹤影片Tiger 1 61 5.4.6追蹤影片Soccer 62 5.4.7追蹤影片Cliff bar 63 5.4.8追蹤影片Occluded Face與Occluded Face 2 65 第六章結論與未來展望 68 參考文獻 69 圖目錄圖2. 1壓縮追蹤演算法分類器學習階段(Learning Phase)流程圖。 11 圖2. 2 取樣範圍示意圖：(a) 正樣本與負樣本的取樣範圍示意圖，(b) 搜尋樣本的取樣範圍示意圖。 12 圖2. 3 壓縮追蹤演算法追蹤階段(Tracking Phase)流程圖。 13 圖2. 4 特徵值計算與降維過程示意圖。 15 圖2. 5 快速壓縮追蹤演算法分類器學習階段(Learning Phase)流程圖。 20 圖2. 6 快速壓縮追蹤演算法追蹤階段(Tracking Phase)流程圖。 21 圖2. 7 快速壓縮追蹤演算法樣本搜尋示意圖。 22 圖3. 1多尺度追蹤演算法學習階段(Learning Phase)流程圖。 23 圖3. 2多尺度樣本、稀疏測量矩陣與特徵示意圖。 24 圖3. 3 影片David的第1張影像與第2張影像的平均值μ的比較圖：(a)正樣本分類器，(b)負樣本分類器。 26 圖3. 4 影片David的第101張影像與第102張影像的平均值μ的比較圖。：(a)正樣本分類器，(b)負樣本分類器。 27 圖3. 5 影片David的第1張影像與第2張影像的標準差σ的比較圖：(a)正樣本分類器， (b)負樣本分類器。 27 圖3. 6 影片David的第101張影像與第102張影像的標準差σ的比較圖：(a)正樣本分類器，(b)負樣本分類器。 28 圖3. 7多尺度追蹤演算法追蹤階段(Tracking Phase)流程圖。 30 圖4. 1 有無設定極小值之值域圖。 37 圖4. 2 演算法加速後追蹤階段(Tracking Phase)流程圖。 41 圖5. 1 取樣半徑示意圖：(a) 正樣本與負樣本的取樣半徑示意圖，(b) 搜尋樣本的取樣半徑示意圖。 44 圖5. 2 各演算法追蹤結果標記表示示意圖。 53 圖5. 3 Challenging Sequence David 追蹤結果。 54 圖5. 4 Challenging Sequence Shacking追蹤結果。 56 圖5. 5 Challenging Sequence Sylvester追蹤結果。 58 圖5. 6 Challenging Sequence Animal追蹤結果。 59 圖5. 7 Challenging Sequence Tiger 1追蹤結果。 61 圖5. 8 Challenging Sequence Soccer追蹤結果。 62 圖5. 9 Challenging Sequence Cliff bar追蹤結果。 64 圖5. 10 Challenging Sequence Occluded Face追蹤結果。 65 圖5. 11 Challenging Sequence Occluded Face 2追蹤結果。 66 表目錄表4. 1提出的演算法與壓縮追蹤演算法各部分處理時間與總處理時間。 31 表4. 2提出的演算法於搜尋樣本特徵值計算未加速與加速後的處理時間。 33 表4. 3搜尋追蹤目標位置未平行化、平行化後與平行化且簡化後的處理時間。 35 表4. 4提出的演算法進行加速前與加速後各部分處理時間與總處理時間。 40 表4. 5 修改搜尋方式前後處理時間。 42 表5. 1 電腦規格表。 43 表5. 2 使用軟體版本表。 44 表5. 3 使用五種尺度之分類器、快速壓縮追蹤演算法、縮追蹤演算法、多樣本學習追蹤演算法和追蹤-學習-偵測演算法之追蹤成功率與fps。 46 表5. 4 使用三種尺度(尺度為0. 9、1.0和1.11)之分類器、快速壓縮追蹤演算法、縮追蹤演算法、多樣本學習追蹤演算法和追蹤-學習-偵測演算法之追蹤成功率與fps。 47 表5. 5 使用三種尺度(尺度為0. 8、1.0和1.25)之分類器、快速壓縮追蹤演算法、縮追蹤演算法、多樣本學習追蹤演算法和追蹤-學習-偵測演算法之追蹤成功率與fps。 48 表5. 6 使用五種尺度之分類器、快速壓縮追蹤演算法、縮追蹤演算法、多樣本學習追蹤演算法和追蹤-學習-偵測演算法之平均中心點誤差。 49 表5. 7 使用三種尺度(尺度為0. 9、1.0和1.11)之分類器、快速壓縮追蹤演算法、縮追蹤演算法、多樣本學習追蹤演算法和追蹤-學習-偵測演算法之平均中心點誤差。 50 表5. 8 使用三種尺度(尺度為0. 8、1.0和1.25)之分類器、快速壓縮追蹤演算法、縮追蹤演算法、多樣本學習追蹤演算法和追蹤-學習-偵測演算法之平均中心點誤差。 51
參考文獻	參考文獻 [1] S. Avidan, “Support Vector Tracking,” in Proceedings IEEE Conference Computer Vision and Pattern Recognition, vol. I, pp. 184-191, 2001. [2] C. Cortes, and V. Vapnik, “Support-vector networks,” Machine Learning, vol. 20, no. 3, pp.273-297, 1995. [3] F.Jurie, and M.Dhome, “Hyperplane Approximation for Template Matching,” IEEE Transactions Pattern Analysis and Machine Intelligence, vol. 24, no. 7, pp. 996-1000, 2002. [4] H. Grabner, M. Grabner, and H. Bischof, “Real-time tracking via on-line boosting,” in Proceedings British Machine Vision Conference, pp. 47–56, 2006. [5] H. Grabner, C. Leistner, and H. Bischof, “Semi-supervised on-line boosting for robust tracking,” in Proceedings European Conference on Computer Vision, pp. 234–247, 2008. [6] B. Babenko, M.-H. Yang, and S. Belongie, “Visual tracking with online multiple instance learning,” in Proceedings IEEE Conference Computer Vision and Pattern Recognition, pp. 983–990, 2012. [7] B. Babenko, M.-H. Yang, and S. Belongie, “Robust object tracking with online multiple instance learning,” IEEE Transactions Pattern Analysis and Machine Intelligence, vol. 33, no. 8, pp. 1619–1632, 2011. [8] K. Zhang, and H. Song, “Real-time visual tracking via online weighted multiple instance learning,” Pattern Recognition, vol. 46, no. 1, pp. 397–411, 2013. [9] Z. Kalal, K. Mikolajczyk, and J. Matas, “Tracking-learning-detection,” IEEE Transactions Pattern Analysis and Machine Intelligence, vol. 34, no. 7, pp. 1409-1422, 2012 [10] K. Zhang, L. Zhang, Q. Liu, D. Zhang, and M.-H. Yang, “Fast visual tracking via dense spatio-temporal context learning,” in Proceedings European Conference on Computer Vision, vol. 8693, pp. 127-141, 2014. [11] K. Zhang, L. Zhang, and M.-H. Yang, “Real-time compressive tracking,” in Proceedings European Conference on Computer Vision, pp. 864-877, 2012. [12] K. Zhang, L. Zhang and M.-H. Yang, “Fast compressive tracking,” IEEE Transactions Pattern Analysis and Machine Intelligence, vol. 36, no. 10, pp. 2002-2015, 2014 [13] M.D. Jenkins, P. Barrie, T. Buggy, G. Morison, “Extended fast compressive tracking with weighted multi-frame template matching for fast motion tracking,” Pattern Recognition Letters, vol. 69, Issue C, pp. 82–87, 2016 [14] D.S. Bolme, J.R. Beveridge, B.A. Draper, and Y.M. Lui, “Visual object tracking using adaptive correlation ﬁlters,” in Proceedings IEEE Conference Computer Vision and Pattern Recognition, pp. 2544–2550, 2010. [15] J. Henriques, R. Caseiro, P. Martins, and J. Batista, “Exploiting the circulant structure of tracking-by-detection with kernels,” in Proceedings European Conference on Computer Vision, pp. 702–715, 2012. [16] K. Nummiaro, E. Koller-Meier, and L. Van Gool, “Object tracking with an adaptive color-based particle ﬁlter,” in Pattern Recognition, Springer, pp. 353–360, 2002. [17] D.A. Ross, J. Lim, R.-S. Lin, and M.-H. Yang, “Incremental learning for robust visual tracking,” International Journal of Computer Vision, vol. 77, no. 1, pp. 125–141, 2008. [18] D. Achlioptas, “Database-friendly random projections: Johnson-Lindenstrauss with binary coins,” Journal of Computer and System Sciences, vol. 66, no. 4, pp. 671–687, 2003. [19] A. Ng, and M. Jordan, “On discriminative vs. generative classifiers: A comparison of logistic regression and naive bayes,” in Proceedings Advances in Neural Information Processing Systems, pp. 841–848, 2002. [20] P. Diaconis and D. Freedman, “Asymptotics of graphical projection pursuit,” Annals of Statistics, vol. 12, no. 3, pp. 793–815, 1984. [21] M. Everingham, L. Gool, C. Williams, J. Winn, and A. Zisserman,“The pascal visual object class (VOC) challenge,” International Journal of Computer Vision, vol. 88, no. 2, pp. 303–338, 2010
論文全文使用權限	校內：紙本論文於授權書繳交後5年公開同意電子論文全文授權校園內公開校內電子論文於授權書繳交後5年公開校外：同意授權校外電子論文於授權書繳交後5年公開

返回頁首

如有問題，歡迎洽詢！
圖書館數位資訊組　(02)2621-5656 轉 2487 或來信