電子學位論文服務

§ 瀏覽學位論文書目資料

本論文電子全文於2007-06-27起於校外公開使用
本論文紙本於2007-06-27起公開使用

系統識別號	U0002-1706200515320800
DOI	10.6846/TKU.2005.00342
論文名稱(中文)	視訊簡報互動技術在數位學習之研究
論文名稱(英文)	Interactive Techniques for Video Presentations in E-Learning
第三語言論文名稱
校院名稱	淡江大學
系所名稱(中文)	資訊工程學系博士班
系所名稱(英文)	Department of Computer Science and Information Engineering
外國學位學校名稱
外國學位學院名稱
外國學位研究所名稱
學年度	93
學期	2
出版年	94
研究生(中文)	廖逸群
研究生(英文)	Yi-Chun Liao
學號	890190076
學位類別	博士
語言別	英文
第二語言別
口試日期	2005-05-31
論文頁數	111頁
口試委員	指導教授 - 林慧珍(hjlin@cs.tku.edu.tw) 委員 - 施國琛(tshih@cs.tku.edu.tw) 委員 - 王英宏(inhon@mail.tku.edu.tw) 委員 - 楊錦潭(yangdav@nknucc.nknu.edu.tw) 委員 - 廖弘源(liao@iis.sinica.edu.tw)
關鍵字(中)	超媒體數位教學互動式視訊影片處理課程錄製系統
關鍵字(英)	Video Presentation Interaction Model Hypervideo MPEG-2 E-Learning Interactive Video MPEG-4
第三語言關鍵字
學科別分類
中文摘要	隨著電腦硬體與網路的快速發展，數位學習成為重要的課題。而視訊影片為不可或缺的媒體之一，這些視訊教材常以VCD或DVD的格式來使用，或者以串流的方式來呈現。數位學習常用視訊簡報系統中，大都有互動性不足的情形。對教材提供者或錄製者而言，以現存系統來錄製，則無法提供多樣性的教材，錄製的形式也是無法表現出教材內容與教師的互動。就學習者而言，視訊教材提供一種線性與單一的學習環境，使用者無法根據個人所需來決定學習進度、甚至學習路徑。本論文將根據上述的問題，提供具有高互動性而簡單的基礎架構，使用者透過互動讓視訊教材提供更豐富更多樣化的內容。分別從視訊教材的錄製、後製、播放三個階段來探討使用者與視訊教材如何互動，其中研究要點有(1)視覺內容的同步方式(2)視訊教材與使用者互動式(3)可調式視訊簡報內容的建構方式。在多媒體同步與互動模式中，我們利用OCPN說明本論文所提出的多種模式。視訊教材錄製時，四種以教師為主角錄製模式用來強調教師與教材的互動關係。教材剪輯時，本論文提出以影片為物件的註解編輯系統。在最後階段中，學習者根據自己的需要來觀賞不同教學內容。特別的是我們把兩種概念帶入本架構(1)超媒體視訊 (2)以演講者為主的視訊教材。
英文摘要	With the prospects of novel capture and display technologies there is no doubt that digital visual content in the future will become as ubiquitous as paper is today. Furthermore, advances in communications, affective, and wearable computing, assures us that personal visual information will be used in unexpected and exciting ways. For e-learning, the video is a major material of the learning contents, but the whole video material is considered as a pure and single data like as an image. It is not convenient for a learning content provider to provide more information in the video. For a learner, the learning video should bring more abounding knowledge. A learner can get what he need according to the knowledge and interest level. We address the problem of automatic capturing and organization of visual information through user interaction at different steps. Our work focuses on the following three important areas: (1) synchronization method of visual contents and (2) interaction models between material and users that get more information from user input at multiple devices (3) construction of adaptive presentation methods in solving different knowledge levels. In particular, we present the following: (1) novel actor-based model for capturing the notion (2) a complete video processing framework that applies hypervideo concept into e-learning.
第三語言摘要
論文目次	CONTENTS LIST OF FIGURES ii LIST OF TABLES iv CHAPTER 1 INTRODUCTION 1 1.1 Background and Motivation 1 1.2 Challenges of Video Systems 4 1.2.1 Interaction 4 1.2.2 Synchronization 6 1.3 Thesis Overview 6 CHAPTER 2 FUNDAMENTALS OF VIDEO TECHNOLOGIES 9 2.1 Overview of Video Formats and Concepts 10 2.1.1 History of Video Compression Standards 19 2.2 General Concepts of Video Capture Techniques 23 2.3 The Inter-/Intra-stream Synchronization Model 28 2.4 Actor-Based Capture Model for E-Learning 37 CHAPTER 3 AUGMENTED HYPERVIDEO AUTHORING 47 3.1 HyperVideo -- Beyond Hypermedia and Hypertext 47 3.1.1 Related Work of HyperVideo 49 3.2 Overview of MPEG-4 BIFS 51 3.3 Authoring Techniques 53 3.3.1 Spatial-Temporal Rules 68 3.4 A Case Study of HyperVideo System in E-Learning 70 CHAPTER 4 PRESENTATION PLAYBACK OF STRUCTURED HYPERVIDEO 81 4.1 Overview of Real-Time Video Playback 81 4.2 Viewer-based Presentation Model 86 4.2.1 Interaction with Remote Control 90 4.2.2 Interaction with Camera-Captured Gestures 93 CHAPTER 5 Conclusions and Future Directions 103 5.1 Summary of Thesis 103 5.2 Extensions to Our Work 105 Bibliography… 107 LIST OF FIGURES CHAPTER1 Figure 1.1 The Workflow of the Video Process 3 Figure 1.2 The Proposed Framework of Video Process in E-learning 8 CHAPTER2 Figure 2.1 Video Lecture Recorded by a CCD Camera 24 Figure 2.2 Synthesized Video Lecture with a Clear Screen Output 24 Figure 2.3 the Scenario of Video Capture 27 Figure 2.4 A DOCPN Map of the AVS 37 Figure 2.5 Multi-stream in Augmented Video System 39 Figure 2.6 The Augmented Video System Architecture 40 Figure 2.7 Normal Mode and DCPN Map of Normal Mode 42 Figure 2.8 Reduce Screen Mode and DCPN Map 43 Figure 2.9 Full Actor Mode and DCPN Map 44 Figure 2.10 Full Screen Mode and DCPN Map 45 CHAPTER3 Figure 3.1 Hypermedia and Hypervideo 48 Figure 3.3 The User_data is Attached to the Raw Video Data 55 Figure 3.4 The Multimedia Presentation System 56 Figure 3.5 The Timeline of the Different Media Objects 59 Figure 3.6 Petri Net Specification of the Slide Show 60 Figure 3.7 A Timeline Scenario of the Hypervideo System 64 Figure 3.8 the DOCPN map with the User Interaction and Hyperlink Events 65 Figure 3.9 An Example of the Hypervideo Tree 67 Figure 3.10 The Crossover of two Objects 70 Figure 3.11 Different Playing Sequences with the Same Video Material 72 Figure 3.12 Editing Process of the Hypervideo Authoring System 73 Figure 3.13 Comparing the Raw File and Edited Video File 76 Figure 3.14 The Hypervideo Authoring Tool 79 Figure 3.15 The Hyrpervideo Player 79 CHAPTER4 Figure 4.1 A transmission Scenario of the Video Transmission 82 Figure 4.2 (a) A Linear Sequence (b) Tree-like Structure 87 Figure 4.3 The Architecture of Proposed Player 89 Figure 4.4 The Other Interactive for the Hypervideo Browser 90 Figure 4.5 The Workflow of the Remote Control for the Hypervideo Browser 91 Figure 4.6 The Remote Control and the Operation of the Remote Control 92 Figure 4.7 The Indexing Example of the Jump Point 93 Figure 4.8 The Workflow of the Gesture Capturing 94 Figure 4.9 Identify the Gesture in HIS Color Domain 95 Figure 4.11 Overview of the Tracking Method 101 Figure 4.12 Moving Mouse by a Gesture 102 Figure 4.13 Click an Object by a Gesture in the Hypervideo Browser 102 LIST OF TABLES Table 2.1 The Characteristics of the Analog Video and Digital Video 11 Table 2.2 Data Rate for some Applications 17 Table 3.1 The Definitions of DOCPN Map 63 Table 3.2 Example of attribute list of a video object and the personal textual description of each field 74 Table 4.1 Streaming Media Formats-basic Features 83
參考文獻	Bibliography Bertini, M., Cucchiara, R., Bimbo, A. D. & Prati, A. (2003). Object and event detection for semantic annotation and transcoding, Proceedings of International Conference on Multimedia and Expo, 2, 421-424. Chang, S. F., Chen, W., Meng, H. J., Sundaram, H. & Zhong, D. (1997). VideoQ: an automated content based video search system using visual cues, Proceedings of the fifth ACM international conference on Multimedia, 313-324. Chen, H. T., Lin, H. H., & Liu, T. L., (2001). Multi-object tracking using dynamical graph matching, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2, 201-271. Chen, M. (2003). Interacting with media: visualizing the pulse of a classroom, Proceedings of the eleventh ACM international conference on Multimedia, 2003, 555-561. Chen, M. (2002). Meeting support: achieving effective floor control with a low-bandwidth gesture-sensitive videoconferencing system, Proceedings of the tenth ACM international conference on Multimedia, 2002, 476-483. Chien, S. Y., Huang, Y. W., Hsieh, B. Y., Ma, S. Y. & Chen, L. G. (2004). Fast video segmentation algorithm with shadow cancellation, global motion compensation, and adaptive threshold techniques, IEEE Transactions on Multimedia, 6(5), 732-748. Deshpande, S.G. & Hwang J. N. (2001). A real-time interactive virtual classroom multimedia distance learning system, IEEE Transactions on Multimedia, 3(4), 432-444. Dimitrova, N., Zhang, Hong-Jiang, Shahraray, B., Sezan, I., Huang, T. & Zakhor, A. (2002). Aplications of video-content analysis and retrieval, IEEE Multimedia, 9(3), 42-55. Doherty, J., Girgensohn, A., Helfman, J., Shipman, F. & Wilcox, L. (2003). Detail-on-demand hypervideo, Proceedings of the eleventh ACM international conference on Multimedia, 600-601. Girgensohn, A., Shipman, F. & Wilcox, L. (2003). Hyper-hitchcock: authoring interactive videos and generating interactive summaries, Proceedings of the eleventh ACM international conference on Multimedia, 92-93. Girgensohn, A., Wilcox, L., Shipman, F. & Bly, S. (2004). Extending to multidimensional interfaces: designing affordances for the navigation of detail-on-demand hypervideo, Proceedings of the working conference on Advanced visual interfaces, 290-297. Gonzalez, R. C. & Wintz, P. (1987). Digital Image Processing, 2nd Edition, Addison-Wesley. Granrath, D. J. (1981). The role of human visual models in image processing, Proceedings of IEEE, 69(5), 552-561. Guan, S. U., Yu, H. Y. & Yang, J. S. (1998). A prioritized petri net model and its application in distributed multimedia systems, IEEE Transaction on Computer, 47(4), 477-481. Hada, Y., Ogata, H, & Yano, Y. (2001). XML-based video annotation system for language learning environment, Proceedings of the Second International Conference on Web Information Systems Engineering, 1, 102-111. Hung, J. C., Shih, T. K., Huang, J. Y., Wang, C. S. & Chen, J. F. (2001). An efficient approach to holding a virtual conference, the Proceedings of the National Science Council, R.O.C. Part A: Physical Science and Engineering, 25(6), 352-366. Little, T. D. C. & Ghafoor, A. (1990). Synchronization and storage models for multimedia objects, IEEE Journal on Selected Areas in Communications, 8(3), 413-427. Little, T. D. C. & Ghafoor, A. (1991). Multimedia synchronization protocols for broadband integrated services, IEEE Journal on Selected Areas in Communications, 9(9), 1368-1382. Little, T. D. C. (1991). Synchronization for distributed multimedia database systems, Ph. D. Dissertation, Syracuse University. Liu, K. Y., Huang, N., Wu, B. H., Chu, W. T. & Chen. H. Y. (2002). The WSML system: web-based aynchronization multimedia lecture system, Proceedings of the tenth ACM international conference on Multimedia, 662-663, France. Lin, C. Y., Tseng, B. L., Naphade, M., Natsev, A. & Smith, J. R. (2003). MPEG-7 video automatic labeling system, Proceedings of the eleventh ACM international conference on Multimedia, 98-99. Mackay, W. E. & Davenport, G. (1989). Virtual video editing in interactive multimedia applications, Communications of the ACM, 32(7), 802-810. Marcenaro, L., Ferrari, M., Marchesotti, L. & Regazzoni, C. S., (2002). Multiple object tracking under heavy occlusions by using Kalman filters based on shape matching, Proceedings of the. International Conference on Image Processing, 3, 341-344. Martinez, J. M. (2003). Authoring 744: writing descriptions to create content, IEEE Multimedia, 10(4), 94-99. Mitchell, J. L. Pennebaker, W. B., Fogg, C. E. & Legall, D. J. (1996). Mpeg Video: Compression Standard (Digital Multimedia Standards Series), Kluwer Academic Publishers. Mitchell, J. L. , Pennebaker, W. B. , Fogg, C. E. & LeGall, D. J. (1997). MPEG Video: Compression Sstandard, Chapman & Hall. Mukhopadhyay, S. & Smith, B. (1999). Passive capture and structuring of lectures, Proceedings of the seventh ACM international conference on Multimedia, 477-487. Murata, T. (1989). Petri nets: properties, analysis and applications, Proceedings of the IEEE, 77(4), 541 - 580. Peterson, J. L. (1981). Petri Net Theory and the Modeling of Systems, Englewood Cliffs, NJ: Prentice-Hall, Inc.. Sawhney, N., Balcom, D. & Smith, I. (1996). HyperCafe: narrative and aesthetic properties of hypervideo, Proceedings of the seventh ACM conference on Hypertext, 1-10. Sen, S., Rexford, J. L., Dey, J. K., Kurose, J. F. & Towsley, D. F. (2000). Online smoothing of variable-bit-rate streaming video, IEEE Transactions on Multimedia, 2(1), 282-300. Setten, M. V. & Oltmans, E. (2001). Demonstration of a distributed MPEG-7 video search and retrieval application in the educational domain, Proceedings of the ninth ACM international conference on Multimedia, 595-596. Shih, T.K. & Deng, L. Y. (2000). Distributed multimedia presentation with floor control mechanisms in a distance learning system, Proceeding of the 2000 International Conference on Information Society in the 21 Century: Emerging Technologies and New Challenges, Japan. Shih, T. K., Deng, L. Y., Wang, J. H. & Liou, A. A. H. (2001). Maintaining persistent look-and-feel for roaming student with mobile agent in distance learning, Proceeding of the 7th International Conference on Distributed Multimedia Systems (DMS’ 2001), Taipei, Taiwan. Shipman, F., Girgensohn, A. & Wilcox, L. (2003). Creating navigable multi-level video summaries, Proceedings of the International Conference on Multimedia and Expo,2 , 752-756. Simpson, W. A. (1994). Temporal summation of visual motion, Vision Res. 34(19), 2547-2559. Steinmetz, R. & Nahrstedt, K. (1995). Multimedia: Computing, Communications and Applications, Prentice Hall PTR, (1st edition). Suzuki, T. & Shatz, S. M. (1990). A protocol modeling and verification approach based on a specification language and petri nets, IEEE Transactions on Software Engineering, 16(5) , 523-536. Taylor, J. (2000), DVD Demystified, McGraw-Hill Professional (2nd edition). Woo, M., Qazi, N. U. & Ghafoor, A. (1994). A synchronization framework for communication of pre-orchestrated multimedia information, IEEE Network, 8(1), 52-61. Yu, H., Huang, T. S., & Niemann, H. (2002). Segmentation-based object tracking using image warping and Kalman filtering, Proceeding of the International Conference on Image Processing, 3, 601-604. Wu, D., Hou, Y.T., Zhu, W. W., Zhang, Y. Q. & Peha, J. M. (2001). Streaming video over the Internet: approaches and directions, IEEE Transactions on Circuits and Systems for Video Technology, 11(3), 282-300. Zurawsi, R. & Zhou, M. (1994). Petri nets and industrial application: a tutorial, IEEE Transactions on Industrisl Electronics, 41(6), 567-583.
論文全文使用權限	校內：紙本論文於授權書繳交後2年公開同意電子論文全文授權校園內公開校內電子論文於授權書繳交後2年公開校外：同意授權校外電子論文於授權書繳交後2年公開

返回頁首

如有問題，歡迎洽詢！
圖書館數位資訊組　(02)2621-5656 轉 2487 或來信