淡江大學覺生紀念圖書館 (TKU Library)
進階搜尋


下載電子全文限經由淡江IP使用) 
系統識別號 U0002-1706200515320800
中文論文名稱 視訊簡報互動技術在數位學習之研究
英文論文名稱 Interactive Techniques for Video Presentations in E-Learning
校院名稱 淡江大學
系所名稱(中) 資訊工程學系博士班
系所名稱(英) Department of Computer Science and Information Engineering
學年度 93
學期 2
出版年 94
研究生中文姓名 廖逸群
研究生英文姓名 Yi-Chun Liao
學號 890190076
學位類別 博士
語文別 英文
口試日期 2005-05-31
論文頁數 111頁
口試委員 指導教授-林慧珍
委員-施國琛
委員-王英宏
委員-楊錦潭
委員-廖弘源
中文關鍵字 超媒體  數位教學  互動式視訊  影片處理  課程錄製系統 
英文關鍵字 Video Presentation  Interaction Model  Hypervideo  MPEG-2  E-Learning  Interactive Video  MPEG-4 
學科別分類 學科別應用科學資訊工程
中文摘要 隨著電腦硬體與網路的快速發展,數位學習成為重要的課題。而視訊影片為不可或缺的媒體之一,這些視訊教材常以VCD或DVD的格式來使用,或者以串流的方式來呈現。數位學習常用視訊簡報系統中,大都有互動性不足的情形。對教材提供者或錄製者而言,以現存系統來錄製,則無法提供多樣性的教材,錄製的形式也是無法表現出教材內容與教師的互動。就學習者而言,視訊教材提供一種線性與單一的學習環境,使用者無法根據個人所需來決定學習進度、甚至學習路徑。
本論文將根據上述的問題,提供具有高互動性而簡單的基礎架構,使用者透過互動讓視訊教材提供更豐富更多樣化的內容。分別從視訊教材的錄製、後製、播放三個階段來探討使用者與視訊教材如何互動,其中研究要點有(1)視覺內容的同步方式(2)視訊教材與使用者互動式(3)可調式視訊簡報內容的建構方式。在多媒體同步與互動模式中,我們利用OCPN說明本論文所提出的多種模式。視訊教材錄製時,四種以教師為主角錄製模式用來強調教師與教材的互動關係。教材剪輯時,本論文提出以影片為物件的註解編輯系統。在最後階段中,學習者根據自己的需要來觀賞不同教學內容。特別的是我們把兩種概念帶入本架構(1)超媒體視訊 (2)以演講者為主的視訊教材。
英文摘要 With the prospects of novel capture and display technologies there is no doubt that digital visual content in the future will become as ubiquitous as paper is today. Furthermore, advances in communications, affective, and wearable computing, assures us that personal visual information will be used in unexpected and exciting ways. For e-learning, the video is a major material of the learning contents, but the whole video material is considered as a pure and single data like as an image. It is not convenient for a learning content provider to provide more information in the video. For a learner, the learning video should bring more abounding knowledge. A learner can get what he need according to the knowledge and interest level.
We address the problem of automatic capturing and organization of visual information through user interaction at different steps. Our work focuses on the following three important areas: (1) synchronization method of visual contents and (2) interaction models between material and users that get more information from user input at multiple devices (3) construction of adaptive presentation methods in solving different knowledge levels. In particular, we present the following: (1) novel actor-based model for capturing the notion (2) a complete video processing framework that applies hypervideo concept into e-learning.
論文目次 CONTENTS

LIST OF FIGURES ii
LIST OF TABLES iv

CHAPTER 1 INTRODUCTION 1
1.1 Background and Motivation 1
1.2 Challenges of Video Systems 4
1.2.1 Interaction 4
1.2.2 Synchronization 6
1.3 Thesis Overview 6

CHAPTER 2 FUNDAMENTALS OF VIDEO TECHNOLOGIES 9
2.1 Overview of Video Formats and Concepts 10
2.1.1 History of Video Compression Standards 19
2.2 General Concepts of Video Capture Techniques 23
2.3 The Inter-/Intra-stream Synchronization Model 28
2.4 Actor-Based Capture Model for E-Learning 37

CHAPTER 3 AUGMENTED HYPERVIDEO AUTHORING 47
3.1 HyperVideo -- Beyond Hypermedia and Hypertext 47
3.1.1 Related Work of HyperVideo 49
3.2 Overview of MPEG-4 BIFS 51
3.3 Authoring Techniques 53
3.3.1 Spatial-Temporal Rules 68
3.4 A Case Study of HyperVideo System in E-Learning 70

CHAPTER 4 PRESENTATION PLAYBACK OF STRUCTURED HYPERVIDEO 81
4.1 Overview of Real-Time Video Playback 81
4.2 Viewer-based Presentation Model 86
4.2.1 Interaction with Remote Control 90
4.2.2 Interaction with Camera-Captured Gestures 93

CHAPTER 5 Conclusions and Future Directions 103
5.1 Summary of Thesis 103
5.2 Extensions to Our Work 105

Bibliography… 107

LIST OF FIGURES
CHAPTER1
Figure 1.1 The Workflow of the Video Process 3
Figure 1.2 The Proposed Framework of Video Process in E-learning 8
CHAPTER2
Figure 2.1 Video Lecture Recorded by a CCD Camera 24
Figure 2.2 Synthesized Video Lecture with a Clear Screen Output 24
Figure 2.3 the Scenario of Video Capture 27
Figure 2.4 A DOCPN Map of the AVS 37
Figure 2.5 Multi-stream in Augmented Video System 39
Figure 2.6 The Augmented Video System Architecture 40
Figure 2.7 Normal Mode and DCPN Map of Normal Mode 42
Figure 2.8 Reduce Screen Mode and DCPN Map 43
Figure 2.9 Full Actor Mode and DCPN Map 44
Figure 2.10 Full Screen Mode and DCPN Map 45
CHAPTER3
Figure 3.1 Hypermedia and Hypervideo 48
Figure 3.3 The User_data is Attached to the Raw Video Data 55
Figure 3.4 The Multimedia Presentation System 56
Figure 3.5 The Timeline of the Different Media Objects 59
Figure 3.6 Petri Net Specification of the Slide Show 60
Figure 3.7 A Timeline Scenario of the Hypervideo System 64
Figure 3.8 the DOCPN map with the User Interaction and Hyperlink Events 65
Figure 3.9 An Example of the Hypervideo Tree 67
Figure 3.10 The Crossover of two Objects 70
Figure 3.11 Different Playing Sequences with the Same Video Material 72
Figure 3.12 Editing Process of the Hypervideo Authoring System 73
Figure 3.13 Comparing the Raw File and Edited Video File 76
Figure 3.14 The Hypervideo Authoring Tool 79
Figure 3.15 The Hyrpervideo Player 79
CHAPTER4
Figure 4.1 A transmission Scenario of the Video Transmission 82
Figure 4.2 (a) A Linear Sequence (b) Tree-like Structure 87
Figure 4.3 The Architecture of Proposed Player 89
Figure 4.4 The Other Interactive for the Hypervideo Browser 90
Figure 4.5 The Workflow of the Remote Control for the Hypervideo Browser 91
Figure 4.6 The Remote Control and the Operation of the Remote Control 92
Figure 4.7 The Indexing Example of the Jump Point 93
Figure 4.8 The Workflow of the Gesture Capturing 94
Figure 4.9 Identify the Gesture in HIS Color Domain 95
Figure 4.11 Overview of the Tracking Method 101
Figure 4.12 Moving Mouse by a Gesture 102
Figure 4.13 Click an Object by a Gesture in the Hypervideo Browser 102


LIST OF TABLES

Table 2.1 The Characteristics of the Analog Video and Digital Video 11
Table 2.2 Data Rate for some Applications 17

Table 3.1 The Definitions of DOCPN Map 63
Table 3.2 Example of attribute list of a video object and the personal textual description of each field 74

Table 4.1 Streaming Media Formats-basic Features 83

參考文獻 Bibliography
Bertini, M., Cucchiara, R., Bimbo, A. D. & Prati, A. (2003). Object and event detection for semantic annotation and transcoding, Proceedings of International Conference on Multimedia and Expo, 2, 421-424.
Chang, S. F., Chen, W., Meng, H. J., Sundaram, H. & Zhong, D. (1997). VideoQ: an automated content based video search system using visual cues, Proceedings of the fifth ACM international conference on Multimedia, 313-324.
Chen, H. T., Lin, H. H., & Liu, T. L., (2001). Multi-object tracking using dynamical graph matching, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2, 201-271.
Chen, M. (2003). Interacting with media: visualizing the pulse of a classroom, Proceedings of the eleventh ACM international conference on Multimedia, 2003, 555-561.
Chen, M. (2002). Meeting support: achieving effective floor control with a low-bandwidth gesture-sensitive videoconferencing system, Proceedings of the tenth ACM international conference on Multimedia, 2002, 476-483.
Chien, S. Y., Huang, Y. W., Hsieh, B. Y., Ma, S. Y. & Chen, L. G. (2004). Fast video segmentation algorithm with shadow cancellation, global motion compensation, and adaptive threshold techniques, IEEE Transactions on Multimedia, 6(5), 732-748.
Deshpande, S.G. & Hwang J. N. (2001). A real-time interactive virtual classroom multimedia distance learning system, IEEE Transactions on Multimedia, 3(4), 432-444.
Dimitrova, N., Zhang, Hong-Jiang, Shahraray, B., Sezan, I., Huang, T. & Zakhor, A. (2002). Aplications of video-content analysis and retrieval, IEEE Multimedia, 9(3), 42-55.
Doherty, J., Girgensohn, A., Helfman, J., Shipman, F. & Wilcox, L. (2003). Detail-on-demand hypervideo, Proceedings of the eleventh ACM international conference on Multimedia, 600-601.
Girgensohn, A., Shipman, F. & Wilcox, L. (2003). Hyper-hitchcock: authoring interactive videos and generating interactive summaries, Proceedings of the eleventh ACM international conference on Multimedia, 92-93.
Girgensohn, A., Wilcox, L., Shipman, F. & Bly, S. (2004). Extending to multidimensional interfaces: designing affordances for the navigation of detail-on-demand hypervideo, Proceedings of the working conference on Advanced visual interfaces, 290-297.
Gonzalez, R. C. & Wintz, P. (1987). Digital Image Processing, 2nd Edition, Addison-Wesley.
Granrath, D. J. (1981). The role of human visual models in image processing, Proceedings of IEEE, 69(5), 552-561.
Guan, S. U., Yu, H. Y. & Yang, J. S. (1998). A prioritized petri net model and its application in distributed multimedia systems, IEEE Transaction on Computer, 47(4), 477-481.
Hada, Y., Ogata, H, & Yano, Y. (2001). XML-based video annotation system for language learning environment, Proceedings of the Second International Conference on Web Information Systems Engineering, 1, 102-111.
Hung, J. C., Shih, T. K., Huang, J. Y., Wang, C. S. & Chen, J. F. (2001). An efficient approach to holding a virtual conference, the Proceedings of the National Science Council, R.O.C. Part A: Physical Science and Engineering, 25(6), 352-366.
Little, T. D. C. & Ghafoor, A. (1990). Synchronization and storage models for multimedia objects, IEEE Journal on Selected Areas in Communications, 8(3), 413-427.
Little, T. D. C. & Ghafoor, A. (1991). Multimedia synchronization protocols for broadband integrated services, IEEE Journal on Selected Areas in Communications, 9(9), 1368-1382.
Little, T. D. C. (1991). Synchronization for distributed multimedia database systems, Ph. D. Dissertation, Syracuse University.
Liu, K. Y., Huang, N., Wu, B. H., Chu, W. T. & Chen. H. Y. (2002). The WSML system: web-based aynchronization multimedia lecture system, Proceedings of the tenth ACM international conference on Multimedia, 662-663, France.
Lin, C. Y., Tseng, B. L., Naphade, M., Natsev, A. & Smith, J. R. (2003). MPEG-7 video automatic labeling system, Proceedings of the eleventh ACM international conference on Multimedia, 98-99.
Mackay, W. E. & Davenport, G. (1989). Virtual video editing in interactive multimedia applications, Communications of the ACM, 32(7), 802-810.
Marcenaro, L., Ferrari, M., Marchesotti, L. & Regazzoni, C. S., (2002). Multiple object tracking under heavy occlusions by using Kalman filters based on shape matching, Proceedings of the. International Conference on Image Processing, 3, 341-344.
Martinez, J. M. (2003). Authoring 744: writing descriptions to create content, IEEE Multimedia, 10(4), 94-99.
Mitchell, J. L. Pennebaker, W. B., Fogg, C. E. & Legall, D. J. (1996). Mpeg Video: Compression Standard (Digital Multimedia Standards Series), Kluwer Academic Publishers.
Mitchell, J. L. , Pennebaker, W. B. , Fogg, C. E. & LeGall, D. J. (1997). MPEG Video: Compression Sstandard, Chapman & Hall.
Mukhopadhyay, S. & Smith, B. (1999). Passive capture and structuring of lectures, Proceedings of the seventh ACM international conference on Multimedia, 477-487.
Murata, T. (1989). Petri nets: properties, analysis and applications, Proceedings of the IEEE, 77(4), 541 - 580.
Peterson, J. L. (1981). Petri Net Theory and the Modeling of Systems, Englewood Cliffs, NJ: Prentice-Hall, Inc..
Sawhney, N., Balcom, D. & Smith, I. (1996). HyperCafe: narrative and aesthetic properties of hypervideo, Proceedings of the seventh ACM conference on Hypertext, 1-10.
Sen, S., Rexford, J. L., Dey, J. K., Kurose, J. F. & Towsley, D. F. (2000). Online smoothing of variable-bit-rate streaming video, IEEE Transactions on Multimedia, 2(1), 282-300.
Setten, M. V. & Oltmans, E. (2001). Demonstration of a distributed MPEG-7 video search and retrieval application in the educational domain, Proceedings of the ninth ACM international conference on Multimedia, 595-596.
Shih, T.K. & Deng, L. Y. (2000). Distributed multimedia presentation with floor control mechanisms in a distance learning system, Proceeding of the 2000 International Conference on Information Society in the 21 Century: Emerging Technologies and New Challenges, Japan.
Shih, T. K., Deng, L. Y., Wang, J. H. & Liou, A. A. H. (2001). Maintaining persistent look-and-feel for roaming student with mobile agent in distance learning, Proceeding of the 7th International Conference on Distributed Multimedia Systems (DMS’ 2001), Taipei, Taiwan.
Shipman, F., Girgensohn, A. & Wilcox, L. (2003). Creating navigable multi-level video summaries, Proceedings of the International Conference on Multimedia and Expo,2 , 752-756.
Simpson, W. A. (1994). Temporal summation of visual motion, Vision Res. 34(19), 2547-2559.
Steinmetz, R. & Nahrstedt, K. (1995). Multimedia: Computing, Communications and Applications, Prentice Hall PTR, (1st edition).
Suzuki, T. & Shatz, S. M. (1990). A protocol modeling and verification approach based on a specification language and petri nets, IEEE Transactions on Software Engineering, 16(5) , 523-536.
Taylor, J. (2000), DVD Demystified, McGraw-Hill Professional (2nd edition).
Woo, M., Qazi, N. U. & Ghafoor, A. (1994). A synchronization framework for communication of pre-orchestrated multimedia information, IEEE Network, 8(1), 52-61.
Yu, H., Huang, T. S., & Niemann, H. (2002). Segmentation-based object tracking using image warping and Kalman filtering, Proceeding of the International Conference on Image Processing, 3, 601-604.
Wu, D., Hou, Y.T., Zhu, W. W., Zhang, Y. Q. & Peha, J. M. (2001). Streaming video over the Internet: approaches and directions, IEEE Transactions on Circuits and Systems for Video Technology, 11(3), 282-300.
Zurawsi, R. & Zhou, M. (1994). Petri nets and industrial application: a tutorial, IEEE Transactions on Industrisl Electronics, 41(6), 567-583.
論文使用權限
  • 同意紙本無償授權給館內讀者為學術之目的重製使用,於2007-06-27公開。
  • 同意授權瀏覽/列印電子全文服務,於2007-06-27起公開。


  • 若您有任何疑問,請與我們聯絡!
    圖書館: 請來電 (02)2621-5656 轉 2281 或 來信