Dissertation/Thesis

基於注意力與多模式分析之 數位相片管理系統設計與實作 ; Design and implementation of a multi-modal attention-based photo manager

Bibliographic details
Title: 基於注意力與多模式分析之 數位相片管理系統設計與實作 ; Design and implementation of a multi-modal attention-based photo manager
Authors: 孫新民
Contributors: 廖文宏
Publication year: 2004
Collection: National Chengchi University Institutional Repository (NCCUIR)
Subject terms: computer vision, multi-modal, image processing, artificial intelligence
Time: 25
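Description: Master's thesis ; National Chengchi University ; Department of Computer Science ; 90753012 ; 93

Abstract (translated from the Chinese): This thesis describes the research, design, and implementation of an intelligent platform for managing and browsing personal digital photos. The system is built on an architecture that integrates multiple sources of evidence: analysis of image content and of user browsing behavior serves as the basis for automatic classification and for judging how important an image is and how strongly to recommend it. Automatic classification uses externally supplied standard information (EXIF data) together with image content analysis, classifying images by the number of people present and the proportion of the image area they occupy. Recommendation combines image quality analysis (focus quality and exposure quality) with analysis of the user's behavior while browsing photos (dwell time and degree of attention) to estimate importance. Finally, a multi-modal architecture integrates the different evaluation results to produce the recommendation.

Abstract (English): In this thesis, we present the design and implementation of an intelligent personal digital photo browsing platform. The proposed system relies on multiple sources of evidence inferred from image content as well as user behavior. Specifically, external EXIF data and face detection results are used to coarsely classify the digital images. Measures of image quality, including clarity and contrast, are calculated to further refine the search result. Moreover, we use web cameras to record and analyze the viewing behavior of the user and attempt to correlate the viewer's interest with the effective viewing time. Finally, a multi-modal system integrates the clues acquired from the different modules.

Table of contents:
Chapter 1 Introduction
1.1 Management problems caused by the spread of digital imaging, and current solutions
1.2 Intelligent human-computer interfaces
1.3 An intelligent digital image management platform
Chapter 2 Image content analysis
2.1 Overview of image content analysis
2.2 Digital cameras and shooting parameters: an introduction to EXIF
2.3 People-based classification of digital photos
Chapter 3 Analysis of digital image quality parameters
3.1 Overview of digital image quality analysis
3.2 Evaluation algorithm for the quality of the captured image
3.3 Evaluation algorithm for the effect of image contrast on digital photo quality
Chapter 4 Analysis of user behavior parameters
4.1 Introduction to user behavior analysis
4.2 Analysis of user behavior and degree of attention
Chapter 5 Multi-modality and information fusion
5.1 Introduction to Multi-Modal systems and Information Fusion
5.2 Design of the decision core
5.3 System integration
Chapter 6 System implementation
6.1 System implementation
6.2 Database design and implementation
Chapter 7 Conclusions and future work
7.1 Conclusions
7.2 Future work
References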
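Note on the fusion step: the abstract says that results from separate modules (image quality and viewing behavior) are integrated into one recommendation, but this record does not say how. The sketch below is only an illustration under stated assumptions: it uses a plain weighted average, and the names `fuse_scores`, `photo_scores`, and `module_weights`, the module names, and all numeric values are hypothetical and do not come from the thesis.

```python
def fuse_scores(scores, weights):
    """Combine per-module scores (each in [0, 1]) into one recommendation score.

    Assumption: a weighted average stands in for the thesis's decision core,
    whose actual design is not given in this record.
    """
    for name, value in scores.items():
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"score for {name!r} must lie in [0, 1]")
    # Only the weights of modules that actually reported a score are used.
    total_weight = sum(weights[name] for name in scores)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(scores[name] * weights[name] for name in scores) / total_weight


# Hypothetical module outputs for one photo (not data from the thesis).
photo_scores = {"focus": 1.0, "exposure": 0.5, "viewing": 0.0}
# Hypothetical weights: viewing behavior counts twice as much as each quality cue.
module_weights = {"focus": 1.0, "exposure": 1.0, "viewing": 2.0}
recommendation = fuse_scores(photo_scores, module_weights)
```

A weighted average keeps each module independent, so a module can be dropped or reweighted without touching the others; other fusion rules could be substituted in the same place.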
Document type: thesis
File description: 98777 bytes; 106434 bytes; 104322 bytes; 61791 bytes; 121485 bytes; 121150 bytes; 98223 bytes; 96164 bytes; 1327653 bytes; 1346117 bytes; 1712700 bytes; 416345 bytes; 308738 bytes; 1389671 bytes; 828192 bytes; 163279 bytes; application/pdf
Language: English
Relation: G0090753012; http://nccur.lib.nccu.edu.tw//handle/140.119/37101
Availability: http://nccur.lib.nccu.edu.tw//handle/140.119/37101
Accession number: edsbas.3DAC5288
Database: BASE