| Graduate student: | 李靜玟 Lee, Jing-Wen |
|---|---|
| Thesis title: | 提升知識再利用效能之聲音知識語音化—以設備操作手冊為例 Knowledge Representation via Knowledge Vocalization--A Case Study on Equipment User Manuals |
| Advisor: | 侯建良 Hou, Jiang-Liang |
| Oral defense committee: | |
| Degree: | Master |
| Department: | College of Engineering - Department of Industrial Engineering and Engineering Management |
| Year of publication: | 2009 |
| Academic year of graduation: | 97 |
| Language: | Chinese |
| Number of pages: | 235 |
| Keywords (Chinese): | 知識語音化、知識表示、知識管理 |
| Keywords (English): | Knowledge Vocalization, Knowledge Representation, Knowledge Management |
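On a factory production line, products are mostly manufactured and processed by operators using machines and equipment. Improper operation of this equipment can easily lead to industrial safety accidents, so safe equipment operation has become a matter that factories take seriously. The operating status of equipment is mostly conveyed to on-site operators through sound signals, so that they can recognize a signal and carry out the corresponding handling action. Industrial instruction documents and factory operating manuals, however, mostly describe the sounds emitted by running equipment, and the corresponding handling methods, in text form. Readers often have to spend time working out what a sound described in text actually sounds like. Sound knowledge expressed as text therefore costs knowledge receivers time in reading and comprehension, and often fails to give employees a concrete and clear grasp of the sound signals and their corresponding handling measures.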
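To let knowledge receivers understand sound signals described in text through concrete auditory experience, this research develops a "sound knowledge vocalization" methodology that presents text-based sound knowledge to knowledge receivers in vocalized form. The research first collects, analyzes, and organizes documents containing sound knowledge (such as machinery instruction manuals or equipment operation manuals) to understand the structure and elements used to express sound knowledge. Based on the results of this analysis, it then builds a sound knowledge judgment lexicon, which serves as the basis for extracting target sentences containing sound knowledge from free-form knowledge documents. Finally, based on the lexicon and the sentence structures that express sound knowledge, the research develops the sound knowledge vocalization methodology. In detail, the methodology first marks the sentences in the full text of a free-form knowledge document, then filters the marked sentences to obtain candidate sentences that may contain sound knowledge, and then performs sentence association analysis on the candidates to obtain the sentences that do contain sound knowledge. Next, from these free-form target sentences it extracts the key content expressing three sound characteristics (sound type, volume, and duration) and converts it into structured content, which is used to distill the meaning of the sentences. Finally, the structured sound knowledge is presented in vocalized form, achieving the goal of sound knowledge vocalization.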
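In addition to developing the methodology, this research builds a sound knowledge vocalization information-sharing system based on it, and verifies the system using "equipment operation knowledge documents" as a case study to confirm the feasibility and performance of the methodology. The verification results show that the system needs only a certain amount of training data for its inference performance to reach a certain level. Overall, the sound knowledge vocalization model and techniques proposed in this research can effectively present sound knowledge to knowledge receivers in vocalized form, helping them quickly understand and absorb abstract sound knowledge and thereby raising the knowledge reuse rate.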
On a factory shop floor, operators use machines to perform manufacturing processes, and the operating status of a machine can usually be identified from the sounds it makes. Equipment user manuals, however, usually represent knowledge as text or illustrations, and knowledge receivers may spend considerable time interpreting text-based sound expressions. A vocalized representation scheme for text-based sound expressions can therefore help knowledge receivers recognize this type of knowledge efficiently and effectively.
This research aims to develop a knowledge vocalization methodology that converts knowledge content containing text-based sound expressions into vocalized expressions. The proposed methodology consists of three modules: Sound Expression Identification (SEI), Target Sentence Extraction and Formatting (TSEF), and Knowledge Content Vocalization (KCV). In the SEI module, the components that carry sound expressions are identified from sentences. Based on the identified sound components, the TSEF module extracts the target sentences containing sound expressions from free-form documents and expresses them as formatted matrices. In the KCV module, all formatted, text-based sound expressions are rendered as vocalized expressions. Because knowledge content containing sound expressions can be represented through knowledge vocalization, knowledge receivers can recognize the content efficiently and knowledge reuse is facilitated. In addition, a Web-based prototype system for vocalized knowledge sharing is developed on the basis of the proposed methodology, and equipment manuals are used to evaluate the feasibility and performance of the methodology.
As a whole, this research provides a knowledge representation and vocalization model that helps knowledge receivers acquire knowledge content containing sound expressions efficiently and accurately.
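The three-stage flow described in the abstract (identify sound expressions, extract and format target sentences, then vocalize) can be illustrated with a minimal Python sketch. This is not the thesis implementation: the lexicon entries, the names `SoundRecord`, `extract_records`, and `vocalization_plan`, and the use of English sentences are all assumptions made for illustration (the thesis works on Chinese documents and builds its own lexicon from equipment manuals). The sentence association analysis step is omitted, and the final step only produces a playback description rather than actual audio.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Hypothetical lexicons mapping surface words to normalized attribute values.
# The thesis derives its own lexicon from manuals; these entries are made up.
SOUND_TYPES = {"beep": "beep", "buzz": "buzz", "alarm": "alarm", "click": "click"}
VOLUMES = {"loud": "loud", "soft": "soft", "faint": "soft"}
DURATIONS = {"continuous": "long", "long": "long", "short": "short", "brief": "short"}


@dataclass
class SoundRecord:
    """Structured form of one target sentence (type, volume, duration)."""
    sentence: str
    sound_type: str
    volume: Optional[str]
    duration: Optional[str]


def split_sentences(text):
    """Mark sentence boundaries in free-form text (naive punctuation split)."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def first_match(words, lexicon):
    """Return the normalized value of the first word found in the lexicon."""
    for word in words:
        if word in lexicon:
            return lexicon[word]
    return None


def extract_records(text):
    """Keep sentences that mention a known sound type and format them."""
    records = []
    for sentence in split_sentences(text):
        words = re.findall(r"[a-z]+", sentence.lower())
        sound_type = first_match(words, SOUND_TYPES)
        if sound_type is None:
            continue  # no sound expression found, so not a target sentence
        records.append(
            SoundRecord(
                sentence=sentence,
                sound_type=sound_type,
                volume=first_match(words, VOLUMES),
                duration=first_match(words, DURATIONS),
            )
        )
    return records


def vocalization_plan(record):
    """Describe the sound to play; defaults are assumptions, not from the thesis."""
    return {
        "sound": record.sound_type,
        "volume": record.volume or "normal",
        "duration": record.duration or "short",
    }


if __name__ == "__main__":
    manual = (
        "Press the start button. "
        "A loud continuous alarm means the spindle is overheating. "
        "If you hear a short beep, the cycle has finished."
    )
    for rec in extract_records(manual):
        print(rec.sentence, "->", vocalization_plan(rec))
```

A real system would replace the dictionary lookup with the lexicon and sentence-structure rules developed in the thesis, and would map each plan to an audio clip or synthesized sound.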