研究生: |
葉禮宗 Li-Tzong Yeh |
---|---|
論文名稱: |
對話介面代理人-以推薦旅遊行程為例 Dialogue Interface Agent-with a Recommending Trip Plan Travel Agent |
指導教授: |
蘇豐文
Von-Wun Soo |
口試委員: | |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 資訊工程學系 Computer Science |
論文出版年: | 2002 |
畢業學年度: | 90 |
語文別: | 中文 |
論文頁數: | 53 |
中文關鍵詞: | 語音對話 、介面 、代理人 、本體論知識 、語音辨識 、旅遊代理人 |
外文關鍵詞: | Speech Dialogue, Interface, Agent, Ontology, Speech Recognition, Travel Agent |
相關次數: | 點閱:3 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
我們認為最直覺的介面使用方法就是語音對話,於是我們設計了一個對話介面代理人,作為服務系統的介面,將服務功能與介面功能分離獨立,導引使用者設定、使用。在本論文中,對話介面代理人與推薦旅遊行程代理人共同合作互動提供服務。我們利用了以隱藏式馬可夫模型 (Hidden Markov Models) 技術實作的中文語音辨識引擎、IBM ViaVoice 中文文字轉語音 (Text-to_Speech) 引擎、微軟代理人 (Microsoft Agent)來作支援,提供電腦系統中文語音說與聽的能力來對話,以及有直覺性視覺效果的動畫。由於語音的辨識有其極限,總是會有辨識錯誤的時候,加上背景待辨識字詞的數量影響辨識時間甚大,我們提出以控制背景待辨識字詞集的方式,配合對話策略,辨識錯誤的時候,系統能夠持續與使用者對話,找出使用者想要輸入的字詞。本論文最感興趣的是對於一般文字字詞辨識錯誤的處理。我們解決了語音辨識引擎有背景字詞數目上限的問題,對於觀察特徵少的字詞集、使用者記錯字的字詞、同音異字字詞的分別,我們都能對話找出答案字詞。語音輸入的字詞是否真的存在我們的背景字詞集中,我們也可以知道。此外有對話歷史功能,允許使用者從之前未完成的序列繼續對話。對系統領域不熟悉的使用者,可以請求系統支援導引查詢資訊。
Speech dialogue is the most intuitive presentation of computer system user interface. We propose a dialogue interface agent for the user interface agent of service agent systems. It separates the interface from service systems. The dialogue interface agent guides users to use service systems. In this thesis, the dialogue interface agent interacts with travel agent system to serve users. It employs the Chinese speech recognition engine supported by Hidden Markov Models technology and IBM ViaVoice Chinese Text-to-Speech engine and Microsoft Agent altogether providing the speaking and hearing abilities and furthermore the visual animations. Result from the limitation of speech recognition engine, sometimes, it may get wrong hearing; besides, the recognition response time is enormously correlated with the amount of words to be recognized. By controlling the words set to be recognized applying dialogue strategies, as wrong speech recognition happens it can dialogue with users continuously and smoothly to percept user’s demanded word. We are in particularly interested in the process of wrong speech recognition of general words. We solve the problem that there is limitation in the amount of words to be recognized. For the condition that word sets have few observation characters and the condition that users mistake speaking and the case to distinguish between homophones, we can still find out the correct users’ demanded words. We can also realize whether a word is in the valid word set. Moreover, by dialogue history records, users can restart their not-yet set-up dialogue items. For those who are not familiar with the domain knowledge of service systems, they could invite the dialogue interface agent to guide them to query related information.
[1] Von-Wun Soo and Shu-How Liang, Recommending a Trip Plan by Negotiation With a Travel Agent, Artificial Intelligence Lab CS Dept. National Tsing Hua University, Taiwan, 2001.
[2] HTK, Hidden Markov Model Toolkit V3.1, Speech Vision and Robotics Group of the Cambridge University Engineering Department, 2002. Page(s):96-117
[3] Rabiner L.R. and Juang B.H., An Introduction to Hidden Markov Models, IEEE ASSP Magazine, 1986. pp.4-16.
[4] Rebecca McKay and Ben Ryan, Microsoft Agent Software Development Kit, Microsoft Press, 1999.
[5] Jyh-Shing Roger Jang and Shiuan-Sung Lin, Optimisation of Viterbi Beam Search in Speech Recognition, Multimedia Information Retrieval Lab CS Dept. National Tsing Hua University, Taiwan, 2002.
[6] Von-Wun Soo and Hai-Long Cheng, Conducting the Disambiguation Dialogues Between Software Agent Sellers and Human Buyers, Artificial Intelligence Lab CS Dept. National Tsing Hua University, Taiwan, 2001.
[7] Andreas Kellner, Bernd Rueber, and Hauke Schramm, Strategies for Name Recognition in Automatic Directory Assistance Systems, Interactive Voice Technology for Telecommunications Applications, IVTTA '98. Proceedings. 1998 IEEE 4th Workshop , Page(s): 21 -26
[8] Zue, V.; Seneff, S.; Glass, J.R.; Polifroni, J.; Pao, C.; Hazen, T.J. and Hetherington, L, JUPlTER: A Telephone-based Conversational Interface for Weather Information, Speech and Audio Processing, IEEE Transactions on, Volume: 8 Issue: 1, Jan. 2000. Page(s): 85 -96
[9] Lamel, L; Rosset, S.; Gauvain, J.L.; Bennacef, S.; Garnier-Rizet, M. and Prouts, B., The LIMSI ARISE System [Rail Travel Information System], Interactive Voice Technology for Telecommunications Applications, IVTTA’98. Proceedings. IEEE 4th Workshop, Page(s): 209 -214
[10] Seneff, S. and Polifroni, J, A New Restaurant Guide Conversational System: Issues in Rapid Prototyping for Specialized Domains, Spoken Language, ICSLP 96, Proceedings, Fourth International Conference on, Volume: 2, 1996. Page(s): 665 -668 vol.2
[11] Timothy Bickmore and Justine Cassell, How About This Weather? Social Dialogue with Embodied Conversational Agents, Proceedings of the AAAI Fall Symposium on Socially Intelligent Agents, 2000. Page(s): 4-9
[12] Timothy Bickmore and Justine Cassell, Small Talk and Conversational Storytelling In Embodied Conversational Interface Agents, Proceedings of the AAAI Fall Symposium on Narrative Intelligence, 1999. Page(s): 87-93
[13] Noriko Suzuki, Kazuo Ishii and Michio Okada, Talking Eye: Autonomous Creature as Accomplice for Human, Computer Human Interaction, Proceedings. 3rd Asia Pacific , 1998. Page(s): 409-414
[14] Leila Amgoud, Nicolas Maudet, and Simon Parson, Modelling Dialogues Using Argumentation, Proceedings of the 4th International Conference on Multi-Agent Systems, Boston, 2000. Page(s):31-38
[15] Werner Kiebling, Stefan Fischer, Stefan Holland and Thorsten Ehm, Design and Implementation of COSIMA - A Smart and Speaking E-Sales Assistant, Advanced Issues of E-Commerce and Web-Based Information Systems, WECWIS 2001, Third International Workshop on. , 2001. Page(s):21-30
[16] Yasmine Arafa and Abe Mamsani, Face-to-Face Interaction with An Electronic Personal Sales Assistant, Systems, Man, and Cybernetics, 2000 IEEE International Conference on , Volume: 2 . Page(s):792-797
[17] Takashi Nishiyama, Shuji Murakami, Ryoji Nakajima, Kazuya Sawada and Osamu Katai, The Application of Interface Agent to A Health Care System, Systems, Man, and Cybernetics, IEEE SMC '99 Conference Proceedings. IEEE International Conference on , Volume: 1 , Page(s):750-755
[18] Stuart Russell and Peter Norvig, Artificial Intelligence – A Modern Approach, Prentice Hall, 1995. Page(s):226-247
[19] Bellifemine F., Poggi, A. and Rimassa, G., JADE – A FIPA-compliant Agent Framework, in Proceedings of PAAM'99, London, April 1999.
[20] FIPA Org, http://www.fipa.org/
[21] FIPA Interaction Protocol Library Specification.
See http://www.fipa.org/specs/fipa00025/XC00025D.pdf
[22] FIPA Agent Communication Language Specification.
See http://www.fipa.org/specs/fipa00003/OC000003.pdf