Authors
Lizhong Wu, Sharon L. Oviatt, Philip R. Cohen
Publication date
1999/12
Journal
IEEE Transactions on Multimedia
Volume
1
Issue
4
Pages
334-341
Publisher
IEEE
Description
We present a statistical approach to developing multimodal recognition systems and, in particular, to integrating the posterior probabilities of parallel input signals involved in the multimodal system. We first identify the primary factors that influence multimodal recognition performance by evaluating the multimodal recognition probabilities. We then develop two techniques, an estimate approach and a learning approach, which are designed to optimize accurate recognition during the multimodal integration process. We evaluate these methods using Quickset, a speech/gesture multimodal system, and report evaluation results based on an empirical corpus collected with Quickset. From an architectural perspective, the integration technique presented offers enhanced robustness. It also is premised on more realistic assumptions than previous multimodal systems using semantic fusion. From a methodological standpoint …
Total citations
1999200020012002200320042005200620072008200920102011201220132014201520162017201820192020202120222023202418201412151818161616191797968117874452
Scholar articles
L Wu, SL Oviatt, PR Cohen - IEEE Transactions on Multimedia, 1999