Authors
S. L. Oviatt
Publication date
1999
Conference
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '99)
Pages
576-583
Publisher
ACM Press
Description
As a new generation of multimodal/media systems begins to define itself, researchers are attempting to learn how to combine different modes into strategically integrated whole systems. In theory, well-designed multimodal systems should be able to integrate complementary modalities so that they support mutual disambiguation (MD) of errors, leading to more robust performance. In this study, over 2,000 multimodal utterances by both native and accented speakers of English were processed by a multimodal system, then logged and analyzed. The results confirmed that multimodal systems can indeed support significant levels of MD, with higher levels of MD for the more challenging accented users. As a result, although stand-alone speech recognition performed far more poorly for accented speakers, their multimodal recognition rates did not differ from those of native speakers. Implications are …
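The mutual disambiguation idea lends itself to a short illustration. Below is a minimal sketch, assuming a toy fusion scheme: each mode produces an n-best list of scored hypotheses, and a multimodal integrator ranks only semantically compatible speech-gesture pairs by a joint score. The function name fuse_nbest, the multiplicative scoring, and the example vocabulary are all illustrative assumptions, not the architecture used in the study. The point it demonstrates is that a misrecognition ranked first in one mode can be "pulled up" to the correct interpretation when the other mode offers no compatible partner.

```python
# Minimal sketch of mutual disambiguation (MD) over two n-best lists.
# All names, scores, and the compatibility test are illustrative
# assumptions, not the system evaluated in the paper.

from itertools import product

def fuse_nbest(speech_nbest, gesture_nbest, compatible):
    """Rank joint interpretations of a speech n-best list and a
    gesture n-best list, keeping only semantically compatible pairs.

    Each n-best list holds (hypothesis, score) pairs, higher scores
    better; `compatible` decides whether a speech hypothesis and a
    gesture hypothesis can combine into one multimodal command.
    """
    joint = [
        (s, g, s_score * g_score)  # naive multiplicative joint score
        for (s, s_score), (g, g_score) in product(speech_nbest, gesture_nbest)
        if compatible(s, g)
    ]
    return sorted(joint, key=lambda t: t[2], reverse=True)

# Toy example: the speech recognizer ranks a misrecognition first, but
# that hypothesis has no well-scored compatible gesture, so fusion
# promotes the second-ranked (correct) word -- mutual disambiguation.
speech = [("ditches", 0.6), ("bridges", 0.4)]       # error ranked first
gesture = [("line-on-river", 0.9), ("point", 0.1)]  # line drawn across a river

def compatible(word, ges):
    # Toy semantics: "bridges" pairs with a line gesture; "ditches" does not.
    return (word, ges) in {("bridges", "line-on-river"), ("ditches", "point")}

best = fuse_nbest(speech, gesture, compatible)[0]
print(best)  # ('bridges', 'line-on-river', 0.36): correct command wins
```

In this sketch neither recognizer alone recovers the right command, but the joint ranking does, which is the kind of error "pull-up" the study quantifies as MD.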
Total citations
[Citations per year, 1999–2024]