Authors
Susan T Dumais
Publication date
1991/6
Journal
Behavior Research Methods, Instruments, & Computers
Volume
23
Issue
2
Pages
229-236
Publisher
Springer-Verlag
Description
A major barrier to successful retrieval from external sources (e.g., electronic databases) is the tremendous variability in the words that people use to describe objects of interest. The fact that different authors use different words to describe essentially the same idea means that relevant objects will be missed; conversely, the fact that the same word can be used to refer to many different things means that irrelevant objects will be retrieved. We describe a statistical method called latent semantic indexing, which models the implicit higher order structure in the association of words and objects and improves retrieval performance by up to 30%. Additional large performance improvements of 40% and 67% can be achieved through the use of differential term weighting and iterative retrieval methods.
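The vocabulary problem and the latent-structure idea described above can be illustrated with a small sketch. Latent semantic indexing is commonly described as a truncated singular value decomposition (SVD) of a term-by-document matrix, and that is what this example assumes. The five-document corpus, the raw count weighting, the choice of k = 2 dimensions, and the helper names (term_vector, fold_in, cosine, rank) are all hypothetical and chosen for illustration; none of them come from the paper.

```python
import numpy as np

# Toy corpus (hypothetical). Document 0 is about cars but never uses
# the word "automobile"; documents 3 and 4 are about baking.
docs = [
    "car engine repair",
    "automobile engine maintenance",
    "car automobile dealer",
    "banana bread recipe",
    "bread baking recipe",
]

vocab = sorted({w for d in docs for w in d.split()})
index = {w: i for i, w in enumerate(vocab)}


def term_vector(text):
    """Raw term-count vector over the corpus vocabulary."""
    v = np.zeros(len(vocab))
    for w in text.split():
        if w in index:
            v[index[w]] += 1.0
    return v


# Term-by-document matrix: one row per term, one column per document.
A = np.column_stack([term_vector(d) for d in docs])

# Truncated SVD: keep only the k largest singular values.
U, s, Vt = np.linalg.svd(A, full_matrices=False)
k = 2  # assumed number of latent dimensions for this toy example
U_k, s_k, Vt_k = U[:, :k], s[:k], Vt[:k, :]

# Each row is one document projected onto the k latent dimensions.
doc_vecs = (np.diag(s_k) @ Vt_k).T


def fold_in(text):
    """Project a query into the same latent space as the documents."""
    return term_vector(text) @ U_k


def cosine(a, b):
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0


def rank(query):
    """Document indices ordered by decreasing latent-space similarity."""
    q = fold_in(query)
    scores = [cosine(q, d) for d in doc_vecs]
    return sorted(range(len(docs)), key=lambda i: -scores[i])


# Literal term overlap between "automobile" and document 0 is zero,
# yet the two are close in the latent space.
literal_overlap = float(term_vector("automobile") @ A[:, 0])
latent_similarity = cosine(fold_in("automobile"), doc_vecs[0])
```

In this sketch a query for "automobile" has no words in common with "car engine repair", so exact term matching would miss that document, while the latent-space comparison places it alongside the other vehicle documents and away from the baking ones. The differential term weighting and iterative retrieval mentioned in the abstract are not modeled here.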
Total citations
[Citations-per-year bar chart, 1992 to 2024; per-year counts are not recoverable from the extracted text]
Scholar articles
ST Dumais - Behavior Research Methods, Instruments, & Computers, 1991