Authors
Michael W Berry, Susan T Dumais, Gavin W O’Brien
Publication date
1995/12
Journal
SIAM review
Volume
37
Issue
4
Pages
573-595
Publisher
Society for Industrial and Applied Mathematics
Description
Currently, most approaches to retrieving textual materials from scientific databases depend on a lexical match between words in users’ requests and those in or assigned to documents in a database. Because of the tremendous diversity in the words people use to describe the same document, lexical methods are necessarily incomplete and imprecise. Using the singular value decomposition (SVD), one can take advantage of the implicit higher-order structure in the association of terms with documents by determining the SVD of large sparse term by document matrices. Terms and documents represented by 200–300 of the largest singular vectors are then matched against user queries. We call this retrieval method latent semantic indexing (LSI) because the subspace represents important associative relationships between terms and documents that are not evident in individual documents. LSI is a completely …
Total citations
199719981999200020012002200320042005200620072008200920102011201220132014201520162017201820192020202120222023202432335066828812315014515315514414412011810310393871027461502723354210
Scholar articles