MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  A Semidiscrete Matrix Decomposition for Latent Semantic Indexing in Information Retrieval (1998) [57 citations — 4 self]

Download:
Download as a PDF | Download as a PS
by Tamara G. Kolda, Dianne P. O'leary
ACM Transactions on Information Systems
http://csmr.ca.sandia.gov/~tgkolda/papers/umcp-cs-tr-3724.ps
Add To MetaCart

Abstract:

The vast amount of textual information available today is useless unless it can be effectively and efficiently searched. In information retrieval, we wish to match queries with relevant documents. Documents can be represented by the terms that appear within them, but literal matching of terms does not necessarily retrieve all relevant documents. Latent Semantic Indexing represents documents by approximations and tends to cluster documents on similar topics even if their term profiles are somewhat different. This approximate representation is usually accomplished using a low-rank singular value decomposition (SVD) approximation. In this paper, we use an alternate decomposition, the semi-discrete decomposition (SDD). In our tests, for equal query times, the SDD does as well as the SVD and uses less than one-tenth the storage. Additionally, we show how to update the SDD for a dynamically changing document collection. 1

Citations

1460 Indexing by latent semantic analysis – Deerwester, Dumais, et al. - 1990
881 Term-weighting approaches in automatic text retrieval – Salton, Buckley - 1998
754 The Algebraic Eigenvalue Problem – Wilkinson - 1965
563 Managing Gigabytes: Compressing and Indexing Documents and Images – Witten, Moffat, et al. - 1999
377 Using linear algebra for intelligent information retrieval – Berry, Dumais, et al. - 1995
364 Information Retrieval: Data Structures and Algorithms – Frakes, Baeza-Yates - 1992
172 Overview of the third text REtrieval conference (TREC-3), in Overview of the Third Text REtrieval Conference – Harman - 1995
170 Improving the Retrieval of Information from External Sources – Dumais - 1991
88 Stemming algorithms – Frakes - 1992
61 Ranking Algorithms, in – Harman - 1992
46 SVDPACKC (version 1.0) User's Guide – Berry, Do, et al. - 1993
40 Document retrieval and routing using the INQUERY system – Broglio, Callan, et al. - 1995
37 Low-rank orthogonal decompositions for information retrieval applications – Berry, Fierro - 1996
37 Latent semantic indexing (lsi): Trec-3 report – Dumais - 1995
28 Digital image compression by outer product expansion – O�Leary, Peleg - 1983
22 Approximating matrix multiplication for pattern recognition tasks – Cohen, Lewis - 1997
22 Information management tools for updating an SVD-encoded indexing scheme – O’Brien - 1994
20 Limited-memory matrix methods with applications – KOLDA - 1997
16 Frakes and Ricardo Baeza�Yates. Information Retrieval� Data Structures and Algorithms – William - 1992
9 Indexing by latent semantic analysis – dauer�, Harshman - 1990
8 Latent semantic indexing via a semi�discrete matrix decomposition – Kolda, O�Leary - 1997
6 Bidiagonalization of matrices and solution of linear equations – Paige - 1974
3 Improving the retrieval of infomation from external sources – Dumais - 1991
2 Latent sematic indexing (LSI): TREC-3 report – Dumais - 1995
1 A Semidiscrete Matrix Decomposition • 345 – BERRY, DO, et al. - 1993
1 The 3rd Text – HARMAN, ED - 1995
1 Descent property and global convergence of the Fletcher-Reeves method with inexact line search – Frakes - 1992
1 Theresa Do� Gavin O�Brien� Vijay Krishna� and Sowmini Varadhan. SVDPACKC �Version 1.0� Users� Guide – Berry� - 1993