Abstract
This paper discusses a few algorithms for updating the approximate singular value decomposition (SVD) in the context of information retrieval by latent semantic indexing (LSI) methods. A unifying framework is considered which is based on Rayleigh-Ritz projection methods. First, a Rayleigh-Ritz approach for the SVD is discussed and it is then used to interpret the Zha and Simon algorithms [SIAM J. Sci. Comput., 21 (1999),pp. 782-791]. This viewpoint leads to a few alternatives whose goal is to reduce computational cost and storage requirement by projection techniques that utilize subspaces of much smaller dimension. Numerical experiments show that the proposed algorithms yield accuracies comparable to those obtained from standard ones at a much lower computational cost.
Original language | English (US) |
---|---|
Pages (from-to) | 1105-1131 |
Number of pages | 27 |
Journal | SIAM Journal on Matrix Analysis and Applications |
Volume | 35 |
Issue number | 3 |
DOIs | |
State | Published - 2014 |
Bibliographical note
Publisher Copyright:© 2014 Society for Industrial and Applied Mathematics.
Keywords
- Latent semantic indexing
- Low-rank approximation
- Min-max characterization
- Rayleigh-Ritz procedure
- Ritz singular values
- Ritz singular vectors
- Singular value decomposition
- Text mining
- Updating algorithm