Results 1  10
of
190,532
2 Term Document Matrix
"... A term/document matrix is a matrix composed of columns of documents and rows of the terms that occur in each of the documents. Figure 1 is the term/document matrix generated as a result of the following threedocument set (These short examples are for illustrative purposes only). Document 1: After t ..."
Abstract
 Add to MetaCart
A term/document matrix is a matrix composed of columns of documents and rows of the terms that occur in each of the documents. Figure 1 is the term/document matrix generated as a result of the following threedocument set (These short examples are for illustrative purposes only). Document 1: After
Indexing by latent semantic analysis
 JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE
, 1990
"... A new method for automatic indexing and retrieval is described. The approach is to take advantage of implicit higherorder structure in the association of terms with documents (“semantic structure”) in order to improve the detection of relevant documents on the basis of terms found in queries. The p ..."
Abstract

Cited by 3779 (35 self)
 Add to MetaCart
. The particular technique used is singularvalue decomposition, in which a large term by document matrix is decomposed into a set of ca. 100 orthogonal factors from which the original matrix can be approximated by linear combination. Documents are represented by ca. 100 item vectors of factor weights. Queries
Concept Decompositions for Large Sparse Text Data using Clustering
 Machine Learning
, 2000
"... . Unlabeled document collections are becoming increasingly common and available; mining such data sets represents a major contemporary challenge. Using words as features, text documents are often represented as highdimensional and sparse vectorsa few thousand dimensions and a sparsity of 95 to 99 ..."
Abstract

Cited by 407 (27 self)
 Add to MetaCart
. Unlabeled document collections are becoming increasingly common and available; mining such data sets represents a major contemporary challenge. Using words as features, text documents are often represented as highdimensional and sparse vectorsa few thousand dimensions and a sparsity of 95
The Determinants of Credit Spread Changes.
 Journal of Finance
, 2001
"... ABSTRACT Using dealer's quotes and transactions prices on straight industrial bonds, we investigate the determinants of credit spread changes. Variables that should in theory determine credit spread changes have rather limited explanatory power. Further, the residuals from this regression are ..."
Abstract

Cited by 422 (2 self)
 Add to MetaCart
in the corporate bond market is driven by local supply/demand shocks that are independent of both changes in creditrisk and typical measures of liquidity. We note that a similar, but significantly smaller effect has been documented in the mortgage backed (Ginnie Mae) securities market by Boudoukh, Richardson
Latent semantic indexing: A probabilistic analysis
, 1998
"... Latent semantic indexing (LSI) is an information retrieval technique based on the spectral analysis of the termdocument matrix, whose empirical success had heretofore been without rigorous prediction and explanation. We prove that, under certain conditions, LSI does succeed in capturing the underl ..."
Abstract

Cited by 323 (7 self)
 Add to MetaCart
Latent semantic indexing (LSI) is an information retrieval technique based on the spectral analysis of the termdocument matrix, whose empirical success had heretofore been without rigorous prediction and explanation. We prove that, under certain conditions, LSI does succeed in capturing
Based on matrix methods Term–Document Matrix Start With Dictionary of Terms
"... Goal: Identify documents that best match users query ..."
From frequency to meaning : Vector space models of semantics
 Journal of Artificial Intelligence Research
, 2010
"... Computers understand very little of the meaning of human language. This profoundly limits our ability to give instructions to computers, the ability of computers to explain their actions to us, and the ability of computers to analyse and process text. Vector space models (VSMs) of semantics are begi ..."
Abstract

Cited by 347 (3 self)
 Add to MetaCart
are beginning to address these limits. This paper surveys the use of VSMs for semantic processing of text. We organize the literature on VSMs according to the structure of the matrix in a VSM. There are currently three broad classes of VSMs, based on term–document, word–context, and pair–pattern matrices
UNIVERSITY OF PATRAS Design of a MATLAB toolbox for termdocument matrix generation
, 2005
"... Data clustering and many other fundamental operations in data mining and information retrieval are built using computational kernels from numerical linear algebra and operate on very large, sparse termdocument matrices. To facilitate these tasks, we have built TMG, a toolbox for the generation and ..."
Abstract

Cited by 1 (0 self)
 Add to MetaCart
Data clustering and many other fundamental operations in data mining and information retrieval are built using computational kernels from numerical linear algebra and operate on very large, sparse termdocument matrices. To facilitate these tasks, we have built TMG, a toolbox for the generation
Augmentation of a Term/Document Matrix with Partof Speech Tags to Improve Accuracy of Latent Semantic Analysis
"... Abstract: We consider the improvement in accuracy of latent semantic analysis when a part of speech tagger is used to augment a term/document matrix. We first construct an augmented term/document matrix as input into singular value decomposition (SVD). The singular values then serve as principal co ..."
Abstract

Cited by 1 (0 self)
 Add to MetaCart
Abstract: We consider the improvement in accuracy of latent semantic analysis when a part of speech tagger is used to augment a term/document matrix. We first construct an augmented term/document matrix as input into singular value decomposition (SVD). The singular values then serve as principal
Results 1  10
of
190,532