• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 9,465
Next 10 →

Document Classification

by Benjamin Walsham In , 2003
"... viii 1 ..."
Abstract - Add to MetaCart
Abstract not found

document classification∗

by unknown authors , 2010
"... ar ..."
Abstract - Add to MetaCart
Abstract not found

One-Class SVMs for Document Classification

by Larry M. Manevitz, Malik Yousef, Nello Cristianini, John Shawe-taylor, Bob Williamson - Journal of Machine Learning Research , 2001
"... We implemented versions of the SVM appropriate for one-class classification in the context of information retrieval. The experiments were conducted on the standard Reuters data set. For the SVM implementation we used both a version of Schölkopf et al. and a somewhat different version of one-class SV ..."
Abstract - Cited by 185 (3 self) - Add to MetaCart
We implemented versions of the SVM appropriate for one-class classification in the context of information retrieval. The experiments were conducted on the standard Reuters data set. For the SVM implementation we used both a version of Schölkopf et al. and a somewhat different version of one

Text Classification from Labeled and Unlabeled Documents using EM

by Kamal Nigam, Andrew Kachites Mccallum, Sebastian Thrun, Tom Mitchell - MACHINE LEARNING , 1999
"... This paper shows that the accuracy of learned text classifiers can be improved by augmenting a small number of labeled training documents with a large pool of unlabeled documents. This is important because in many text classification problems obtaining training labels is expensive, while large qua ..."
Abstract - Cited by 1033 (15 self) - Add to MetaCart
This paper shows that the accuracy of learned text classifiers can be improved by augmenting a small number of labeled training documents with a large pool of unlabeled documents. This is important because in many text classification problems obtaining training labels is expensive, while large

Document Classification with LSA and

by Murat Ahat, Sofiane Ben Amor, Marc Bui, Ra Jhean-larose, Cnrs Ums, Murat Ahat, Sofiane Ben Amor, Marc Bui, Ra Jhean-larose, Guy Denhière
"... ABSTRACT. Latent semantic analysis is a computation method to demonstrate a major com-ponent of language learning and use. Thus, in this sense, it is a theory of meaning, such that it applies to and offers an explanation of phenomena of meaning in words and passages of words. This enables LSA to hol ..."
Abstract - Add to MetaCart
to hold a strong position in the automated document classification, document analysis, etc. Though the experiments show that LSA can reach a very high accuracy in document classification, it also depends on the various factors such as quality and amount of training documents, characteristics

Knowledge Discovery for Document Classification

by Chidanand Apt, Fred Damerau, Sholom Weiss - AAAI-93 Workshop on Knowledge Discovery in Databases, Washington D.C., AAAI , 1993
"... We report on extensive experiments using rule-based induction methods for document classification. The goal is to automatically discover patterns in document classifications, potentially surpassing humans who currently read and classify these documents. By using a decision rule model, we induce resu ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
We report on extensive experiments using rule-based induction methods for document classification. The goal is to automatically discover patterns in document classifications, potentially surpassing humans who currently read and classify these documents. By using a decision rule model, we induce

Features for Unsupervised Document Classification

by S H Srinivasan, Sh_srinivasansatyam Com - Proceedings of CoNLL-2002
"... Unsupervised document classification is an important problem in practical text mining since training data is seldom available. In this paper we study the problem of term selection and the performance of various features for unsuper- vised text classification. The features studied are: principal comp ..."
Abstract - Cited by 4 (0 self) - Add to MetaCart
Unsupervised document classification is an important problem in practical text mining since training data is seldom available. In this paper we study the problem of term selection and the performance of various features for unsuper- vised text classification. The features studied are: principal

Intelligent Document Classification

by Rafael A. Calvo, H. A. Ceccatto - Intelligent Data Analysis , 2000
"... In this work we investigate some technical questions related to the application of neural networks in document classification. First, we discuss the effects of different averaging protocols for the 2 statistic used to remove non-informative terms. This is an especially relevant issue for the n ..."
Abstract - Cited by 17 (6 self) - Add to MetaCart
In this work we investigate some technical questions related to the application of neural networks in document classification. First, we discuss the effects of different averaging protocols for the 2 statistic used to remove non-informative terms. This is an especially relevant issue

Abstract The tasks of document classification and

by unknown authors
"... sentiment classification have been explored in the literature, but to our knowledge the task of political classification has not. We use a modified form of a document classification algorithm (Hu and Liu, 2004) to classify newspapers as liberal, conservative, or neutral based on their text. By using ..."
Abstract - Add to MetaCart
sentiment classification have been explored in the literature, but to our knowledge the task of political classification has not. We use a modified form of a document classification algorithm (Hu and Liu, 2004) to classify newspapers as liberal, conservative, or neutral based on their text

Document Classification using Implication

by László Kovács, Tibor Répási
"... Abstract: The document categorization is a very importatnt part of the current information retrieval systems for the WEB. The main problem of the text categorization is to find appropriate methods to manage the large amount of different words In this paper we describe a proposal for a theasurus-base ..."
Abstract - Add to MetaCart
-based document classification. The graph edges are generated from the implication rules calculated witha frequent itemset uncovering algorithm. 1
Next 10 →
Results 1 - 10 of 9,465
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University