• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 1,357,808
Next 10 →

Recognizing Text Similarity

by Ozlem Uzuner, All Davis, Boris Katz
"... Overview: There are a variety of circumstances under which it would be useful to determine that two documents contain similar text, including detecting plagiarism and copyright infringement, and filtering and organizing documents returned by a search engine. The vast amount of digital information av ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
Overview: There are a variety of circumstances under which it would be useful to determine that two documents contain similar text, including detecting plagiarism and copyright infringement, and filtering and organizing documents returned by a search engine. The vast amount of digital information

A Reflective View on Text Similarity

by Daniel Bär, Torsten Zesch, Iryna Gurevych
"... www.ukp.tu-darmstadt.de While the concept of similarity is well grounded in psychology, text similarity is less well-defined. Thus, we analyze text similarity with respect to its definition and the datasets used for evaluation. We formalize text similarity based on the geometric model of conceptual ..."
Abstract - Cited by 7 (4 self) - Add to MetaCart
www.ukp.tu-darmstadt.de While the concept of similarity is well grounded in psychology, text similarity is less well-defined. Thus, we analyze text similarity with respect to its definition and the datasets used for evaluation. We formalize text similarity based on the geometric model of conceptual

Linguistically Fuelled Text Similarity

by Björn Andrist, Martin Hassel
"... This paper describes TEXTSIM, a system for determining the similarity between texts. Further, we show the results of a comparison between two various configurations of TEXTSIM; one with and one without any deeper linguistic analysis. To evaluate and compare the two models of TEXTSIM we used two sets ..."
Abstract - Add to MetaCart
This paper describes TEXTSIM, a system for determining the similarity between texts. Further, we show the results of a comparison between two various configurations of TEXTSIM; one with and one without any deeper linguistic analysis. To evaluate and compare the two models of TEXTSIM we used two

A Survey of Text Similarity Approaches

by Wael H. Gomaa, Aly A. Fahmy
"... Measuring the similarity between words, sentences, paragraphs and documents is an important component in various tasks such as information retrieval, document clustering, word-sense disambiguation, automatic essay scoring, short answer grading, machine translation and text summarization. This survey ..."
Abstract - Cited by 6 (1 self) - Add to MetaCart
Measuring the similarity between words, sentences, paragraphs and documents is an important component in various tasks such as information retrieval, document clustering, word-sense disambiguation, automatic essay scoring, short answer grading, machine translation and text summarization

Comparing different text similarity methods

by Junpeng Bao, Caroline Lyon, Peter C. R. Lane, Wei Ji, James A. Malcolm , 2007
"... This paper reports experiments on a corpus of news articles from the Financial Times, comparing different text similarity models. First the Ferret system using a method based solely on lexical similarities is used, then methods based on semantic similarities are inves-tigated. Different feature stri ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
This paper reports experiments on a corpus of news articles from the Financial Times, comparing different text similarity models. First the Ferret system using a method based solely on lexical similarities is used, then methods based on semantic similarities are inves-tigated. Different feature

TEXT ENTAILMENT VERIFICATION WITH TEXT SIMILARITIES

by Doina T Ătar, Mihaiela Lupea
"... Abstract. This paper presents a new method for recognizing the text entailment obtained from the text-to-text metric introduced in [3] and from the modified resolution introduced in [12]. In [11], using the directional measure of similarity as presented in [3], which measures the semantic similarity ..."
Abstract - Add to MetaCart
Abstract. This paper presents a new method for recognizing the text entailment obtained from the text-to-text metric introduced in [3] and from the modified resolution introduced in [12]. In [11], using the directional measure of similarity as presented in [3], which measures the semantic

DKPro Similarity: An Open Source Framework for Text Similarity

by Daniel Bär, Torsten Zesch, Iryna Gurevych
"... www.ukp.tu-darmstadt.de We present DKPro Similarity, an open source framework for text similarity. Our goal is to provide a comprehensive repository of text similarity measures which are implemented using standardized interfaces. DKPro Similarity comprises a wide variety of measures ranging from one ..."
Abstract - Cited by 8 (1 self) - Add to MetaCart
www.ukp.tu-darmstadt.de We present DKPro Similarity, an open source framework for text similarity. Our goal is to provide a comprehensive repository of text similarity measures which are implemented using standardized interfaces. DKPro Similarity comprises a wide variety of measures ranging from

Efficient similarity search in sequence databases

by Rakesh Agrawal, Christos Faloutsos, Arun Swami , 1994
"... We propose an indexing method for time sequences for processing similarity queries. We use the Discrete Fourier Transform (DFT) to map time sequences to the frequency domain, the crucial observation being that, for most sequences of practical interest, only the first few frequencies are strong. Anot ..."
Abstract - Cited by 505 (21 self) - Add to MetaCart
We propose an indexing method for time sequences for processing similarity queries. We use the Discrete Fourier Transform (DFT) to map time sequences to the frequency domain, the crucial observation being that, for most sequences of practical interest, only the first few frequencies are strong

Machine Learning in Automated Text Categorization

by Fabrizio Sebastiani - ACM COMPUTING SURVEYS , 2002
"... The automated categorization (or classification) of texts into predefined categories has witnessed a booming interest in the last ten years, due to the increased availability of documents in digital form and the ensuing need to organize them. In the research community the dominant approach to this p ..."
Abstract - Cited by 1658 (22 self) - Add to MetaCart
The automated categorization (or classification) of texts into predefined categories has witnessed a booming interest in the last ten years, due to the increased availability of documents in digital form and the ensuing need to organize them. In the research community the dominant approach

An evaluation of statistical approaches to text categorization

by Yiming Yang - Journal of Information Retrieval , 1999
"... Abstract. This paper focuses on a comparative evaluation of a wide-range of text categorization methods, including previously published results on the Reuters corpus and new results of additional experiments. A controlled study using three classifiers, kNN, LLSF and WORD, was conducted to examine th ..."
Abstract - Cited by 664 (23 self) - Add to MetaCart
Abstract. This paper focuses on a comparative evaluation of a wide-range of text categorization methods, including previously published results on the Reuters corpus and new results of additional experiments. A controlled study using three classifiers, kNN, LLSF and WORD, was conducted to examine
Next 10 →
Results 1 - 10 of 1,357,808
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University