See this document in CiteSeerX!

Author Identification on the Large Scale  (Make Corrections)  
David Madigan, Alexander Genkin, David D. Lewis, Shlomo Argamon, Dmitriy Fradkin, Li Ye



  Home/Search   Context   Related

 
View or download:
rutgers.edu/~dfrad...uthoridcsna05.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  rutgers.edu/~dfradkin/pap...index (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: this paper is on techniques for identifying authors in large collections of textual artifacts (e-mails, communiques, transcribed speech, etc.). Our approach focuses on very high-dimensional, topic-free document representations and particular attribution problems, such as: (1) Which one of these K authors wrote this particular document? (2) Did any of these K authors write this particular document? Scientific investigation into measuring style and authorship of texts goes back to the late... (Update)

Active bibliography (related documents):   More   All
2.5:   Bayesian Multinomial Logistic Regression for Author.. - Madigan, Genkin..   (Correct)
1.1:   Style Mining of Electronic Messages for Multiple Authorship .. - Argamon, Saric, al. (2003)   (Correct)
1.0:   Multi-Topic E-mail Authorship Attribution Forensics - de Vel, Anderson, Corney.. (2001)   (Correct)

System load high. Please wait...
Timeout. Please try your query later.
Similar documents based on text:
0.0:   Unknown -   (Correct)

BibTeX entry:   (Update)

@misc{ madigan-author,
  author = "David Madigan and Alexander Genkin and David D. Lewis and Shlomo Argamon
    and Dmitriy Fradkin and Li Ye",
  title = "Author Identification on the Large Scale",
  url = "citeseer.ist.psu.edu/742899.html" }
Citations (may not include all citations):
2133   Pattern Classification and Scene Analysis (context) - Duda, Hart - 1973
667   UCI repository of machine learning databases (context) - Hettich, Merz - 1998
317   Learning quickly when irrelevant attributes abound: A new li.. (context) - Littlestone - 1988
205   An Introduction To Support Vector Machines (context) - Cristianini, Shawe-Taylor - 2000
139   Machine learning in automated text categorization - Sebastiani - 2002
83   Regression shrinkage and selection via the lasso - Tibshirani - 1996
50   Sparse bayesian learning and the relevance vector machine - Tipping - 2001
45   Variations Across Speech and Writing (context) - Biber - 1988
25   Shallow parsing with conditional random fields - Sha, Pereira - 2003
24   Text categorization based on regularized linear classifiers - Zhang, Oles - 2001
20   A comparison of algorithms for maximum entropy parameter est.. - Malouf - 2002
20   Least angle regression - Efron, Hastie et al. - 2004
16   Neural network applications in stylometry: The federalist pa.. (context) - Tweedie, Singh et al. - 1996
13   Authorship attribution with support vector machines - Diederich, Kindermann et al. - 2000
11   Stylistic Experiments for Information Retrieval - Karlgren - 2000
8   Automatic text categorization in terms of genre and author (context) - Stamatatos, Kokkinakis et al. - 2000
8   Large-scale bayesian logistic regression for text categoriza.. (context) - Genkin, Lewis et al. - 2004
7   The characteristic curves of composition (context) - Mendenhall
6   Lexical co-occurrence: The missing link (context) - Smadja - 1989
6   A loss function analysis for classification methods in text .. - Li, Yang - 2003
6   Visualization of literary style (context) - Kjell, Frieder - 1992
6   Psychological aspect natural language useour word (context) - Mehl, Psychological et al. - 2003
5   Automatically categorizing written texts by author gender - Koppel, Argamon et al. - 2003
5   Routing documents according to style - Argamon, Koppel et al. - 1998
5   Identifying the authors of suspect e-mail (context) - Corney, Anderson et al. - 2001
4   IEEE Transactions on Pattern Analysis and Machine Intelligen.. (context) - Figueiredo, for - 2003
3   Disputed authorship resolution using relative entropy for ma.. (context) - Khmelev - 2000
3   and writing style in formal written texts (context) - Argamon, Koppel et al. - 2003
3   Robustness of regularized linear classification methods in t.. - Zhang, Yang - 2003
3   Learning to classify documents according to genre - Finn, Kushmerick - 2003
3   Shakespeare vs Fletcher: A stylometric analysis by radial ba.. (context) - Lowe, Matthews - 1995
3   Computation into Criticism: A Study of Jane Austen's Novels .. (context) - Burrows - 1987
3   Probabilistic authortopic models for information discovery - Steyvers, Smyth et al. - 2004
3   Series in behavioral science: Quantitative methods edition (context) - Mosteller, Wallace et al. - 1964
3   mail text authorship for forensic purposes (context) - Corney - 2003
2   Feature finding for text classification (context) - Forsyth, Holmes - 1996
2   Outside the cave of shadows: Using syntactic annotation to e.. (context) - Baayen, van Halteren et al. - 1996
2   Multinomial logistic regression algorithm (context) - Bohning - 1972
2   author identification using only citations (context) - Hill, Provost et al. - 2003
2   An equivlance between sparse approximation and support vecto.. (context) - Girosi - 1998
2   Sparse multinomial logistic regression: Fast algorithms and .. (context) - Krishnapuram, Hartemink et al. - 2005
2   Style mining of electronic messages for multiple author disc.. - Argamon, Saric et al. - 2003
1   Models and methods of automatic reading (context) - Pereversev-Orlov - 1976
1   Curves of pauline and pseudo-pauline style ii (context) - Mascol
1   cient algorithm for gene selection using sparse logistic reg.. (context) - Shevade, Keerthi et al. - 2003
1   Style-based text categorization: What newspaper am i reading (context) - Argamon-Engelson, Koppel et al. - 1998
1   Language and gender author cohort analysis of e-mail for com.. - de Vel, Corney et al. - 2002
1   Authorship attribution: the case of oliver goldsmith (context) - Mannion, Dixon - 1997
1   Exploiting stylistic idiosyncrasies for authorship attributi.. - Koppel, Schler - 2003
1   Curves of pauline and pseudo-pauline style (context) - Mascol

Documents on the same site (http://mms-03.rutgers.edu/~dfradkin/papers/index.html):   More
Image Compression in Real-Time Multiprocessor Systems.. - Fradkin, Muchnik.. (2003)   (Correct)
Clusters With Core-Tail Hierarchical Structure And Their.. - Dmitriy Fradkin Ilya   (Correct)
Prospective Data Fusion for Batch Filtering - Anghelescu, Boros, Fradkin.. (2003)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC