Results 1  10
of
12,166
Assessing agreement on classification tasks: the kappa statistic
 Computational Linguistics
, 1996
"... Currently, computational linguists and cognitive scientists working in the area of discourse and dialogue argue that their subjective judgments are reliable using several different statistics, none of which are easily interpretable or comparable to each other. Meanwhile, researchers in content analy ..."
Abstract

Cited by 846 (9 self)
 Add to MetaCart
analysis have already experienced the same difficulties and come up with a solution in the kappa statistic. We discuss what is wrong with reliability measures as they are currently used for discourse and dialogue work in computational linguistics and cognitive science, and argue that we would be better off
Active Learning with Statistical Models
, 1995
"... For manytypes of learners one can compute the statistically "optimal" way to select data. We review how these techniques have been used with feedforward neural networks [MacKay, 1992# Cohn, 1994]. We then showhow the same principles may be used to select data for two alternative, statist ..."
Abstract

Cited by 679 (10 self)
 Add to MetaCart
, statisticallybased learning architectures: mixtures of Gaussians and locally weighted regression. While the techniques for neural networks are expensive and approximate, the techniques for mixtures of Gaussians and locally weighted regression are both efficient and accurate.
Statistical phrasebased translation
, 2003
"... We propose a new phrasebased translation model and decoding algorithm that enables us to evaluate and compare several, previously proposed phrasebased translation models. Within our framework, we carry out a large number of experiments to understand better and explain why phrasebased models outpe ..."
Abstract

Cited by 944 (11 self)
 Add to MetaCart
outperform wordbased models. Our empirical results, which hold for all examined language pairs, suggest that the highest levels of performance can be obtained through relatively simple means: heuristic learning of phrase translations from wordbased alignments and lexical weighting of phrase translations
Online Supplement for: DIAGNOSIS OF SLEEP APNEA BY AUTOMATIC ANALYSIS OF NASAL PRESSURE AND FORCED OSCILLATION IMPEDANCE APPENDIX A Weighted Kappa Statistic
"... this set of weights specifically disfavors episodes classified as central apneas by one scorer and as obstructive apneas by the other (weight 0.25). The weighted observed proportional agreement between the two scorers is obtained as i g j ij ij w o n w N ) ( . 1 Abbreviating the row and column t ..."
Abstract
 Add to MetaCart
totals of the table of frequencies for the ith category by j ij i n r and i ij j n c , the weighted proportional agreement expected just by chance is estimated by i g j j i ij w e c r w N 2 ) ( . Then, weighted kappa, which may be interpreted as the chancecorrected weighted proportional
Additive Logistic Regression: a Statistical View of Boosting
 Annals of Statistics
, 1998
"... Boosting (Freund & Schapire 1996, Schapire & Singer 1998) is one of the most important recent developments in classification methodology. The performance of many classification algorithms can often be dramatically improved by sequentially applying them to reweighted versions of the input dat ..."
Abstract

Cited by 1750 (25 self)
 Add to MetaCart
data, and taking a weighted majority vote of the sequence of classifiers thereby produced. We show that this seemingly mysterious phenomenon can be understood in terms of well known statistical principles, namely additive modeling and maximum likelihood. For the twoclass problem, boosting can
A statistical interpretation of term specificity and its application in retrieval
 Journal of Documentation
, 1972
"... Abstract: The exhaustivity of document descriptions and the specificity of index terms are usually regarded as independent. It is suggested that specificity should be interpreted statistically, as a function of term use rather than of term meaning. The effects on retrieval of variations in term spec ..."
Abstract

Cited by 589 (3 self)
 Add to MetaCart
Abstract: The exhaustivity of document descriptions and the specificity of index terms are usually regarded as independent. It is suggested that specificity should be interpreted statistically, as a function of term use rather than of term meaning. The effects on retrieval of variations in term
The spread of obesity in a large social network over 32 years
 NEW ENGLAND JOURNAL OF MEDICINE
, 2007
"... The prevalence of obesity has increased substantially over the past 30 years. We performed a quantitative analysis of the nature and extent of the persontoperson spread of obesity as a possible factor contributing to the obesity epidemic. Methods We evaluated a densely interconnected social networ ..."
Abstract

Cited by 509 (23 self)
 Add to MetaCart
network of 12,067 people assessed repeatedly from 1971 to 2003 as part of the Framingham Heart Study. The bodymass index was available for all subjects. We used longitudinal statistical models to examine whether weight gain in one person was associated with weight gain in his or her friends, siblings
Linear models and empirical bayes methods for assessing differential expression in microarray experiments.
 Stat. Appl. Genet. Mol. Biol.
, 2004
"... Abstract The problem of identifying differentially expressed genes in designed microarray experiments is considered. Lonnstedt and Speed (2002) derived an expression for the posterior odds of differential expression in a replicated twocolor experiment using a simple hierarchical parametric model. ..."
Abstract

Cited by 1321 (24 self)
 Add to MetaCart
from spot filtering or spot quality weights. The posterior odds statistic is reformulated in terms of a moderated tstatistic in which posterior residual standard deviations are used in place of ordinary standard deviations. The empirical Bayes approach is equivalent to shrinkage of the estimated
Image denoising using a scale mixture of Gaussians in the wavelet domain
 IEEE TRANS IMAGE PROCESSING
, 2003
"... We describe a method for removing noise from digital images, based on a statistical model of the coefficients of an overcomplete multiscale oriented basis. Neighborhoods of coefficients at adjacent positions and scales are modeled as the product of two independent random variables: a Gaussian vecto ..."
Abstract

Cited by 513 (17 self)
 Add to MetaCart
We describe a method for removing noise from digital images, based on a statistical model of the coefficients of an overcomplete multiscale oriented basis. Neighborhoods of coefficients at adjacent positions and scales are modeled as the product of two independent random variables: a Gaussian
Estimating the Support of a HighDimensional Distribution
, 1999
"... Suppose you are given some dataset drawn from an underlying probability distribution P and you want to estimate a "simple" subset S of input space such that the probability that a test point drawn from P lies outside of S is bounded by some a priori specified between 0 and 1. We propo ..."
Abstract

Cited by 783 (29 self)
 Add to MetaCart
of the weight vector in an associated feature space. The expansion coefficients are found by solving a quadratic programming problem, which we do by carrying out sequential optimization over pairs of input patterns. We also provide a preliminary theoretical analysis of the statistical performance of our
Results 1  10
of
12,166