67 citations found. Retrieving documents...
S. Kullback and R. A. Leibler. On information and sufficiency. The Annals of Mathematical Statistics, 22:79-- 86, 1951.

 Home/Search   Document Not in Database   Summary   Related Articles   Check  

This paper is cited in the following contexts:

First 50 documents  Next 50

Bayesian Probability - Bruyninckx (2002)   (Correct)

....The relative information measure (often called mutual information ) H(p, q) of two continuous PDFs p(x) and q(x) is defined as: H(p, q) p(x) ln p(x) q(x) dx. 15) 16 This scalar is also called the Kullback Leibler divergence, after the duo that first presented it, 22] [23]) or also mutual entropy, or cross entropy, of both probability measures p(x) and m(x) 22, 28] It is a (coordinate independent) measure for how much information one needs to go from the probability distribution m(x) to the probability distribution p(x) As Shannon s entropy, H p(x) m(x) ....

S. Kullback and R. A. Leibler. On information and su#ciency. The Annals of Mathematical Statistics, 22:79--86, 1951.


A Distance Measure for Bounding Probabilistic Belief Change - Chan, Darwiche (2003)   (Correct)

....a number of theoretical results about the proposed measure and then present two of its key applications. On the theoretical side, we prove that our proposed measure satisfies the three properties of distance. We also contrast our distance measure with classical measures, including KL divergence [12] where we present some results on its ability to bound belief changes. Specifically, we show that belief change between two states of belief can be unbounded, even when their KL divergence tends to zero. We show, however, that KL divergence can be used to bound the average change in beliefs as ....

....however, that KL divergence can be used to provide an average case bound on belief changes and we also provide a relationship between that bound and ours. We start first with Kullback Leibler (KL) divergence, which is one of the most common measures for comparing probability distributions [12]. Definition 3.1 Let P r and P r # be two probability distributions over the same set of worlds w. The KL divergence between P r and P r # is defined as: P r(w) ln . The first thing to note about KL divergence is that it is incomparable with our distance measure. Example 3.1 ....

S. Kullback and R. A. Leibler. On Information and Su#ciency. Annals of Mathematical Statistics, volume 22, pages 79-86, 1951.


New Lower Bounds for Statistical Query Learning - Ke Yang April (2002)   (Correct)

....d 100 and ED [g(X; f i (X) C g 1 i (2) for every i = 1; 2; d. where C g is a constant that is independent of the target concept f i . We call C g the inherent bias of g. 3. 2 The KL divergence We present a de nition of the KL divergence, or Kullback Leibler divergence [KL51]. De nition 4 (KL divergence) For 2 random variables P , Q with identical support, we de ne their KL divergence to be KL(P jjQ) P (x) log : 3) The KL divergence is also known as the relative entropy. The reader is referred to [KL51, K59, B95, CT91] for a comprehensive treatise on ....

....the KL divergence, or Kullback Leibler divergence [KL51] De nition 4 (KL divergence) For 2 random variables P , Q with identical support, we de ne their KL divergence to be KL(P jjQ) P (x) log : 3) The KL divergence is also known as the relative entropy. The reader is referred to [KL51, K59, B95, CT91] for a comprehensive treatise on the KL divergence. We state several properties of the KL divergence and postpone their proofs to Appendix A. These properties are highly unlikely to be original. We list them here only for the completeness of the paper and for the readers convenience. Lemma 2 ....

S. Kullback and R. A. Leibler, On Information and Suciency, Annals of Mathematical Statistics 22, pp. 79-86, 1951.


An Adaptive Fusion Architecture for Target Tracking - Gareth Loy Luke (2002)   (2 citations)  (Correct)

....cues is assumed to give the best estimate of the true PDF P (e t t ) across the state space . So the performance of the j cue can be quantified by measuring how closely the cue s PDF P (e j,t t ) matches t ) This can be done using the relative entropy, or the Kullback Leibler distance [6], an information theoretic measure of how accurate an approximation one PDF is to another, given by # t (P (e t t ) s t t ) log P (e j,t where s t are the particle states at time t. Soto and Khosla [8] used this metric to rate the performance of their cues, and Triesh and ....

S. Kullback and R. A. Leibler. On information and sufficiency. Annals of Mathematical Statistics, 22(1):79--86, March 1951.


Using Probability Trees to Compute Marginals with Imprecise.. - Cano, Moral (2000)   (2 citations)  (Correct)

.... The approximation should contain the same normalization factor, i.e. x2U I T (x) x2U I OE(x) If we denote by p T and p OE the probability distributions proportional to T and OE respectively, the distance from a tree T to a potential OE is measured by Kullback Leibler s cross entropy [32] of p T and p OE : D(OE; T ) D(p OE ; p T ) Gamma x I 2U I p OE (x I ) log p OE (x I ) p T (x I ) 7) Proposition 1 [6] Let OE be a potential over a set of variables X I , and J I. If a tree T is such that every leaf T contains the value y J =x OE(y) jU I GammaJ j, then T ....

S. Kullback and R. A. Leibler. On information and sufficiency. Annals of Mathematical Statistics, 22:76--86, 1951.


Fusion of Domain Knowledge with Data for Structural Learning .. - Langseth, Nielsen (2003)   (1 citation)  (Correct)

....standard model, the learned CPTs of N may have other domains than the CPTs of . Hence a global measure for the di erence between the gold standard model and the estimated model is required. In the tests performed, we have measured this di erence by using the Kullback Leibler (KL) divergence [37] between the gold standard model and the estimated model. The KL divergence is de ned as D f jj f N x f(xj ) log f(xj ) 15) There are many arguments for using this particular measurement for calculating the quality of the approximation, see [10] One of them is the fact ....

Solomon Kullback and Richard A. Leibler. On information and suciency. Annals of Mathematical Statistics, 22:79-86, 1951.


Hypothesis Testing Over Factorizations for Data Association - Ihler, III, Willsky (2003)   (Correct)

....across sensors. 2.1 Mutual Information Mutual information is a quantity characterizing the statistical dependence between two random variables. Although most widely known for its application to communications (see e.g. 1] here it arises in the context of discrimination and hypothesis testing [2]. Correlation is equivalent to mutual information only for jointly Gaussian random variables. The common assumption of Gaussian distributions and its computational e#ciency have given it wide applicability to association problems. However, there are many forms of dependency which are not captured ....

S. Kullback and R. A. Leibler. On information and su#ciency. Annals of Mathematical Statistics, 22:79--86, 1951.


Ontology Enrichment with Texts from the WWW - Faatz, Steinmetz (2002)   (1 citation)  (Correct)

....reals with , which will be called configuration of the rule set . Each corresponds to a rule . The configuration k decides about the quantities of dissimilarity we derive from . The Kullback Leibler divergence generally measures the dissimilarity between two probability mass functions [6] and was applied successfully to statistical language modelling and prediction problems [7] The Kullback Leibler D(x,y) divergence for two words x, y is defined as In the basic version of the Kullback Leibler divergence, which is expressed by formula (1) w is a linguistic property and is the ....

Kullback, S. and Leibler, R.A.: On Information and Sufficiency, Annals of MAthematical Statistics 22, 1951


Deriving a Bayesian Probability Network from Data - Macus (1991)   (Correct)

....and is defined as the difference between the information held in the actual distribution and the information held in the approximating distribution. By subtracting these two quantities as in 16 Equation 2.4, we end up with the measure given Vx)log Vx) Vx)log Va(x) 2.4) in Equation 2. 5 [13]. V(x) 2.5) I(V,V, V(x) log V(x This quantity is equal to 0 if and only if P(x) P(x) for all x. In all other cases, it is positive, growing in value as the two distributions increase their divergence. This distance measure is one key to developing a method to construct a dependence tree. ....

S. Kullback and R. A. Leibler. On information and sufficiency. The Annals of Mathematical Statistics, 1951.


Mixtures of ARMA Models for Model-Based Time Series Clustering - Xiong, Yeung (2002)   (Correct)

....The BCD method first transforms each time series into a Markov chain, with its dynamics represented simply by a transition probability matrix. Next, it goes through an agglomerative procedure by trying to merge the two closest Markov chains at each step, using the Kullback Leibler divergence [13] as the dissimilarity (cf. distance) measure between transition probability matrices. Based on a greedy heuristic search approach, this procedure continues until the resulting model is found to be less probable than the model before merging. Thus the number of clusters can be determined ....

S. Kullback and R.A. Leibler. On information and su#ciency. Annals of Mathematical Statistics, 22:79--86, 1951.


An Analysis of Quantitative Measures Associated with Rules - Yao, Zhong (1999)   (Correct)

....of AS(H E) Smyth and Goodman [28] defined the information content of rules. For E J(H AS(H = G(E) AS(H log IND(E,H) AS(H log IND(H,E) a b) b d) 20) This measure is closely related to the divergence measure proposed by Kullback and Leibler [12]. 3.3 Two way support The measure of independence IND has been used by many authors. Silverstein et al. 27] referred to it as a measure of interest. Buchter and Wirth [3] regarded it as a measure of dependence. Gray and Orlowska [10] used the same measure, and provided the interpretation given ....

Kullback, S. and Leibler, R.A. On information and su#ciency, Annals of Mathematical Statistics, 22, 79-86, 1951.


Learning the Statistics of People in Images and Video - Sidenbladh, Black (2001)   (7 citations)  (Correct)

....0.15 0.11 0.15 0.60 0.68 0.46 0.63 The Bhattacharyya distance [20] provides one measure of similarity. Given two distributions Pon and Poff over the variable y, the Bhattacharyya distance between them is 5B(Pon,Poff) log f y Pon(Y)Poff(Y)dy. 3) Alternatively, the Kullback Leibler divergence [22] between Pon and Poff is given by 5KL(Pon,Poff) Port(Y) log Pon(Y) dy. 4) po(y) Note, the Kullback Leibler divergence is asymmetric, which strictly means that it is not a distance metric, but it still provides a measure of the difference between the two distributions. Below we use these ....

Kullback, S. and R. A. Leibler: 1951, 'On information and sufficiency'. Annals of Mathematical Statistics 22, 79-86.


A Probabilistic Network - Versus Decision Rules   (Correct)

....that we can characterize a Markov distribution by an entropy function. We now outline how we learn the dependency structure of a Markov distribution. Our method for learning the dependency structure of a Markov distribution is based on minimizing the Kullback Leibler I(OE; OE cross entropy [3], a measure of closeness between a true distribution OE and an approximate distribution OE . The cross entropy is defined by: OE(x) log OE(x) x) where x is a configuration (tuple) of the set of attributes R = fA 1 ; Am g. With a fixed OE, we can choose, from the set ....

S. Kullback and R. Leibler. Information and sufficiency. Annals of Mathematical Statistics, 22:79--86, 1951.


Survey Of Clustering Data Mining Techniques - Berkhin (2002)   (18 citations)  (Correct)

....similarity. Two attributes (two columns in data matrix X) with exactly the same probability distributions are identical for the purpose of data mining and so one can be deleted. Attributes that have probability distributions that are close in terms of their Kullback Leibler (KL) distance (see [KL51]) can still be grouped together without much of an impact. In addition, a natural derived attribute, the mixed distribution (a normalized sum of two columns) is now available to represent the group. This process can be generalized. The grouping simplifies the original matrix X to the compressed ....

Kullback, S. and Leibler, R.A. On information and sufficiency. Annals of Mathematical Statistics, 22, 76-86, 1951.


Evolutionary Synthesis of Bayesian Networks for Optimization - Mühlenbein, Mahnig   Self-citation (Information)   (Correct)

No context found.

S. Kullback and Leibler. On information and suffiency. Annals of Mathematical Statistics, 22:76--86, 1951.


Submitted to AUTOMATICA, July 20th, 1999 - The Geometry Of   (Correct)

No context found.

S. Kullback and R. A. Leibler. On information and sufficiency. The Annals of Mathematical Statistics, 22:79-- 86, 1951.


Large Alphabets and Incompressibility - Gagie (2006)   (Correct)

No context found.

S. Kullback and R.A. Leibler. On information and su#ciency. Annals of Mathematical Statistics, 22:79--86, 1951.


Fusion of Domain Knowledge with Data for Structural Learning .. - Langseth, Nielsen (2003)   (1 citation)  (Correct)

No context found.

Solomon Kullback and Richard A. Leibler. On information and sufficiency. Annals of Mathematical Statistics, 22:79--86, 1951.


Auto-Supervised Learning in the Bayesian Programming.. - Dangauthier.. (2005)   (Correct)

No context found.

Kullback S. and Leibler. On information and sufficiency. Annals of Mathematical Statistics, (22):79--86, 1951.


Variational Inference Guide - Lawrence (2002)   (Correct)

No context found.

S. Kullback and R. A. Leibler. On information and suciency. Annals of Mathematical Statistics, 22:7986, 1951. 6


The Analysis and Applications of Adaptive-Binning Color Histograms - Leow, Li   (Correct)

No context found.

S. Kullback and R. A. Leibler. On information and su#ciency. Annals of Mathematical Statistics, 22:79--86, 1951. 34


Recognition and Retrieval Via Histogram Trees - Stuart Gibson And (2001)   (Correct)

No context found.

S. Kullback and R.A. Leibler. On information and sufficiency. Annals of Mathematical Statistics, 22(1):79--86, Mar 1951.


Comparing Dissimilarity Measures for Probabilistic Symbolic.. - Malerba, Monopoli   (Correct)

No context found.

Kullback, S., Leibler, R.A.: On information and sufficiency. Annals of Mathematical Statistics, vol. 22, pp. 76-86, 1951.


Stock Returns Distributions: The Emergent Behaviour of a.. - Collie   (Correct)

No context found.

S. Kullback and R. Leibler. On information and su#ciency. Annals of Mathematical Statistics, 22:79--86, 1951.


Initial Description of Multi-Modal Dynamic Models - Karny, Nedoma, Nagy, Valeckova   (Correct)

No context found.

S. Kullback and R. Leibler, \On information and suciency," Annals of Mathematical Statistics, vol. 22, 1951.

First 50 documents  Next 50

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC