40 citations found. Retrieving documents...
T. Bailey and C. Elkan. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In Proc Int. Conf. Intell. Syst. Mol. Biol., pp. 28-36, 1994.

 Home/Search   Document Details and Download   Summary   Related Articles   Check  

This paper is cited in the following contexts:

First 50 documents

From Promoter Sequence to Expression: - Probabilistic Framework Eran   (Correct)

....sequence data to the Regulates variables. Experimental biology has shown that transcription factors bind to relatively short sequences, and that there can be some variability in the binding site sequences. Thus, most standard approaches to uncovering transcription factor binding sites, e.g. [1, 26, 29], search for relatively short sequence motifs in the bound promoter sequences. A common way of representing the variability within the binding site is by using a position specific scoring matrix (PSSM) Suppose we are searching for motifs of length k (or less) A PSSM w is a k 4 matrix, ....

....recall that each gene g has a promoter sequence g:S1 ; g:Sn , where each S i 2 fA; C; G; Tg. The standard approaches to learning PSSM is by training a probabilistic model of binding sites that maximizes the likelihood of sequences (given the assignment of the regulates variables) [1, 26]. These approaches rely on a clear probabilistic semantics of PSSM scores. We denote by 0 the probability distribution over nucleotides according to the background model. For simplicity, we use a Markov process of order 0 for the background distribution. As we will see, the choice of ....

[Article contains additional citation context not shown here]

T.L. Bailey and Elkan C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In Proc. Int. Conf. Intell. Syst. Mol. Biol., volume 2, pages 28--36. 1994.


Statistical Identi - Cation Of Uniformly   (Correct)

....probability that O has been generated by independent tosses of a single coin or by a procession of multiple, coins selected by performing a random walk in the set of coins. More formally, we de ne an n coin model as an n state Hidden Markov Model similar to many other applications of HMMs [4]. Note that employing HMMs in the context of this paper is quite natural. Without any a priori information about which positions in the alignment sequence a coin switch is more likely, it is plausible to assume independent and identical distributions for the coin switch probabilities; this in turn ....

T. Bailey, C. Elkan. Fitting a mixture model by expectation maximization to discover motifs in biopolymers, Proceedings of ISMB 1994, AAAI Press.


Performance Comparison of Algorithms for Finding Transcription .. - Sinha, Tompa (2003)   (1 citation)  (Correct)

....overrepresented motifs that are good candidates for being transcription factor binding sites. There are numerous motif finding programs from Current address: Box 25, The Rockefeller University, 1230 York Ave, New York, NY 10021, saurabhlonnrot.rockefeller.edu which to choose, including MEME [1], Consensus [5] AlignACE [8] Oligo Analysis [11] YMF [9, 10] AnnSpec [12] and Projection [4] and little guidance to select among them. The only previous performance comparisons that have been done for some of these programs are by Pevzner and Sze [7] and by Buhler and Tompa [4] using ....

....Sze [7] and by Buhler and Tompa [4] using simulated data generated ac cording to a motif model that may not be accurate for transcription factor binding sites. In this paper, we compare the accuracy of an enu merative motif finding algorithm YMF [9, 10] to that of two popular contenders, MEME [1] and AlignACE [8] The comparison is done on real data sets from yeast, and on synthetic data with planted motifs. See Blanchette et al. 2] for an analogous perfor mance comparison among phylogenetic footprinting algorithms. The experiments on synthetic data in dicate that each algorithm s ....

T. L. Bailey and C. Elkan. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pages 28-36, Menlo Park, CA, 1994. AAAI Press.


A String Pattern Regression Algorithm and Its.. - Bannai, Inenaga.. (2002)   (Correct)

....we would like to find patterns which are conserved in the sequences, generally preferring longer patterns which appear in most of the sequences in the set. Solving such a problem ranges from a plethora of multiple alignment methods, and numerous other methods such as Gibbs sampling [10] or MEME [1]. Another situation is when we have more information at hand, and are given a second set of sequences as an explicit negative data set that is known not to have characteristics of (or known to have di#erent characteristics from) the first set, and the problem is to find a pattern which match most ....

Bailey, T. L. and Elkan, C., Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc. Second International Conference on Intelligent Systems for Molecular Biology, AAAI Press, 28--36, 1994.


Modeling Dependencies in Protein-DNA Binding Sites - Barash, Elidan, Friedman.. (2003)   (4 citations)  (Correct)

....is examining the promoter regions of a set of genes that have common functional annotation or are co expressed. In this case, the discovered motif indicates a possibly unknown factor that regulates the set of genes. Many works in recent years have proposed different schemes to handle this task [3, 4, 24, 31, 39, 40]. Both tasks require us to describe a motif that characterizes sequences that appear at binding sites of the transcription factor. The biological literature suggests that the relevant sequences are relatively short (up to 20bp long) Moreover, although binding sites are quite conserved, they do ....

....preamble to learning motif models, we describe how we model regulated sequences. We use a generarive approach to describe the probabilistic processes that could have generated the promoter sequences. Our model is similar to the model used by probabilistic approaches for PSSM learning (such as MEME [3]) with the important difference that we allow for a general binding site model as described in section 3. The model assumes that each sequence S (SO) S ) can be generated in two ways. It is either regulated by the transcription factor T and so contains a single binding site from our model ....

[Article contains additional citation context not shown here]

TL Bailey and C Elkan. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In 1SMB'94. 1994.


From Promoter Sequence to Expression: A Probabilistic Framework - Segal, Barash (2002)   (4 citations)  (Correct)

....sequence data to the Regulates variables. Experimental biology has shown that transcription factors bind to relatively short sequences, and that there can be some variability in the binding site sequences. Thus, most standard approaches to uncovering transcription factor binding sites, e.g. [1, 26, 29], search for relatively short sequence motifs in the bound promoter sequences. A common way of representing the variability within the binding site is by using a position speci c scoring matrix (PSSM) Suppose we are searching for motifs of length k (or less) A PSSM w is a k 4 matrix, ....

....recall that each gene g has a promoter sequence g:S1 ; g:Sn , where each S i 2 fA; C; G; Tg. The standard approaches to learning PSSM is by training a probabilistic model of binding sites that maximizes the likelihood of sequences (given the assignment of the regulates variables) [1, 26]. These approaches rely on a clear probabilistic semantics of PSSM scores. We denote by 0 the probability distribution over nucleotides according to the background model. For simplicity, we use a Markov process of order 0 for the background distribution. As we will see, the choice of ....

[Article contains additional citation context not shown here]

T.L. Bailey and Elkan C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In Proc. Int. Conf. Intell. Syst. Mol. Biol., volume 2, pages 2836. 1994.


From Promoter Sequence to Expression: A Probabilistic Framework - Segal, Barash, Simon (2002)   (4 citations)  (Correct)

....sequence data to the Regulates variables. Experimental biology has shown that transcription factors bind to relatively short sequences, and that there can be some variability in the binding site sequences. Thus, most standard approaches to uncovering transcription factor binding sites, e.g. [1, 26, 29], search for relatively short sequence motifs in the bound promoter sequences. A common way of representing the variability within the binding site is by using aposition specific scoring matrix (PSSM) Suppose we are searching for motifs of length k (or less) A PSSM t is a k x 4 matrix, ....

....t from o notation. Furthermore, recl that each gene 9 has a promoter sequence 9.S, 9.S, where each Si A, C, G, T . The standd approaches to leaning PSSM is by ning a probabilistic model of binding sites that maximizes the likelihood of sequences (given the assignment of the regulates viables) [1, 26]. These approaches rely on a cle probabilistic semantics of PSSM scores. We denote by 0o the probability distribution over nucleotides according to the background model. For simplicity, we use a Mkov process of order 0 for the backound distribution. As we will see, the choice of background model ....

[Article contains additional citation context not shown here]

T.L. Bailey and Elkan C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In Proc. Int. Conf Intell. Syst. Mol. Biol., volume 2, pages 28-36. 1994.


Unsupervised Activity Discovery and.. - Hamid, Maddi..   (Correct)

No context found.

T. Bailey and C. Elkan. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In Proc Int. Conf. Intell. Syst. Mol. Biol., pp. 28-36, 1994.


Improving Activity Discovery with Automatic Neighborhood.. - David Minnen Thad   (Correct)

No context found.

T.L. Bailey and C. Elkan. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In Second International Conference on Intelligent Systems for Molecular Biology, pages 28--36. AAAI Press, 1994.


Discovering Characteristic Actions from On-Body Sensor Data - David Minnen Thad   (Correct)

No context found.

T. Bailey and C. Elkan. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In Second International Conference on Intelligent Systems for Molecular Biology, pages 28--36. AAAI Press, 1994.


Predicting Protein Folds with Structural Repeats Using a.. - Liu, Xing, Carbonell (2005)   (Correct)

No context found.

Bailey, T. L., & Elkan, C. (1994). Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc. of ISMB'94.


Dispersal and regulation of an adaptive mutagenesis.. - Erill, Campoy, Mazón.. (2006)   (Correct)

No context found.

Bailey, T.L. and Elkan, C. (1994) Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc. Int. Conf. Intell. Syst. Mol. Biol., 2, 28-36.


Using a Hybrid Adaboost Algorithm to Integrate.. - Sun, Robinson.. (2006)   (Correct)

No context found.

T. L. Bailey and C. Elkan, "Fitting a mixture model by expectation maximization to discover motifs in biopolymers," Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, 28-36, AAAI Press, 1994.


An Extension and Novel Solution to the (l,d)-Motif.. - Styczynski.. (2004)   (Correct)

No context found.

Bailey, T.L. and Elkan, C., Fitting a mixture model by expectation maximization to discover motifs in biopolymers, Proc.Int.Conf.Intell.Syst.Mol.Biol., 2:28--36, 1994.


Integrating Binding Site Predictions Using Meta.. - Sun, Robinson.. (2005)   (Correct)

No context found.

Bailey, T.L. & Elkan, C. (1994) Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology: 28-36, AAAI Press.


Graphical Approach to Weak Motif Recognition - Yang, Rajapakse (2004)   (Correct)

No context found.

Bailey, T.L. and Elkan, C., Fitting a mixture model by expectation maximization to discover motifs in biopolymers, 2nd ISMB., 28--36, 1994.


Uusing Real-Valued Metaclassifiers To Integrate.. - Sun, Robinson.. (2005)   (Correct)

No context found.

T. L. Bailey and C. Elkan, "Fitting a mixture model by expectation maximization to discover motifs in biopolymers," Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, 28-36, AAAI Press, 1994.


Motif-based Protein Sequence Classification Using Neural.. - Blekas, Fotiadis, Likas   (Correct)

No context found.

Bailey, T. L. and Elkan, C. (1994). Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In Second International Conference on Intelligent Systems for Molecular Biology, pages 28--36, Menlo Park, California. AAAI Press.


Probabilistic Graphical Models for Computational.. - Moreau, Antal, Fannes, De .. (2003)   (Correct)

No context found.

Bailey TL, Elkan C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc Int Conf Intell Syst Mol Biol. 1994;2:28-36.


Unknown -   (Correct)

No context found.

Bailey,T.L. and Elkan,C. (1994) Fitting a mixture model by expectation maximization to discover motifs in biopolymers. ISMB, 2, 28--36.


Modeling Dependencies in Protein-DNA Binding Sites - Barash, Elidan, Friedman.. (2003)   (4 citations)  (Correct)

No context found.

TL Bailey and C Elkan. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In ISMB'94. 1994.


MADCAP: MicroArray Data Characterization And Profiling. - Tool For Profiling (2003)   (Correct)

No context found.

Timothy L. Bailey and Charles Elkan, Fitting a mixture model by expectation maximization to discover motifs in biopolymers, Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California, 1994.


Unknown - Touzain Loria Fr (2003)   (Correct)

No context found.

T. Bailey and C. Elkan, Fitting a mixture model by expectation maximization to discover motifs in biopolymers, Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California, 1994.


Monocyte-Derived-Macrophage Gene Expression During HIV-1 - Infection Christopher Woelk (2003)   (Correct)

No context found.

T. L. Bailey and C. Elkan, Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proceedings of the International Conference of Intelligent Systems in Moleclular Biology, 2:28-36, 1994.


A Combinatorial Approach for Motif Discovery in Unaligned DNA.. - Liu (2004)   (Correct)

No context found.

T.L. Bailey and C. Elkan. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In Proceedings of the 2nd International Conference on Intelligent Systems for Molecular Biology, volume 2, pages 28-- 36. AAAI Press, 1994.

First 50 documents

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC