Results 11 - 20
of
55
The importance of bottlenecks in protein networks: Correlation with gene essentiality and expression dynamics
- PLoS Comput. Biol
, 2007
"... It has been a long-standing goal in systems biology to find relations between the topological properties and functional features of protein networks. However, most of the focus in network studies has been on highly connected proteins (‘‘hubs’’). As a complementary notion, it is possible to define bo ..."
Abstract
-
Cited by 10 (1 self)
- Add to MetaCart
It has been a long-standing goal in systems biology to find relations between the topological properties and functional features of protein networks. However, most of the focus in network studies has been on highly connected proteins (‘‘hubs’’). As a complementary notion, it is possible to define bottlenecks as proteins with a high betweenness centrality (i.e., network nodes that have many ‘‘shortest paths’ ’ going through them, analogous to major bridges and tunnels on a highway map). Bottlenecks are, in fact, key connector proteins with surprising functional and dynamic properties. In particular, they are more likely to be essential proteins. In fact, in regulatory and other directed networks, betweenness (i.e., ‘‘bottleneck-ness’’) is a much more significant indicator of essentiality than degree (i.e., ‘‘hub-ness’’). Furthermore, bottlenecks correspond to the dynamic components of the interaction network—they are significantly less well coexpressed with their neighbors than nonbottlenecks, implying that expression dynamics is wired into the network topology.
BioMiner - Modeling, analyzing, and visualizing biochemical pathways and networks
, 2002
"... Motivation: Understanding the biochemistry of a newly sequenced organism is an essential task for post-genomic analysis. Since, however, genome and array data grow much faster than biochemical information, it is necessary to infer reactions by comparative analysis. No integrated and easy to use soft ..."
Abstract
-
Cited by 9 (2 self)
- Add to MetaCart
Motivation: Understanding the biochemistry of a newly sequenced organism is an essential task for post-genomic analysis. Since, however, genome and array data grow much faster than biochemical information, it is necessary to infer reactions by comparative analysis. No integrated and easy to use software tool for this purpose exists as yet.
Identifying regulatory subnetworks for a set of genes
- Molecular and Cellular Proteomics
, 2005
"... High throughput genomic/proteomic strategies, such as microarray studies, drug screens, and genetic screens, often produce a list of genes that are believed to be important for one or more reasons. Unfortunately it is often difficult to discern meaningful biological relationships from such lists. Th ..."
Abstract
-
Cited by 7 (1 self)
- Add to MetaCart
High throughput genomic/proteomic strategies, such as microarray studies, drug screens, and genetic screens, often produce a list of genes that are believed to be important for one or more reasons. Unfortunately it is often difficult to discern meaningful biological relationships from such lists. This study presents a new bioinformatic approach that can be used to identify regulatory subnetworks for lists of significant genes or proteins. We demonstrate the utility of this approach using an interaction network for yeast constructed from BIND, TRANS-FAC, SCPD, and chromatin immunoprecipitation (ChIP)-Chip data bases and lists of genes from well known metabolic pathways or differential expression experiments. The approach accurately rediscovers known regulatory elements of the heat shock response as well as
The MAPPER database: a multi-genome catalog of putative transcription factor binding sites
- Nucleic Acids Res
, 2005
"... We describe a comprehensive map of putative transcription factor binding sites (TFBSs) across multiple genomes created using a search method that relies on hidden Markov models built from experimentally determined TFBSs. Using the information in the TRANSFAC and JASPAR databases, we built 1134 model ..."
Abstract
-
Cited by 7 (2 self)
- Add to MetaCart
We describe a comprehensive map of putative transcription factor binding sites (TFBSs) across multiple genomes created using a search method that relies on hidden Markov models built from experimentally determined TFBSs. Using the information in the TRANSFAC and JASPAR databases, we built 1134 models for TFBSs and used them to scan regions 10 kb upstream of the start of the transcript for all known genes in the human, mouse and Drosophila melanogaster genomes. The results, together with homology information on clusters of ortholog genes across the three genomes, were used to create a multiorganism catalog of annotated TFBSs. The catalog can be queried through a web interface accessible at
Sequence features of DNA binding sites reveal structural class of associated transcription factor
- Bioinformatics
, 2006
"... Motivation: A key goal in molecular biology is to understand the mechanisms by which a cell regulates the transcription of its genes. One important aspect of this transcriptional regulation is the binding of transcription factors (TFs) to their specific cis-regulatory counterparts on the DNA. TFs re ..."
Abstract
-
Cited by 7 (1 self)
- Add to MetaCart
Motivation: A key goal in molecular biology is to understand the mechanisms by which a cell regulates the transcription of its genes. One important aspect of this transcriptional regulation is the binding of transcription factors (TFs) to their specific cis-regulatory counterparts on the DNA. TFs recognize and bind their DNA counterparts according to the structure of their DNA-binding domains (e.g. zinc finger, leucine zipper, homeodomain). The structure of thesedomains can be used as a basis for grouping TFs into classes. Although the structure of DNAbinding domains varies widely across TFs generally, the TFs within a particular class bind to DNA in a similar fashion, suggesting the existence of class-specific features in the DNA sequences bound by each class of TFs. Results: In this paper, we apply a sparse Bayesian learning algorithm to identify a small set of class-specific features in the DNA sequences bound by different classes of TFs; the algorithm simultaneously learns a true multi-class classifier that uses these features to predict the DNA-binding domain of the TF that recognizes a particular set of DNA sequences. We train our algorithm on the six largest classes in TRANSFAC, comprising a total of 587 TFs. We learn a six-class classifier for this training set that achieves 87 % leave-one-out crossvalidation accuracy. We also identify features within cis-regulatory sequences that are highly specific to each class of TF, which has significant implications for how TF binding sites should be modeled for the purpose of motif discovery.
Bioinformatic Principles Underlying the Information Content of Transcription Factor Binding Sites
- Journal of Theoretical Biology
, 2003
"... Empirically, it has been observed in several cases that the information content of transcription factor binding site sequences (465798-:79;<97 ) approximately equals the information content of binding site positions (4>=@? 7A8-:79;
Abstract
-
Cited by 6 (3 self)
- Add to MetaCart
Empirically, it has been observed in several cases that the information content of transcription factor binding site sequences (465798-:79;<97 ) approximately equals the information content of binding site positions (4>=@? 7A8-:79;<CB ). A general framework for formal models of transcription factors and binding sites is developed to address this issue. Measures for information content in transcription factor binding sites are revisited and theoretic analyses are compared on this basis. These analyses do not lead to consistent results. A comparative review reveals that these inconsistent approaches do not include a transcription factor state space.
Module Networks: Discovering Regulatory Modules and their Condition Specific Regulators from Gene Expression Data
- Nature Genetics
, 2003
"... Introduction The complex functions of a living cell are carried out through the concerted activity of many genes and gene products. This activity is often coordinated by the organization of Computer Science Department, Stanford University, Stanford, California, 94305, USA. Department of Geneti ..."
Abstract
-
Cited by 6 (0 self)
- Add to MetaCart
Introduction The complex functions of a living cell are carried out through the concerted activity of many genes and gene products. This activity is often coordinated by the organization of Computer Science Department, Stanford University, Stanford, California, 94305, USA. Department of Genetics, Stanford University School of Medicine, Stanford, California, 94305, USA. Dept. of Cell Research and Immunology, Tel Aviv U. & Computer Science Dept., Weizmann Inst., Israel. School of Computer Science & Engineering, Hebrew University, Jerusalem, 91904, Israel. * These authors contributed equally to this manuscript. # Correspondence should be addressed to E.S. (eran@cs.stanford.edu) or D.K. (koller@cs.stanford.edu). Supplementary information: attached pdf and http://www.cs.stanford.edu/~eran/module_nets/ (username: modnet, password: modnet-review). the genome into regulatory modules, or sets of co-regulated genes that share a common function. Such is the case for most of the m
Whole-genome comparative annotation and regulatory motif discovery in multiple yeast species; 2003
- Proceedings of the 7th International Conference on Research in Computational Molecular Biology 2003, 7
, 2003
"... In [13] we reported the genome sequences of S. paradoxus, S. mikatae and S. bayanus and compared these three yeast species to their close relative, S. cerevisiae. Genome-wide comparative analysis allowed the identification of functionally important sequences, both coding and non-coding. In this comp ..."
Abstract
-
Cited by 5 (0 self)
- Add to MetaCart
In [13] we reported the genome sequences of S. paradoxus, S. mikatae and S. bayanus and compared these three yeast species to their close relative, S. cerevisiae. Genome-wide comparative analysis allowed the identification of functionally important sequences, both coding and non-coding. In this companion paper we describe the mathematical and algorithmic results underpinning the analysis of these genomes. We developed methods for the automatic comparative annotation of the four species and the determination of orthologous genes and intergenic regions. The algorithms enabled the automatic identification of orthologs for more than 90 % of genes despite the large number of duplicated genes in the yeast genome, and the discovery of recent gene family expansions and genome rearrangements. We also developed a test to validate
Classification of common conserved sequences in mammalian intergenic regions
- Hum. Mol. Genet
, 2002
"... Comparisons between orthologous intergenic regions of related genomes reveal numerous hits, i.e. pairs of relatively short highly similar sequences that evolved slowly, perhaps due to selective constraint. We analyzed and classified 2638 hits found within 100 pairs of complete, orthologous intergeni ..."
Abstract
-
Cited by 4 (2 self)
- Add to MetaCart
Comparisons between orthologous intergenic regions of related genomes reveal numerous hits, i.e. pairs of relatively short highly similar sequences that evolved slowly, perhaps due to selective constraint. We analyzed and classified 2638 hits found within 100 pairs of complete, orthologous intergenic regions of human and murine genomes. We identified all common fragments of hits that align well with many other hits and constructed their classification. Our analysis revealed 20 abundant classes each containing 10 or more fragments. Fragments of the same class may perform the same function, e.g. bind a particular protein. Ten of the abundant classes apparently correspond to known functional consensuses, whereas others may represent novel conserved sites. Thus, large-scale comparative analysis of slowly evolving intergenic sequences can provide valuable insights into their function.
Introduction to Bioinformatics
- Electronic Journal of Biotechnology
, 2004
"... Comparative genomics reveals unusually long motifs in mammalian genomes Vol. 22 no. 14 2006, pages e236–e242 doi:10.1093/bioinformatics/btl265 ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
Comparative genomics reveals unusually long motifs in mammalian genomes Vol. 22 no. 14 2006, pages e236–e242 doi:10.1093/bioinformatics/btl265

