• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 14,028
Next 10 →

The diploid genome sequence of an individual human

by Samuel Levy, Granger Sutton, Pauline C. Ng, Lars Feuk, Aaron L. Halpern, Brian P. Walenz, Nelson Axelrod, Jiaqi Huang, Ewen F. Kirkness, Gennady Denisov, Yuan Lin, Jeffrey R. Macdonald, Andy Wing, Chun Pang, Mary Shago, Timothy B. Stockwell, Alexia Tsiamouri, Vineet Bafna, Vikas Bansal, Saul A. Kravitz, Dana A. Busam, Karen Y. Beeson, Tina C. Mcintosh, Karin A. Remington, Josep F. Abril, John Gill, Jon Borman, Yu-hui Rogers, Marvin E. Frazier, Stephen W. Scherer, Robert L. Strausberg, J. Craig Venter - PLoS Biol
"... Presented here is a genome sequence of an individual human. It was produced from;32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given r ..."
Abstract - Cited by 293 (6 self) - Add to MetaCart
region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4

Prediction of complete gene structures in human genomic DNA

by Chris Burge, Samuel Karlin - J. Mol. Biol , 1997
"... The problem of identifying genes in genomic DNA sequences by computational methods has attracted considerable research attention in recent years. From one point of view, the problem is closely ..."
Abstract - Cited by 1177 (9 self) - Add to MetaCart
The problem of identifying genes in genomic DNA sequences by computational methods has attracted considerable research attention in recent years. From one point of view, the problem is closely

Haploview: analysis and visualization of LD and haplotype maps

by J. C. Barrett, B. Fry, J. Maller, M. J. Daly - BIOINFORMATICS , 2005
"... Summary: Research over the last few years has revealed significant haplotype structure in the human genome. The characterization of these patterns, particularly in the con-text of medical genetic association studies, is becoming a routine research activity. Haploview is a software pack-age that prov ..."
Abstract - Cited by 1275 (6 self) - Add to MetaCart
Summary: Research over the last few years has revealed significant haplotype structure in the human genome. The characterization of these patterns, particularly in the con-text of medical genetic association studies, is becoming a routine research activity. Haploview is a software pack

Similarity search in high dimensions via hashing

by Aristides Gionis, Piotr Indyk, Rajeev Motwani , 1999
"... The nearest- or near-neighbor query problems arise in a large variety of database applications, usually in the context of similarity searching. Of late, there has been increasing interest in building search/index structures for performing similarity search over high-dimensional data, e.g., image dat ..."
Abstract - Cited by 641 (10 self) - Add to MetaCart
databases, document collections, time-series databases, and genome databases. Unfortunately, all known techniques for solving this problem fall prey to the \curse of dimensionality. " That is, the data structures scale poorly with data dimensionality; in fact, if the number of dimensions exceeds 10

Identification of Prokaryotic and Eukaryotic Signal Peptides and Prediction of Their Cleavage Sites

by Henrik Nielsen, Jacob Engelbrecht, Søren Brunak, Gunnar von Heijne , 1997
"... We have developed a new method for identification of signal peptides and their cleavage sites based on neural networks trained on separate sets of prokaryotic and eukaryotic sequences. The method performs significantly better than previous prediction schemes, and can easily be applied on genome-wide ..."
Abstract - Cited by 787 (17 self) - Add to MetaCart
We have developed a new method for identification of signal peptides and their cleavage sites based on neural networks trained on separate sets of prokaryotic and eukaryotic sequences. The method performs significantly better than previous prediction schemes, and can easily be applied on genome

The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/ genome databases

by Ron Caspi, Tomer Altman, Richard Billington, Kate Dreher, Hartmut Foerster, Carol A. Fulcher, Timothy A. Holl, Ingrid M. Keseler, Anamika Kothari, Aya Kubo, Markus Krummenacker, Mario Latendresse, Lukas A. Mueller, Quang Ong, Suzanne Paley, Pallavi Subhraveti, Daniel S. Weaver, Deepika Weerasinghe, Peifen Zhang, Peter D. Karp - Nucleic Acids Res , 2012
"... The MetaCyc database (MetaCyc.org) is a compre-hensive and freely accessible database describing metabolic pathways and enzymes from all domains of life. MetaCyc pathways are experimentally determined, mostly small-molecule metabolic pathways and are curated from the primary scien-tific literature. ..."
Abstract - Cited by 404 (15 self) - Add to MetaCart
. MetaCyc contains>2100 pathways derived from>37000 publications, and is the largest curated collection of metabolic pathways currently available. BioCyc (BioCyc.org) is a collec-tion of>3000 organism-specific Pathway/Genome Databases (PGDBs), each containing the full genome and predicted

Protein structure prediction and structural genomics

by David Baker, Andrej Sali - Science , 2001
"... Genome sequencing projects are producing linear amino acid sequences, but full understanding of the biological role of these proteins will require knowledge of their structure and function. Although experimental structure determination methods are providing high-resolution structure information abou ..."
Abstract - Cited by 332 (14 self) - Add to MetaCart
Genome sequencing projects are producing linear amino acid sequences, but full understanding of the biological role of these proteins will require knowledge of their structure and function. Although experimental structure determination methods are providing high-resolution structure information

Twilight Zone of Protein Sequence Alignments

by Burkhard Rost , 1999
"... l findings are applicable to automatic database searches. Keywords: alignment quality analysis/evolutionary conservation/ genome analysis/protein sequence alignment/sequence space hopping Introduction Protein sequence alignments in twilight zone Protein sequences fold into unique three-dimension ..."
Abstract - Cited by 397 (49 self) - Add to MetaCart
l findings are applicable to automatic database searches. Keywords: alignment quality analysis/evolutionary conservation/ genome analysis/protein sequence alignment/sequence space hopping Introduction Protein sequence alignments in twilight zone Protein sequences fold into unique three

Fast and reliable prediction of noncoding RNAs

by Stefan Washietl, Ivo L. Hofacker, Peter F. Stadler - Proc Natl Acad Sci USA
"... We report an efficient method to detect functional RNAs. The approach, which combines comparative sequence analysis and structure prediction, yields excellent results already for a small number of aligned sequences and is suitable for large scale-genomic screens. It consists of two basic components: ..."
Abstract - Cited by 335 (45 self) - Add to MetaCart
We report an efficient method to detect functional RNAs. The approach, which combines comparative sequence analysis and structure prediction, yields excellent results already for a small number of aligned sequences and is suitable for large scale-genomic screens. It consists of two basic components

PROSITE: a dictionary of sites and patterns in proteins. Nucleic Acids Res

by Amos Bairoch , 1992
"... PROSITE is a compilation of sites and patterns found in protein sequences; it can be used as a method of determining the function of uncharacterized proteins translated from genomic or cDNA sequences. In some cases the sequence of an unknown protein is too distantly related to any protein of known s ..."
Abstract - Cited by 312 (3 self) - Add to MetaCart
PROSITE is a compilation of sites and patterns found in protein sequences; it can be used as a method of determining the function of uncharacterized proteins translated from genomic or cDNA sequences. In some cases the sequence of an unknown protein is too distantly related to any protein of known
Next 10 →
Results 1 - 10 of 14,028
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University