Results 1 - 10
of
636
Ultrafast and memoryefficient alignment of short DNA sequences to the human genome.
, 2009
"... ..."
Sequencing technologies - the next generation.
- Nature Reviews
, 2010
"... Abstract | Demand has never been greater for revolutionary technologies that deliver fast, inexpensive and accurate genome information. This challenge has catalysed the development of next-generation sequencing (NGS) technologies. The inexpensive production of large volumes of sequence data is the ..."
Abstract
-
Cited by 286 (2 self)
- Add to MetaCart
Abstract | Demand has never been greater for revolutionary technologies that deliver fast, inexpensive and accurate genome information. This challenge has catalysed the development of next-generation sequencing (NGS) technologies. The inexpensive production of large volumes of sequence data is the primary advantage over conventional methods. Here, I present a technical review of template preparation, sequencing and imaging, genome alignment and assembly approaches, and recent advances in current and near-term commercially available NGS instruments. I also outline the broad range of applications for NGS technologies, in addition to providing guidelines for platform selection to address biological questions of interest.
BreakDancer: An algorithm for high resolution mapping of genomic structural variation
- Nat. Methods
, 2009
"... structural variation ..."
Mouse genomic variation and its effect on phenotypes and gene regulation.
- Nature,
, 2011
"... We report genome sequences of 17 inbred strains of laboratory mice and identify almost ten times more variants than previously known. We use these genomes to explore the phylogenetic history of the laboratory mouse and to examine the functional consequences of allele-specific variation on transcrip ..."
Abstract
-
Cited by 117 (4 self)
- Add to MetaCart
We report genome sequences of 17 inbred strains of laboratory mice and identify almost ten times more variants than previously known. We use these genomes to explore the phylogenetic history of the laboratory mouse and to examine the functional consequences of allele-specific variation on transcript abundance, revealing that at least 12% of transcripts show a significant tissue-specific expression bias. By identifying candidate functional variants at 718 quantitative trait loci we show that the molecular nature of functional variants and their position relative to genes vary according to the effect size of the locus. These sequences provide a starting point for a new era in the functional analysis of a key model organism.
The sequence and de novo assembly of the giant panda genome.
- Nature
, 2010
"... Using next-generation sequencing technology alone, we have successfully generated and assembled a draft sequence of the giant panda genome. The assembled contigs (2.25 gigabases (Gb)) cover approximately 94% of the whole genome, and the remaining gaps (0.05 Gb) seem to contain carnivore-specific re ..."
Abstract
-
Cited by 116 (4 self)
- Add to MetaCart
Using next-generation sequencing technology alone, we have successfully generated and assembled a draft sequence of the giant panda genome. The assembled contigs (2.25 gigabases (Gb)) cover approximately 94% of the whole genome, and the remaining gaps (0.05 Gb) seem to contain carnivore-specific repeats and tandem repeats. Comparisons with the dog and human showed that the panda genome has a lower divergence rate. The assessment of panda genes potentially underlying some of its unique traits indicated that its bamboo diet might be more dependent on its gut microbiome than its own genetic composition. We also identified more than 2.7 million heterozygous single nucleotide polymorphisms in the diploid genome. Our data and analyses provide a foundation for promoting mammalian genetic research, and demonstrate the feasibility for using next-generation sequencing technologies for accurate, cost-effective and rapid de novo assembly of large eukaryotic genomes. The giant panda, Ailuropoda melanoleura, is at high risk of extinction because of human population expansion and destruction of its habitat. The latest molecular census of its population size, using faecal samples and nine microsatellite loci, provided an estimate of only 2,500-3,000 individuals, which were confined to several small mountain habitats in Western China 1 . The giant panda has several unusual biological and behavioural traits, including a famously restricted diet, primarily made up of bamboo, and a very low fecundity rate. Moreover, the panda holds a unique place in evolution, and there has been continuing controversy about its phylogenetic position 2 . At present, there is very little genetic information for the panda, which is an essential tool for detailed understanding of the biology of this organism. A major limitation in obtaining extensive genetic data is the prohibitive costs associated with sequencing and assembling large eukaryotic *These authors contributed equally to this work.
Uncovering the roles of rare variants in common disease through whole-genome sequencing. Nat Rev Genet.
, 2010
"... ..."
The European Bioinformatics Institute’s data resources. Nucleic Acids Res 2010;38:D17–25
"... ABSTRACT The wide uptake of next-generation sequencing and other ultra-high throughput technologies by life scientists with a diverse range of interests, ..."
Abstract
-
Cited by 91 (7 self)
- Add to MetaCart
(Show Context)
ABSTRACT The wide uptake of next-generation sequencing and other ultra-high throughput technologies by life scientists with a diverse range of interests,
Improved base calling for the Illumina Genome Analyzer using
, 2009
"... Software ..."
(Show Context)
Summarizing and correcting the GC content bias in high-throughput sequencing
- NUCLEIC ACIDS RES
, 2012
"... GC content bias describes the dependence between fragment count (read coverage) and GC content found in Illumina sequencing data. This bias can dominate the signal of interest for analyses that focus on measuring fragment abundance with-in a genome, such as copy number estimation (DNA-seq). The bias ..."
Abstract
-
Cited by 77 (0 self)
- Add to MetaCart
GC content bias describes the dependence between fragment count (read coverage) and GC content found in Illumina sequencing data. This bias can dominate the signal of interest for analyses that focus on measuring fragment abundance with-in a genome, such as copy number estimation (DNA-seq). The bias is not consistent between samples; and there is no consensus as to the best methods to remove it in a single sample. We analyze regularities in the GC bias patterns, and find a compact description for this unimodal curve family. It is the GC content of the full DNA fragment, not only the sequenced read, that most influences fragment count. This GC effect is unimodal: both GC-rich fragments and AT-rich frag-ments are underrepresented in the sequencing results. This empirical evidence strengthens the hypothesis that PCR is the most important cause of the GC bias. We propose a model that produces predictions at the base pair level, allowing strand-specific GC-effect correction regardless of the downstream smoothing or binning. These GC modeling considerations can inform other high-throughput sequencing analyses such as ChIP-seq and RNA-seq.
Shrimp: accurate mapping of short color-space reads
- PLoS Comput. Biol
, 2009
"... The development of Next Generation Sequencing technologies, capable of sequencing hundreds of millions of short reads (25–70 bp each) in a single run, is opening the door to population genomic studies of non-model species. In this paper we present SHRiMP- the SHort Read Mapping Package: a set of alg ..."
Abstract
-
Cited by 75 (3 self)
- Add to MetaCart
(Show Context)
The development of Next Generation Sequencing technologies, capable of sequencing hundreds of millions of short reads (25–70 bp each) in a single run, is opening the door to population genomic studies of non-model species. In this paper we present SHRiMP- the SHort Read Mapping Package: a set of algorithms and methods to map short reads to a genome, even in the presence of a large amount of polymorphism. Our method is based upon a fast read mapping technique, separate thorough alignment methods for regular letter-space as well as AB SOLiD (color-space) reads, and a statistical model for false positive hits. We use SHRiMP to map reads from a newly sequenced Ciona savignyi individual to the reference genome. We demonstrate that SHRiMP can accurately map reads to this highly polymorphic genome, while confirming high heterozygosity of C. savignyi in this second individual. SHRiMP is freely available at