• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Comparative testing of DNA segmentation algorithms using benchmark simulations, (2010)

by E Elhaik, D Graur, K Josic
Venue:Mol. Biol. Evol.
Add To MetaCart

Tools

Sorted by:
Results 1 - 3 of 3

Multiscale DNA partitioning: statistical evidence for segments. Bioinformatics

by Andreas Futschik , Thomas Hotz , Axel Munk , Hannes Sieling , 2014
"... Abstract Motivation: DNA segmentation, i.e. the partitioning of DNA in compositionally homogeneous segments, is a basic task in bioinformatics. Different algorithms have been proposed for various partitioning criteria such as GC content, local ancestry in population genetics, or copy number variati ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
Abstract Motivation: DNA segmentation, i.e. the partitioning of DNA in compositionally homogeneous segments, is a basic task in bioinformatics. Different algorithms have been proposed for various partitioning criteria such as GC content, local ancestry in population genetics, or copy number variation. A critical component of any such method is the choice of an appropriate number of segments. Some methods use model selection criteria, and do not provide a suitable error control. Other methods that are based on simulating a statistic under a null model provide suitable error control only if the correct null model is chosen. Results: Here, we focus on partitioning with respect to GC content and propose a new approach that provides statistical error control: as in statistical hypothesis testing, it guarantees with a user specified probability 1−α that the number of identified segments does not exceed the number of actually present segments. The method is based on a statistical multiscale criterion, rendering this as segmentation method which searches segments of any length (on all scales), simultaneously. It is also very accurate in localizing segments: under bench-mark scenarios, our approach leads to a segmentation that is more accurate than the approaches discussed in the comparative review of Availability: Our method is implemented in function smuceR of the R-package stepR, available from
(Show Context)

Citation Context

... approach that provides statistical error control: as in statistical hypothesis testing, it guarantees with a user specified probability 1−α that the number of identified segments does not exceed the number of actually present segments. The method is based on a statistical multiscale criterion, rendering this as segmentation method which searches segments of any length (on all scales), simultaneously. It is also very accurate in localizing segments: under bench-mark scenarios, our approach leads to a segmentation that is more accurate than the approaches discussed in the comparative review of [18]. In our real data examples, we find segments that often correspond well to features taken from standard UCSC genome annotation tracks. Availability: Our method is implemented in function smuceR of the R-package stepR, available from http://www.stochastik.math.uni-goettingen.de/smuce. 2 MULTISCALE DNA PARTITIONING 1. Introduction It has been observed a long time ago [30] that DNA sequences are not composed homogeneously and that bases fluctuate in their frequency. These inhomogeneities often have an evolutionary or a functional interpretation, and can be relevant for the subsequent analysis of...

3 GC3 Biology in Eukaryotes and Prokaryotes

by Eran Elhaik, Tatiana Tatarinova
"... In this chapter we describe the distribution of Guanine and Cytosine (GC) content in the third codon position (GC3) distributions in different species, analyze evolutionary trends and discuss differences between genes and organisms with distinct GC3 levels. We scrutinize previously published theoret ..."
Abstract - Cited by 2 (1 self) - Add to MetaCart
In this chapter we describe the distribution of Guanine and Cytosine (GC) content in the third codon position (GC3) distributions in different species, analyze evolutionary trends and discuss differences between genes and organisms with distinct GC3 levels. We scrutinize previously published theoretical frameworks and construct a unified view of GC3
(Show Context)

Citation Context

...a. Indeed, further analyses showed thatsthe descriptions of the human or vertebrate genomes as mosaics of isochores are erroneouss(Cohen et al. 2005; Elsik et al. 2009; Elhaik, Graur, and Josić 2010; =-=Elhaik et al. 2010-=-). Due tosthe lack of predictive power of GC3, new genomic studies scan the entire genomic structureswww.intechopen.com GC3 Biology in Eukaryotes and Prokaryotes 63susing automatic algorithms rather t...

ublished by Elsevier B.V. on behalf of the Research Network of Computational Contents

by unknown authors
"... ..."
Abstract - Add to MetaCart
Abstract not found
(Show Context)

Citation Context

...Many early statistical segmentationmethods were reviewed in [69]. Elhaik et al. reviewed the performance of seven recent algorithms by segmenting human chromosome 1 based on variability of GC content =-=[70]-=-. 3. Changept analysis In the remainder of this mini-review, we focus on the changept program developed by Keith et al. [43,44]. This is a Bayesian multiple change-point algorithm capable of simultane...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University