• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

HAPLO: A program using the EM algorithm to estimate the frequencies of multi-site haplotypes. (1995)

by M E HAWLEY, K K KIDD
Venue:Journal of Heredity
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 92
Next 10 →

Bayesian Haplotype Inference for Multiple Linked Single-Nucleotide Polymorphisms .American

by Tianhua Niu, Zhaohui S. Qin, Xiping Xu, Jun S. Liu - Journal of Human Genetics , 2002
"... Haplotypes have gained increasing attention in the mapping of complex-disease genes, because of the abundance of single-nucleotide polymorphisms (SNPs) and the limited power of conventional single-locus analyses. It has been shown that haplotype-inference methods such as Clark’s algorithm, the expec ..."
Abstract - Cited by 230 (6 self) - Add to MetaCart
Haplotypes have gained increasing attention in the mapping of complex-disease genes, because of the abundance of single-nucleotide polymorphisms (SNPs) and the limited power of conventional single-locus analyses. It has been shown that haplotype-inference methods such as Clark’s algorithm, the expectation-maximization algorithm, and a coalescence-based iterative-sampling algorithm are fairly effective and economical alternatives to molecular-hap-lotyping methods. To contend with some weaknesses of the existing algorithms, we propose a new Monte Carlo approach. In particular, we first partition the whole haplotype into smaller segments. Then, we use the Gibbs sampler both to construct the partial haplotypes of each segment and to assemble all the segments together. Our algorithm can accurately and rapidly infer haplotypes for a large number of linked SNPs. By using a wide variety of real and simulated data sets, we demonstrate the advantages of our Bayesian algorithm, and we show that it is robust to the violation of Hardy-Weinberg equilibrium, to the presence of missing data, and to occurrences of recombination hotspots.

Haplotype reconstruction from genotype data using imperfect phylogeny

by Eran Halperin, Eleazar Eskin - Bioinformatics , 2004
"... Critical to the understanding of the genetic basis for complex diseases is the modeling of human variation. Most of this variation can be characterized by single nucleotide polymorphisms (SNPs) which are mutations at a single nucleotide position. To characterize the genetic variation between differe ..."
Abstract - Cited by 96 (8 self) - Add to MetaCart
Critical to the understanding of the genetic basis for complex diseases is the modeling of human variation. Most of this variation can be characterized by single nucleotide polymorphisms (SNPs) which are mutations at a single nucleotide position. To characterize the genetic variation between different people, we must determine an individual’s haplotype or which nucleotide base occurs at each position of these common SNPs for each chromosome. In this paper, we present results for a highly accurate method for haplotype resolution from genotype data. Our method leverages a new insight into the underlying structure of haplotypes which shows that SNPs are organized in highly correlated “blocks”. In a few recent studies (see Daly et al. (2001); Patil et al. (2001)), considerable parts of the human genome were partitioned into blocks, such that the majority of the sequenced genotypes have one of about four common haplotypes in each block. Our method partitions the SNPs into blocks and for each block, we predict the common haplotypes and each individual’s haplotype. We evaluate our method over biological data. Our method predicts the common haplotypes perfectly and has a very low error rate (less than ¢ ¡ over the data from Daly et al. (2001).) when taking into account the predictions for the uncommon haplotypes. Our method is extremely efficient compared to previous methods, such as PHASE and HAPLOTYPER. Its efficiency allows us to find the block partition of the haplotypes, to cope with missing data and to work with large data sets. Availability: The algorithm is available via webserver at
(Show Context)

Citation Context

...h of Clark (1990) presented more than 10 years ago, parsimony approaches (Gusfield, 2000, 2001; Lancia et al., 2001), maximum-likelihood methods (Excoffier and Slatkin, 1995; Fallin and Schork, 2000; =-=Hawley and Kidd, 1995-=-; Long et al., 1995), statistical methods such as PHASE (Stephens et al., 2001) and HAPLOTYPER (Niu et al., 2002), and perfect phylogeny-based approaches (Gusfield, 2002). Some of these previous metho...

Efficient reconstruction of haplotype structure via perfect phylogeny

by Eleazar Eskin, Eran Halperin, Richard M. Karp - Journal of Bioinformatics and Computational Biology , 2003
"... Each person’s genome contains two copies of each chromosome, one inherited from the father and the other from the mother. A person’s genotype specifies the pair of bases at each site, but does not specify which base occurs on which chromosome. The sequence of each chromosome separately is called a h ..."
Abstract - Cited by 75 (12 self) - Add to MetaCart
Each person’s genome contains two copies of each chromosome, one inherited from the father and the other from the mother. A person’s genotype specifies the pair of bases at each site, but does not specify which base occurs on which chromosome. The sequence of each chromosome separately is called a haplotype. The determination of the haplotypes within a population is essential for understanding genetic variation and the inheritance of complex diseases. The haplotype mapping project, a successor to the human genome project, seeks to determine the common haplotypes in the human population. Since experimental determination of a person’s genotype is less expensive than determining its component haplotypes, algorithms are required for computing haplotypes from genotypes. Two observations aid in this process: first, the human genome contains short blocks within which only a few different haplotypes occur; second, as suggested by Gusfield, it is reasonable to assume that the haplotypes observed within a block have evolved according to a perfect phylogeny, in which at most one mutation event has occurred at any site, and no recombination occurred at the given region. We present a simple and efficient polynomial-time algorithm for inferring haplotypes from the genotypes of a set of individuals assuming a perfect phylogeny. Using a reduction to 2-SAT we extend this algorithm to handle constraints that apply when we have genotypes from both parents and child. We also present a hardness result for the problem of removing the minimum number of individuals from a population to ensure that the genotypes of the remaining individuals are consistent with a perfect phylogeny. Our algorithms have been tested on real data and give biologically meaningful results. Our webserver
(Show Context)

Citation Context

...revious approaches to determining the haplotype information from genotype data. These methods include the parsimony approach of Clark [3] and related approaches [8, 9, 13], maximum likelihood methods =-=[6, 12, 14, 7]-=- and statistical methods such as PHASE [20]. These approaches use heuristics, but in practice they do not scale to data that contains more than 30 sites, while our algorithm could cope with large data...

Y: Haplotype inference by maximum parsimony

by Lusheng Wang, Ying Xu - Bioinformatics
"... Motivation: Haplotypes have been attracting increasing attention because of their importance in analysis of many fine-scale molecular-genetics data. Since direct sequencing of haplotype via experimental methods is both time-consuming and expensive, haplotype inference methods that infer haplotypes b ..."
Abstract - Cited by 64 (4 self) - Add to MetaCart
Motivation: Haplotypes have been attracting increasing attention because of their importance in analysis of many fine-scale molecular-genetics data. Since direct sequencing of haplotype via experimental methods is both time-consuming and expensive, haplotype inference methods that infer haplotypes based on genotype samples become attractive alternatives. Results: (1) We design and implement an algorithm for an important computational model of haplotype inference that has been suggested before in several places. The model finds a set of minimum number of haplotypes that explains the genotype samples. (2) Strong supports of this computational model are given based on the computational results on both real data and simulation data. (3) We also did some comparative study to show the strength and weakness of this computational model using our program. Availability: The software HAPAR is free for non-commercial uses. Available upon request (lwang@cs.cityu.edu.hk). Contact:
(Show Context)

Citation Context

...solved vectors. The second category is expectation-maximization (EM) method which looks for the set of haplotypes maximizing the posterior probability of given genotypes (Excoffier and Slatkin, 1995; =-=Hawley and Kidd, 1995-=-; Long et al., 1995; Chiano and Clayton, 1998). The third category contains several statistical methods based on Bayesian estimators and Gibbs sampling (Stephens et al., 2001; Niu et al., 2002; Zhang ...

Large scale reconstruction of haplotypes from genotype data

by Eleazar Eskin, Eran Halperin, Richard M. Karp - In Proc. RECOMB’03 , 2003
"... Critical to the understanding of the genetic basis for complex diseases is the modeling of human variation. Most of this variation can be characterized by single nucleotide polymorphisms (SNPs) which are mutations at a single nucleotide position. To characterize an individual’s variation, we must de ..."
Abstract - Cited by 45 (3 self) - Add to MetaCart
Critical to the understanding of the genetic basis for complex diseases is the modeling of human variation. Most of this variation can be characterized by single nucleotide polymorphisms (SNPs) which are mutations at a single nucleotide position. To characterize an individual’s variation, we must determine an individual’s haplotype or which nucleotide base occurs at each position of these common SNPs for each chromosome. In this paper, we present results for a highly accurate method for haplotype resolution from genotype data. Our method leverages a new insight into the underlying structure of haplotypes which shows that SNPs are organized in highly correlated “blocks”. The majority of individuals have one of about four common haplotypes in each block. Our method partitions the SNPs into blocks and for each block, we predict the common haplotypes and each individual’s haplotype. We evaluate our method over biological data. Our method predicts the common haplotypes perfectly and has a very low error rate (0.47%) when taking into account the predictions for the uncommon haplotypes. Our method is extremely efficient compared to previous methods, (a matter of seconds where previous methods needed hours). Its efficiency allows us to find the block partition of the haplotypes, to cope with missing data and to work with large data sets such as genotypes for thousands of SNPs for hundreds of individuals. The algorithm is available via webserver
(Show Context)

Citation Context

...ihood of a candidate solution for a population of genotypes is n� f(i1)f(i2) L = (2n) 2 (1) i=1 This model is consistent with previous maximum likelihood models for choosing haplotypes from genotypes =-=[7, 14, 17, 8, 26]-=-. The main caveat with these previous approaches is that they do not restrict the possible solutions to the ones that roughly fit the perfect phylogeny model. This results in far to many possible hapl...

A survey of computational methods for determining haplotypes

by Bjarni V. Halldórsson, Vineet Bafna, Nathan Edwards, Shibu Yooseph, Sorin Istrail - Lecture Notes in Computer Science (2983): Computational Methods for SNPs and Haplotype Inference , 2004
"... Abstract. It is widely anticipated that the study of variation in the human genome will provide a means of predicting risk of a variety of complex diseases. Single nucleotide polymorphisms (SNPs) are the most common form of genomic variation. Haplotypes have been suggested as one means for reducing ..."
Abstract - Cited by 38 (4 self) - Add to MetaCart
Abstract. It is widely anticipated that the study of variation in the human genome will provide a means of predicting risk of a variety of complex diseases. Single nucleotide polymorphisms (SNPs) are the most common form of genomic variation. Haplotypes have been suggested as one means for reducing the complexity of studying SNPs. In this paper we review some of the computational approaches that have been taking for determining haplotypes and suggest new approaches. 1
(Show Context)

Citation Context

...e population is related to the haplotype probabilities by the compact expression φ g = � h+ ¯ h=g φhφ¯ h where φh is the probability of haplotype h in the population. The maximum likelihood method of =-=[15,24,39,52,16,1]-=- estimates the haplotype probabilities φH =(φh,φ¯ h,...,φh ′) from observed genotype frequencies ˆ φG in n individuals. The approach assumes HWE and a uniform prior on the φh’s. The likelihood functio...

HAPLORE: a program for haplotype reconstruction in general pedigrees without recombination. Bioinformatics 2005;21:90

by Kui Zhang, Fengzhu Sun, Hongyu Zhao, Hongyu Zhao Ph. D, Key Words Genotype, Em Algorithm
"... Running Tile: haplotype reconstruction in general pedigrees ..."
Abstract - Cited by 23 (3 self) - Add to MetaCart
Running Tile: haplotype reconstruction in general pedigrees

A comprehensive literature review of haplotyping software and methods for use with unrelated individuals

by Rany M. Salem, et al. , 2005
"... ..."
Abstract - Cited by 14 (0 self) - Add to MetaCart
Abstract not found

Combinatorial problems arising in SNP and Haplotype Analysis

by Bjarni V. Halldórsson, Vineet Bafna, Nathan Edwards, Shibu Yooseph, Sorin Istrail - Discrete Mathematics and Theoretical Computer Science. Proceedings of DMTCS 2003 , 2003
"... Abstract. It is widely anticipated that the study of variation in the human genome will provide a means of predicting riskof a variety of complex diseases. This paper presents a number of algorithmic and combinatorial problems that arise when studying a very common form of genomic variation, single ..."
Abstract - Cited by 14 (2 self) - Add to MetaCart
Abstract. It is widely anticipated that the study of variation in the human genome will provide a means of predicting riskof a variety of complex diseases. This paper presents a number of algorithmic and combinatorial problems that arise when studying a very common form of genomic variation, single nucleotide polymorphisms (SNPs). We review recent results and present challenging open problems. 1
(Show Context)

Citation Context

...e population is related to the haplotype probabilities by the compact expression φ g = � h+ ¯ h=g φhφ¯ h where φh is the probability of haplotype h in the population. The maximum likelihood method of =-=[17,27,41,60]-=- estimates the haplotype probabilities φH =(φh,φ¯ h,...,φh ′) fromobserved genotype frequencies ˆ φG in n individuals. The approach assumes HWE and a uniform prior on the φh’s. The likelihood function...

Islands of Tractability for Parsimony Haplotyping

by Roded Sharan, Bjarni V. Halldórsson, Sorin Istrail - Proc. IEEE Computational Systems Bioinformatics Conf , 2005
"... We study the parsimony approach to haplotype inference, which calls for finding a set of haplotypes of minimum cardinality that explains an input set of genotypes. We prove that the problem is APX-hard even in very restricted cases. On the positive side, we identify islands of tractability for the p ..."
Abstract - Cited by 13 (2 self) - Add to MetaCart
We study the parsimony approach to haplotype inference, which calls for finding a set of haplotypes of minimum cardinality that explains an input set of genotypes. We prove that the problem is APX-hard even in very restricted cases. On the positive side, we identify islands of tractability for the problem, by focusing on instances with specific structure of haplotype sharing among the input genotypes. We exploit the structure of those instance to give polynomial and constant-approximation algorithms to the problem. We also show that the general parsimony haplotyping problem is fixed parameter tractable.
(Show Context)

Citation Context

...the literature to resolve haplotypes from genotype data. These methods include the seminal approach of Clark [4] and related parsimony approaches [9], [10], [12], maximum likelihood methods [5], [6], =-=[16]-=-, [21], Bayesian methods . R. Sharan is with the School of Computer Science, Tel-Aviv University, Tel-Aviv 69978, Israel. E-mail: roded@tau.ac.il. All correspondence should be addressed to this author...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University