• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 842
Next 10 →

Non-contiguous sequence pattern queries

by Nikos Mamoulis, Man Lung Yiu - In Proc. of Extending Database Technology (EDBT , 2004
"... Abstract. Non-contiguous subsequence pattern queries search for symbol instances in a long sequence that satisfy some soft temporal constraints. In this paper, we propose a methodology that indexes long sequences, in order to efficiently process such queries. The sequence data are decomposed into ta ..."
Abstract - Cited by 4 (1 self) - Add to MetaCart
Abstract. Non-contiguous subsequence pattern queries search for symbol instances in a long sequence that satisfy some soft temporal constraints. In this paper, we propose a methodology that indexes long sequences, in order to efficiently process such queries. The sequence data are decomposed

Tandem repeats finder: a program to analyze DNA sequences

by Gary Benson , 1999
"... A tandem repeat in DNA is two or more contiguous, approximate copies of a pattern of nucleotides. Tandem repeats have been shown to cause human disease, may play a variety of regulatory and evolutionary roles and are important laboratory and analytic tools. Extensive knowledge about pattern size, co ..."
Abstract - Cited by 961 (9 self) - Add to MetaCart
A tandem repeat in DNA is two or more contiguous, approximate copies of a pattern of nucleotides. Tandem repeats have been shown to cause human disease, may play a variety of regulatory and evolutionary roles and are important laboratory and analytic tools. Extensive knowledge about pattern size

Text Classification using String Kernels

by Huma Lodhi, Craig Saunders, John Shawe-Taylor, Nello Cristianini, Chris Watkins
"... We propose a novel approach for categorizing text documents based on the use of a special kernel. The kernel is an inner product in the feature space generated by all subsequences of length k. A subsequence is any ordered sequence of k characters occurring in the text though not necessarily contiguo ..."
Abstract - Cited by 495 (7 self) - Add to MetaCart
We propose a novel approach for categorizing text documents based on the use of a special kernel. The kernel is an inner product in the feature space generated by all subsequences of length k. A subsequence is any ordered sequence of k characters occurring in the text though not necessarily

Davenport-Schinzel Sequences and Their Geometric Applications

by Pankaj K. Agarwal , Micha Sharir , 1998
"... An (n; s) Davenport-Schinzel sequence, for positive integers n and s, is a sequence composed of n distinct symbols with the properties that no two adjacent elements are equal, and that it does not contain, as a (possibly non-contiguous) subsequence, any alternation a \Delta \Delta \Delta b \Delta \ ..."
Abstract - Cited by 439 (105 self) - Add to MetaCart
An (n; s) Davenport-Schinzel sequence, for positive integers n and s, is a sequence composed of n distinct symbols with the properties that no two adjacent elements are equal, and that it does not contain, as a (possibly non-contiguous) subsequence, any alternation a \Delta \Delta \Delta b \Delta

The diploid genome sequence of an individual human

by Samuel Levy, Granger Sutton, Pauline C. Ng, Lars Feuk, Aaron L. Halpern, Brian P. Walenz, Nelson Axelrod, Jiaqi Huang, Ewen F. Kirkness, Gennady Denisov, Yuan Lin, Jeffrey R. Macdonald, Andy Wing, Chun Pang, Mary Shago, Timothy B. Stockwell, Alexia Tsiamouri, Vineet Bafna, Vikas Bansal, Saul A. Kravitz, Dana A. Busam, Karen Y. Beeson, Tina C. Mcintosh, Karin A. Remington, Josep F. Abril, John Gill, Jon Borman, Yu-hui Rogers, Marvin E. Frazier, Stephen W. Scherer, Robert L. Strausberg, J. Craig Venter - PLoS Biol
"... Presented here is a genome sequence of an individual human. It was produced from;32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given r ..."
Abstract - Cited by 293 (6 self) - Add to MetaCart
Presented here is a genome sequence of an individual human. It was produced from;32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given

1 Non-contiguous SCHEMA protein recombination

by Matthew A. Smith, Frances H. Arnold
"... SCHEMA is a method of designing protein recombination libraries that contain a large fraction of functional proteins with a high degree of mutational diversity. In the previous chapter we illustrated the method for designing libraries by swapping contiguous sequence elements. Here, we introduce the ..."
Abstract - Add to MetaCart
SCHEMA is a method of designing protein recombination libraries that contain a large fraction of functional proteins with a high degree of mutational diversity. In the previous chapter we illustrated the method for designing libraries by swapping contiguous sequence elements. Here, we introduce

Permutation pseudographs and contiguity

by Catherine Greenhill, Svante Janson, Jeong Han Kim, Nicholas C. Wormald - COMBIN. PROBAB. COMPUT , 2001
"... The space of permutation pseudographs is a probabilistic model of 2-regular pseudographs on n vertices, where a pseudograph is produced by choosing a permutation σ of {1, 2,..., n} uniformly at random and taking the n edges {i, σ(i)}. We prove several contiguity results involving permutation pseudog ..."
Abstract - Cited by 10 (3 self) - Add to MetaCart
pseudographs (contiguity is a kind of asymptotic equivalence of sequences of probability spaces). Namely, we show that a random 4-regular pseudograph is contiguous with the sum of two permutation pseudographs, the sum of a permutation pseudograph and a random Hamilton cycle, and the sum of a permutation

Isolation and direct complete nucleotide determination of entire genes. Characterization of a gene coding for 16S ribosomal RNA. Nucleic Acids Res

by Ulrike Edwards, Till Rogall, Helmut Blocker, Monica Emde, Erik C. Bottger , 1989
"... Using a set of synthetic oligonucleotides homologous to broadly conserved sequences in-vitro amplification via the polymerase chain reaction followed by direct sequencing results in almost complete nucleotide determination of a gene coding for 16S ribosomal RNA. As a model system the nucleotide sequ ..."
Abstract - Cited by 186 (2 self) - Add to MetaCart
sequence of the 16S rRNA gene of M.kansasii was determined and found to be 98.7 % homologous to that of M.bovis BCG. This is the first report on a contiguous sequence information of an entire amplified gene spanning 1.5 fcb without any subcloning procedures. INTRODTJCTIOH The analysis of nucleic acid

Non-contiguous word sequences for information retrieval

by Antoine Doucet, Helena Ahonen-myka - In Proceedings of the 42nd annual meeting of the Association for Computational Lingustics, Workshop on Multiword Expressions: Integrating Processing , 2004
"... The growing amount of textual information available electronically has increased the need for high performance retrieval. The use of phrases was long seen as a natural way to improve retrieval performance over the common document models that ignore the sequential aspect of word occurrences in docume ..."
Abstract - Cited by 9 (3 self) - Add to MetaCart
in documents, considering them as “bags of words”. However, both statistical and syntactical phrases showed disappointing results for large document collections. In this paper we present a recent type of multi-word expressions in the form of Maximal Frequent Sequences (Ahonen-Myka, 1999). Mined phrases rather

ABACAS: algorithm based automatic contiguation of assembled sequences

by Sammy Assefa, Thomas D. Otto, Thomas M. Keane, Chris Newbold, Matthew Berriman - Bioinformatics , 2009
"... There is increasing interest in sequencing the genomes of strains that are closely related to existing finished reference genomes. Recently a number of de novo and mapping based assemblers have been developed to produce high quality draft genomes from second generation sequencing technology reads. H ..."
Abstract - Cited by 3 (1 self) - Add to MetaCart
There is increasing interest in sequencing the genomes of strains that are closely related to existing finished reference genomes. Recently a number of de novo and mapping based assemblers have been developed to produce high quality draft genomes from second generation sequencing technology reads
Next 10 →
Results 1 - 10 of 842
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University