Abstract: The biological sciences are undergoing an explosion in the amount of available data. New data analysis methods are needed to deal with the data. We present work using KDD to analyse data from mutant phenotype growth experiments with the yeast S. cerevisiae to predict novel gene functions. The analysis of the data presented a number of challenges: multi-class labels, a large number of sparsely populated classes, the need to learn a set of accurate rules (not a complete classification), and a...
Cited by:
Learning Ontology-Aware Classifiers - Jun Zhang Doina
Under consideration for publication in Knowledge and.. - Systems Learning Accurate
Learning Classifiers Using Hierarchically Structured Class.. - Wu, Zhang, Honavar (2005)
Similar documents (at the sentence level):
78.6%: Knowledge Discovery in Multi-Label Phenotype Data - Clare, King (2001)
37.6%: Machine Learning of Functional Class From Phenotype Data - Clare, King (2002)
Active bibliography (related documents):
0.5: The Utility of Different Representations of Protein.. - King, Karwath, Clare, .. (2001)
0.4: Accurate Prediction of Protein Functional Class from.. - King, Karwath, Clare, .. (2000)
0.3: Prediction of Glycosylation Across the Human Proteome and the.. - Gupta, Brunak (2002)
Similar documents based on text:
0.4: Systematic Functional Analysis of the Yeast Genome - Oliver, Winson, Kell, Baganz (1998)
0.3: Combining Inductive Logic Programming, Active Learning and.. - Muggleton, al.
0.3: Random Exploration of the Kluyveromyces lactis genome.. - Ozier-Kalogeropoulos.. (1998)
Related documents from co-citation:
4: Multi-label text classification with a mixture model trained by em - McCallum - 1999
4: Text categorization with Support Vector Machines: Learning with many relevant fe.. - Joachims - 1998
3: Scene-centered description from spatial envelope properties - Oliva, Torralba - 2002
BibTeX entry:
A. Clare and R.D. King. Knowledge discovery in multi-label phenotype data. In L. De Raedt and A. Siebes, editors, 5th European Conference on Principles of Data Mining and Knowledge Discovery (PKDD2001), volume 2168 of Lecture Notes in Artificial Intelligence, pages 42-53. Springer-Verlag, 2001. http://citeseer.ist.psu.edu/clare01knowledge.html
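@inproceedings{ clare01knowledge,
author = "Amanda Clare and Ross D. King",
title = "Knowledge Discovery in Multi-label Phenotype Data",
editor = "L. De Raedt and A. Siebes",
booktitle = "5th European Conference on Principles of Data Mining and Knowledge Discovery (PKDD2001)",
series = "Lecture Notes in Computer Science",
volume = "2168",
pages = "42--53",
publisher = "Springer-Verlag",
year = "2001",
url = "http://citeseer.ist.psu.edu/clare01knowledge.html" }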
Citations (may not include all citations):
2177: programs for Machine Learning - Quinlan - 1993
696: UCI repository of machine learning databases - Blake, Merz - 1998
546: An introduction to the bootstrap - Efron, Tibshirani - 1993
272: Cluster analysis and display of genome-wide expression patte.. - Eisen, Spellman et al. - 1998
164: A study of cross-validation and bootstrap for accuracy estim.. - Kohavi - 1995
139: Exploring the metabolic and genetic control of gene expressi.. - DeRisi, Iyer et al. - 1997
135: Hierarchically classifying documents using very few words - Koller, Sahami - 1997
105: Estimating probabilities: A crucial task in machine learning - Cestnik - 1990
86: Knowledge-based analysis of microarray gene expression data .. - Brown, Grundy et al. - 2000
76: BoosTexter: A boosting-based system for text categorization - Schapire, Singer - 2000
73: The sequence of the human genome - Venter - 2001
61: Improving text classification by shrinkage in a hierarchy of.. - McCallum, Rosenfeld et al. - 1998
38: Initial sequencing and analysis of the human genome - genome, consortium - 2001
23: Analysis of gene expression data using self-organizing maps - Toronen, Kolehmainen et al. - 1999
13: MIPS: a database for protein sequences and complete genomes - Mewes, Heumann et al. - 1999
13: Multi-label text classification with a mixture model trained.. - McCallum - 1999
7: Analysis of the genome sequence of the flowering plant arabi.. - genome - 2000
6: Systems for categorizing functions of gene products - Riley - 1998
5: Genome scale prediction of protein functional class from seq.. - King, Karwath et al. - 2000
5: Proteomics: quantitative and physical mapping of cellular pr.. - Blackstock, Weir - 1999
4: the optimization of classes for the assignment of unidentifi.. - Kell, King - 2000
4: A functional genomics strategy that uses metabolome data to .. - Raamsdonk, Teusink et al. - 2001
3: Prediction of enzyme classification from protein sequence wi.. - Jardins, Karp et al. - 1997
3: Functional classes in the three domains of life - Andrade, Ouzounis et al. - 1999
2: Out print but available httpwww - Spiegelhalter, Machine et al. - 1994
2: TRIPLES: a database of gene function in S - Kumar, Cheung et al. - 2000
2: an essential gene required for DNA replication in Saccharomy.. - Sugimoto, Sakamoto et al. - 1995
2: A new approach for isolating cell wall mutants in Saccharomy.. - Ram, Wolters et al. - 1994
2: A network approach to the systematic analysis of yeast gene .. - Oliver - 1996
2: Genome analysis using clusters of orthologous groups - Koonin, Tatusov et al. - 1998
1: Significance level based classification with multiple trees - Karalic, Pirnat - 1991
1: Learning document classification from large text hierarchy - Mladenic, Grobelnik - 1998
1: Large scale identification of genes involved in cell surface.. - Lussier, White et al. - 1997
Documents on the same site (http://users.aber.ac.uk/ajc99/research.html):
Knowledge Discovery in Multi-Label Phenotype Data - Clare, King (2001)
The Utility of Different Representations of Protein.. - King, Karwath, Clare, .. (2000)
Machine Learning of Functional Class From Phenotype Data - Clare, King (2002)