• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Limma: linear models for microarray data. In Bioinformatics and Computational Biology Solutions using R (2005)

by G K Smyth
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 774
Next 10 →

Linear models and empirical bayes methods for assessing differential expression in microarray experiments.

by Gordon K Smyth , Gordon K Smyth - Stat. Appl. Genet. Mol. Biol. , 2004
"... Abstract The problem of identifying differentially expressed genes in designed microarray experiments is considered. Lonnstedt and Speed (2002) derived an expression for the posterior odds of differential expression in a replicated two-color experiment using a simple hierarchical parametric model. ..."
Abstract - Cited by 1321 (24 self) - Add to MetaCart
Abstract The problem of identifying differentially expressed genes in designed microarray experiments is considered. Lonnstedt and Speed (2002) derived an expression for the posterior odds of differential expression in a replicated two-color experiment using a simple hierarchical parametric model. The purpose of this paper is to develop the hierarchical model of Lonnstedt and Speed (2002) into a practical approach for general microarray experiments with arbitrary numbers of treatments and RNA samples. The model is reset in the context of general linear models with arbitrary coefficients and contrasts of interest. The approach applies equally well to both single channel and two color microarray experiments. Consistent, closed form estimators are derived for the hyperparameters in the model. The estimators proposed have robust behavior even for small numbers of arrays and allow for incomplete data arising from spot filtering or spot quality weights. The posterior odds statistic is reformulated in terms of a moderated t-statistic in which posterior residual standard deviations are used in place of ordinary standard deviations. The empirical Bayes approach is equivalent to shrinkage of the estimated sample variances towards a pooled estimate, resulting in far more stable inference when the number of arrays is small. The use of moderated t-statistics has the advantage over the posterior odds that the number of hyperparameters which need to estimated is reduced; in particular, knowledge of the non-null prior for the fold changes are not required. The moderated t-statistic is shown to follow a t-distribution with augmented degrees of freedom. The moderated t inferential approach extends to accommodate tests of composite null hypotheses through the use of moderated F-statistics. The performance of the methods is demonstrated in a simulation study. Results are presented for two publicly available data sets.
(Show Context)

Citation Context

...proved test statistics. In many gene discovery experiments for which microarrays are used the primary aim is to rank the genes in order of evidence against H0 rather than to assign absolute p-values (=-=Smyth et al, 2003-=-). This is because only a limited number of genes may be followed up for further study regardless of the number which are significant. Even when the above distributional assumptions fail for a given d...

A novel signaling pathway impact analysis

by Adi Laurentiu Tarca, Sorin Draghici, Purvesh Khatri, Sonia S. Hassan, Pooja Mittal, Jung-sun Kim, Chong Jai Kim, Juan Pedro Kusanovic, Roberto Romero - Bioinformatics , 2009
"... Motivation: Gene expression class comparison studies may identify hundreds or thousands of genes as differentially expressed (DE) between sample groups. Gaining biological insight from the result of such experiments can be approached, for instance, by identifying the signaling pathways impacted by t ..."
Abstract - Cited by 85 (1 self) - Add to MetaCart
Motivation: Gene expression class comparison studies may identify hundreds or thousands of genes as differentially expressed (DE) between sample groups. Gaining biological insight from the result of such experiments can be approached, for instance, by identifying the signaling pathways impacted by the observed changes. Most of the existing pathway analysis methods focus on either the number of DE genes observed in a given pathway (enrichment analysis methods), or on the correlation between the pathway genes and the class of the samples (functional class scoring methods). Both approaches treat the pathways as simple sets of genes, disregarding the complex gene interactions that these pathways are built to describe. Results: We describe a novel Signaling Pathway Impact Analysis (SPIA) that combines the evidence obtained from the classical enrichment analysis with a novel type of evidence, which measures the actual perturbation on a given pathway under a given condition. A bootstrap procedure is used to assess the significance of the observed total pathway perturbation. Using simulations we show that the evidence derived from perturbations is independent of the pathway enrichment evidence. This allows us to calculate a global pathway significance p-value, which combines the enrichment and perturbation p-values. We illustrate the capabilities of the novel method on 4 real data sets. The results obtained on these data show that SPIA has better specificity and more sensitivity than several widely used pathway analysis methods. Availability: SPIA was implemented as an R package which is available at
(Show Context)

Citation Context

...and quantile normalization (Irizarry et al., 2003), microarray expression data from all three experiments were analyzed in the same way. Differential expression was inferred using a moderated t-test (=-=Smyth, 2005-=-) and a false discovery rate (FDR) adjustment of the resulting p-values. For the Colorectal cancer and Vessels datasets, genes were considered as DE provided they had a FDR corrected p-value less than...

limma powers differential expression analyses for RNA-sequencing and microarray studies

by Matthew E. Ritchie, Belinda Phipson, Di Wu, Yifang Hu, Charity W. Law, Wei Shi, Gordon K. Smyth - Nucleic Acids Res , 2015
"... RNA-sequencing and microarray studies (Article begins on next page) The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters. ..."
Abstract - Cited by 32 (4 self) - Add to MetaCart
RNA-sequencing and microarray studies (Article begins on next page) The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters.
(Show Context)

Citation Context

... from beginning to end in a flexible and statistically rigorous way. The limma package is a core component of Bioconductor, an R-based open-source software development project in statistical genomics =-=(1,2)-=-. It has proven a popular choice for the analysis of data from experiments involvingmicroarrays (3,4), high-throughput polymerase chain reaction (PCR) (5), protein arrays (6) and other platforms. The ...

waves: improved detection of copy number variation from microarray-based comparative genomic hybridization

by John C Marioni, Natalie P Thorne, Tomas Fitzgerald, Richard Redon, Heike Fiegler, T Daniel Andrews, Barbara E Stranger, Andrew G Lynch, Emmanouil T Dermitzakis, Nigel P Carter, Simon Tavaré, Matthew E Hurles - R Redon, H Fiegler, T D Andrews, B E Stranger, A G Lynch, E T Dermitzakis, N P Carter, S Tavaré, and M E Hurles. Breaking the , 2007
"... This is an open access article distributed under the terms of the Creative Commons Attribution License ..."
Abstract - Cited by 26 (0 self) - Add to MetaCart
This is an open access article distributed under the terms of the Creative Commons Attribution License
(Show Context)

Citation Context

...ng the loess curve After reading the data into R [16] we fitted the loess curve to each sample/chromosome using a modified version of the loessFit function available within the limma software library =-=[17]-=-. Since it is widely known [18] that more variation in log 2 ratios is seen at the telomeres of chromosomes than at other locations on a chromosome, we added 50 simulated data points at either end of ...

Optimizing the noise versus bias trade-off for Illumina whole genome expression BeadChips. Nucleic acids research. 2010; 38:e204. [PubMed: 20929874

by Wei Shi, Alicia Oshlack, Gordon K. Smyth
"... Five strategies for pre-processing intensities from Illumina expression BeadChips are assessed from the point of view of precision and bias. The strategies include a popular variance stabilizing transformation and model-based background cor-rections that either use or ignore the control probes. Four ..."
Abstract - Cited by 22 (8 self) - Add to MetaCart
Five strategies for pre-processing intensities from Illumina expression BeadChips are assessed from the point of view of precision and bias. The strategies include a popular variance stabilizing transformation and model-based background cor-rections that either use or ignore the control probes. Four calibration data sets are used to evaluate precision, bias and false discovery rate (FDR). The original algorithms are shown to have operating characteristics that are not easily com-parable. Some tend to minimize noise while others minimize bias. Each original algorithm is shown to have an innate intensity offset, by which unlogged intensities are bounded away from zero, and the size of this offset determines its position on the noise–bias spectrum. By adding extra offsets, a continuum of related algorithms with different noise–bias trade-offs is generated, allowing direct comparison of the performance of the strategies on equivalent terms. Adding a positive offset is shown to decrease the FDR of each original algo-rithm. The potential of each strategy to generate an algorithm with an optimal noise–bias trade-off is explored by finding the offset that minimizes its FDR. The use of control probes as part of the back-ground correction and normalization strategy is shown to achieve the lowest FDR for a given bias.
(Show Context)

Citation Context

...ground correction or normalization. Output columns included detection P-values scores and bead-level standard deviations for each array. The input function read.ilmn in the Bioconductor package limma =-=(22)-=- was used to read the data into R. The neqc function in limma implements the neqc pre-processing strategy. Data pre-processing by the logq and neq strategies was performed using the backgroundCorrect ...

Tissue-specific splicing factor gene expression signatures

by Ana Rita Grosso, Anita Q. Gomes, Nuno L. Barbosa-morais, Ra Caldeira, Natalie P. Thorne, Godfrey Grech, Marieke Von Lindern, Maria Carmo-fonseca - Nucleic Acids Res , 2008
"... signatures ..."
Abstract - Cited by 20 (0 self) - Add to MetaCart
signatures
(Show Context)

Citation Context

...nally, the differentially expressed splicingrelated genes for each T1 and T2 differentiation stage were selected using linear models and empirical Bayes methods (40) as implemented in ‘limma’ package =-=(41)-=-. The B-statistics gives the log odds of differential expression and it requires an ‘a priori’ value for the estimated proportion of differentially expressed genes. To determine this value, we visuall...

Draghici S. Analysis of microarray experiments of gene expression profiling

by Adi L. Tarca, Roberto Romero, Sorin Draghici - Am J Obstet Gynecol
"... The study of gene expression profiling of cells and tissue has become a major tool for discovery in medicine. Microarray experiments allow description of genome-wide expression changes in health and disease. The results of such experiments are expected to change the methods employed in the diagnosis ..."
Abstract - Cited by 15 (1 self) - Add to MetaCart
The study of gene expression profiling of cells and tissue has become a major tool for discovery in medicine. Microarray experiments allow description of genome-wide expression changes in health and disease. The results of such experiments are expected to change the methods employed in the diagnosis and prognosis of disease in obstetrics and gynecology. Moreover, an unbiased and systematic study of gene expression profiling should allow the establishment of a new taxonomy of disease for obstetric and gynecologic syndromes. Thus, a new era is emerging in which reproductive processes and disorders could be characterized using molecular tools and fingerprinting. The design, analysis, and interpretation of microarray experiments require specialized knowledge that is not part of the standard curriculum of our discipline. This article describes the types of studies that can be conducted with microarray experiments (class comparison, class prediction, class discovery). We discuss key issues pertaining to experimental design, data preprocessing, and gene selection methods. Common types of data representation are illustrated. Potential pitfalls in the interpretation of microarray experiments, as well as the strengths and limitations of this technology, are highlighted. This article is intended to assist clinicians in appraising the quality of the scientific evidence now reported in the obstetric and gynecologic literature.

BATS: a Bayesian user-friendly software for Analyzing Time Series microarray experiments.

by Claudia Angelini, Luisa Cutillo, Daniela De Canditiis, Margherita Mutarelli, Marianna Pensky
"... BATS is a user-friendly software for Bayesian Analysis of Time Series microarray experiments based on the novel, truly functional and fully Bayesian approach proposed in Angelini et at. (2006). The software is specifically designed for time series data. It allows an user to automatically identify an ..."
Abstract - Cited by 14 (1 self) - Add to MetaCart
BATS is a user-friendly software for Bayesian Analysis of Time Series microarray experiments based on the novel, truly functional and fully Bayesian approach proposed in Angelini et at. (2006). The software is specifically designed for time series data. It allows an user to automatically identify and rank differentially expressed genes and to estimate their expression profiles. BATS successfully manages various technical difficulties which arise in microarray time-course experiments, such as a small number of observations, non-uniform sampling intervals, and presence of missing or multiple data. BATS can carry out analysis with both simulated and real experimental data. It also handles data from different platforms. 1 Availability: BATS is written in Matlab and executable in Windows (Macintosh and Linux version are currently under development). It is freely available upon request from the authors. 1

Estimating the proportion of microarray probes expressed in an RNA sample. Nucleic acids research 38

by Wei Shi, Carolyn A. De Graaf, Sarah A. Kinkel, Ariel H. Achtman, Tracey Baldwin, Louis Schofield Hamish S. Scott , 2010
"... A fundamental question in microarray analysis is the estimation of the number of expressed probes in different RNA samples. Negative control probes available in the latest microarray platforms, such as Illumina whole genome expression BeadChips, provide a unique opportunity to estimate the number of ..."
Abstract - Cited by 10 (6 self) - Add to MetaCart
A fundamental question in microarray analysis is the estimation of the number of expressed probes in different RNA samples. Negative control probes available in the latest microarray platforms, such as Illumina whole genome expression BeadChips, provide a unique opportunity to estimate the number of expressed probes without setting a threshold. A novel algorithm was proposed in this study to estimate the number of expressed probes in an RNA sample by utilizing these negative controls to measure background noise. The perfor-mance of the algorithm was demonstrated by comparing different generations of Illumina BeadChips, comparing the set of probes targeting well-characterized RefSeq NM transcripts with other probes on the array and comparing pure samples with heterogenous samples. Furthermore, hematopoietic stem cells were found to have a larger transcriptome than progenitor cells. Aire knockout medullary thymic epithelial cells were shown to have significantly less expressed probes than matched wild-type cells.
(Show Context)

Citation Context

...ated in Table 1, all data is from in-house experiments conducted by the authors. Data input and annotation All microarray data was read and manipulated using the Bioconductor R software package limma =-=(24)-=-. Probe annotation files were downloaded from Illumina web site (http://www.illumina.com). RESULTS Negative control intensities Illumina BeadChips include a set of negative control probes (10). The ne...

Adaptation of the MapMan ontology to biotic stress responses:

by Ana Rotter, Björn Usadel, Špela Baebler, Mark Stitt , 2007
"... application in solanaceous species ..."
Abstract - Cited by 10 (5 self) - Add to MetaCart
application in solanaceous species
(Show Context)

Citation Context

...ed [22]. Three biological replicates for a virus and a mock infected sample were analysed using the TIGR potato 10 k microarray. Data were analyzed using R statistical software with the limma package =-=[23]-=-. The output of the analysis, a list of possible differentially expressed genes (p < 0.05), with their respective M values, was visualized using MapMan (Figure 1a and 1b). Instead of having to analyze...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University