sva
R package SVA: Surrogate Variable Analysis. The sva package contains functions for removing batch effects and other unwanted variation in high-throughput experiment. Specifically, the sva package contains functions for the identifying and building surrogate variables for high-dimensional data sets. Surrogate variables are covariates constructed directly from high-dimensional data (like gene expression/RNA sequencing/methylation/brain imaging data) that can be used in subsequent analyses to adjust for unknown, unmodeled, or latent sources of noise. The sva package can be used to remove artifacts in three ways: (1) identifying and estimating surrogate variables for unknown sources of variation in high-throughput experiments (Leek and Storey 2007 PLoS Genetics,2008 PNAS), (2) directly removing known batch effects using ComBat (Johnson et al. 2007 Biostatistics) and (3) removing batch effects with known control probes (Leek 2014 biorXiv). Removing batch effects and using surrogate variables in differential expression analysis have been shown to reduce dependence, stabilize error rate estimates, and improve reproducibility, see (Leek and Storey 2007 PLoS Genetics, 2008 PNAS or Leek et al. 2011 Nat. Reviews Genetics).
Keywords for this software
References in zbMATH (referenced in 6 articles )
Showing results 1 to 6 of 6.
Sorted by year (- Akalin, Altuna: Computational genomics with R. With the assistance of Verdan Franke, Bora Uyar and Jonathan Ronen (2021)
- McKennan, Chris; Ober, Carole; Nicolae, Dan: Estimation and inference in metabolomics with nonrandom missing data and latent factors (2020)
- Kiihl, Samara F.; Martinez-Garrido, Maria Jose; Domingo-Relloso, Arce; Bermudez, Jose; Tellez-Plaza, Maria: \textttMLML2R: an R package for maximum likelihood estimation of DNA methylation and hydroxymethylation proportions (2019)
- Sanguinetti, Guido (ed.); Huynh-Thu, Vân Anh (ed.): Gene regulatory networks. Methods and protocols (2019)
- Adam Kaplan, Eric F. Lock: Prediction with Dimension Reduction of Multiple Molecular Data Sources for Patient Survival (2017) arXiv
- Sheu, Ching-Fan; Perthame, Émeline; Lee, Yuh-Shiow; Causeur, David: Accounting for time dependence in large-scale multiple testing of event-related potential data (2016)