BioOptimizer

BioOptimizer: A Bayesian scoring function approach to motif discovery. Motivation: Transcription factors (TFs) bind directly to short segments on the genome, often within hundreds to thousands of base pairs upstream of gene transcription start sites, to regulate gene expression. The experimental determination of TFs binding sites is expensive and time-consuming. Many motif-finding programs have been developed, but no program is clearly superior in all situations. Practitioners often find it difficult to judge which of the motifs predicted by these algorithms are more likely to be biologically relevant. Results: We derive a comprehensive scoring function based on a full Bayesian model that can handle unknown site abundance, unknown motif width and two-block motifs with variable-length gaps. An algorithm called BioOptimizer is proposed to optimize this scoring function so as to reduce noise in the motif signal found by any motif-finding program. The accuracy of BioOptimizer, which can be used in conjunction with several existing programs, is shown to be superior to using any of these motif-finding programs alone when evaluated by both simulation studies and application to sets of co-regulated genes in bacteria. In addition, this scoring function formulation enables us to compare objectively different predicted motifs and select the optimal ones, effectively combining the strengths of existing programs. Availability: BioOptimizer is available for download at www.fas.harvard.edu/ junliu/BioOptimizer/


References in zbMATH (referenced in 6 articles )

Showing results 1 to 6 of 6.
Sorted by year (citations)

  1. Kasabov, Nikola (ed.): Springer handbook of bio-/neuro-informatics (2014)
  2. Chan, Tak-Ming; Li, Gang; Leung, Kwong-Sak; Lee, Kin-Hong: Discovering multiple realistic TFBS motifs based on a generalized model (2009) ioport
  3. Corander, Jukka; Ekdahl, Magnus; Koski, Timo: Bayesian unsupervised learning of DNA regulatory binding regions (2009) ioport
  4. Ewens, Warren J.: On the use of statistics in genomics and bioinformatics (2008)
  5. Tsukahara, Takuma; Kim, Sun; Taylor, Milton W.: REFINEMENT: a search framework for the identification of interferon-responsive elements in DNA sequences -- a case study with ISRE and GAS (2006)
  6. Wang, Chuancai; Xie, Jun; Craig, B. A.: Context dependent models for discovery of transcription factor binding sites (2006)