OSG-KINC

OSG-KINC: High-throughput gene co-expression network construction using the Open Science Grid.

Gene co-expression network (GCN) analysis is a method for characterizing the complexity underlying biological systems. As more datasets become available for mining complex gene expression patterns, novel algorithms and computational frameworks must be developed to take advantage of this wealth of information. OSG-KINC is a Pegasus workflow that enables highly parallel execution of KINC (Knowledge Independent Network Construction) using resources available on the Open Science Grid (OSG). A yeast GCN was constructed using the OSG-KINC workflow, providing an example GCN resource for biological hypothesis testing. Timing experiments demonstrate that the number of jobs submitted by the user significantly affects the performance of the workflow. An overview of workflow usage, bottlenecks, and efforts for improvement is provided. OSG-KINC is freely available at https://github.com/feltus/OSG-KINC under the GNU General Public License version 3.