A dynamic programing approach to integrate gene expression data and network information for pathway model generation

Y Jiang, Y Liang, D Wang, D Xu, T Joshi - Bioinformatics, 2020 - academic.oup.com
Bioinformatics, 2020academic.oup.com
Motivation As large amounts of biological data continue to be rapidly generated, a major
focus of bioinformatics research has been aimed toward integrating these data to identify
active pathways or modules under certain experimental conditions or phenotypes. Although
biologically significant modules can often be detected globally by many existing methods, it
is often hard to interpret or make use of the results toward pathway model generation and
testing. Results To address this gap, we have developed the IMPRes algorithm, a new step …
Motivation
As large amounts of biological data continue to be rapidly generated, a major focus of bioinformatics research has been aimed toward integrating these data to identify active pathways or modules under certain experimental conditions or phenotypes. Although biologically significant modules can often be detected globally by many existing methods, it is often hard to interpret or make use of the results toward pathway model generation and testing.
Results
To address this gap, we have developed the IMPRes algorithm, a new step-wise active pathway detection method using a dynamic programing approach. IMPRes takes advantage of the existing pathway interaction knowledge in Kyoto Encyclopedia of Genes and Genomes. Omics data are then used to assign penalties to genes, interactions and pathways. Finally, starting from one or multiple seed genes, a shortest path algorithm is applied to detect downstream pathways that best explain the gene expression data. Since dynamic programing enables the detection one step at a time, it is easy for researchers to trace the pathways, which may lead to more accurate drug design and more effective treatment strategies. The evaluation experiments conducted on three yeast datasets have shown that IMPRes can achieve competitive or better performance than other state-of-the-art methods. Furthermore, a case study on human lung cancer dataset was performed and we provided several insights on genes and mechanisms involved in lung cancer, which had not been discovered before.
Availability and implementation
IMPRes visualization tool is available via web server at http://digbio.missouri.edu/impres.
Supplementary information
Supplementary data are available at Bioinformatics online.
Oxford University Press