Publication

Synergy between embedding and protein functional association networks for drug label prediction using harmonic function

Timilsina, Mohan
Mc Kernan, Declan Patrick
Yang, Haixuan
d’Aquin, Mathieu
Citation
Timilsina, Mohan, Mc Kernan, Declan Patrick, Yang, Haixuan, & d’Aquin, Mathieu. (2020). Synergy between embedding and protein functional association networks for drug label prediction using harmonic function. IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB). doi:10.1109/TCBB.2020.3031696
Abstract
Semi-Supervised Learning (SSL) is an approach to machine learning that makes use of unlabeled data for training with a small amount of labeled data. In the context of molecular biology and pharmacology, one can take advantage of unlabeled data. For instance, to identify drugs and targets where a few genes are known to be associated with a specific target for drugs and considered as labeled data. Labeling the genes requires laboratory verification and validation. This process is usually very time consuming and expensive. Thus, it is useful to estimate the functional role of drugs from unlabeled data using computational methods. To develop such a model, we used openly available data resources to create (i) drugs and genes, (ii) genes and disease, bipartite graphs. We constructed the genetic embedding graph from the two bipartite graphs using Tensor Factorization methods. We integrated the genetic embedding graph with the publicly available genetic interaction graphs. Our results show the usefulness of the integration by effectively predicting drug labels.
Publisher
ACM and IEEE
Publisher DOI
Rights
Attribution-NonCommercial-NoDerivs 3.0 Ireland