Prot-A-GAN: Automatic Protein Function Annotation using GAN-inspired Knowledge Graph Embedding - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2021

Prot-A-GAN: Automatic Protein Function Annotation using GAN-inspired Knowledge Graph Embedding

Résumé

Proteins perform various functions in living organisms. Automatic protein function annotation is defined as finding appropriate association between proteins and functional labels like Gene Ontology (GO) terms. n this paper, we present a preliminary exploration of the potential of generative adversarial networks (GAN) for protein function annotation. The Prot-A-GAN approach uses GAN-like adversarial training for learning embedding of nodes and relation in an heterogeneous knowledge graph. Following the terminologies of GAN, we firstly train a discriminator using domain-adaptive negative sampling to discriminate positive and negative triples, and then, we train a generator to guide a random walk over the knowledge graph that identify paths between proteins and GO annotations. We evaluate the method by performing protein function annotation using GO terms on human disease proteins from UniProtKB/SwissProt. As a proof-of-concept, the conducted experiments show promising outcome and open up new avenue for further exploration for protein function annotation.
Fichier principal
Vignette du fichier
WCBICML2021_paper_55.pdf (314.13 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03541255 , version 1 (24-01-2022)

Identifiants

  • HAL Id : hal-03541255 , version 1

Citer

Bishnu Sarker, Marie-Dominique Devignes, Guy Wolf, Sabeur Aridhi. Prot-A-GAN: Automatic Protein Function Annotation using GAN-inspired Knowledge Graph Embedding. ICML 2021 - Workshop on Computational Biology, Jul 2021, Virtual, United States. ⟨hal-03541255⟩
171 Consultations
109 Téléchargements

Partager

Gmail Facebook X LinkedIn More