Abstract
The outbreak of the global COVID-19 pandemic emphasizes the importance of collaborative drug discovery for high effectiveness; however, due to the stringent data regulation, data privacy becomes an imminent issue needing to be addressed to enable collaborative drug discovery. In addition to the data privacy issue, the efficiency of drug discovery is another key objective since infectious diseases spread exponentially and effectively conducting drug discovery could save lives. Advanced Artificial Intelligence (AI) techniques are promising to solve these problems: (1) Federated Learning (FL) is born to keep data privacy while learning data from distributed clients; (2) graph neural network (GNN) can extract structural properties of molecules whose underlying architecture is the connected atoms; and (3) generative adversarial network (GAN) can generate novel molecules while retaining the properties learned from the training data. In this work, we make the first attempt to build a holistic collaborative and privacy-preserving FL framework, namely FL-DISCO, which integrates GAN and GNN to generate molecular graphs. Experimental results demonstrate the effectiveness of FL-DISCO on: (1) IID data for ESOL and QM9, where FL-DISCO can generate highly novel compounds with high drug-likeliness, uniqueness and LogP scores compared to the baseline; (2) non-IID data for ESOL and QM9, where FL-DISCO generates 100% novel compounds with high validity and LogP scores compared to the baseline. We also demonstrate how different fractions of clients, generator and discriminator architectures affect our evaluation scores.
Original language | English (US) |
---|---|
Title of host publication | 2021 40th IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2021 - Proceedings |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9781665445078 |
DOIs | |
State | Published - 2021 |
Externally published | Yes |
Event | 40th IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2021 - Munich, Germany Duration: Nov 1 2021 → Nov 4 2021 |
Publication series
Name | IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD |
---|---|
Volume | 2021-November |
ISSN (Print) | 1092-3152 |
Conference
Conference | 40th IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2021 |
---|---|
Country/Territory | Germany |
City | Munich |
Period | 11/1/21 → 11/4/21 |
Bibliographical note
Publisher Copyright:© 2021 IEEE
Keywords
- Drug discovery
- Federated learning
- Generative adversarial network
- Graph neural network