Transformer graph variational autoencoder for generative molecular design. 2025

Trieu Nguyen, and Aleksandra Karolak
Department of Machine Learning, Moffitt Cancer Center, Tampa, Florida; Department of Mathematics and Statistics, University of South Florida, Tampa, Florida.

In the field of drug discovery, the generation of new molecules with desirable properties remains a critical challenge. Traditional methods often rely on simplified molecular input line entry system representations for molecular input data, which can limit the diversity and novelty of generated molecules. To address this, we present the transformer graph variational autoencoder (TGVAE), an innovative AI model that employs molecular graphs as input data, thus capturing the complex structural relationships within molecules more effectively than string models. To enhance molecular generation capabilities, TGVAE combines a transformer, graph neural network (GNN), and VAE. Additionally, we address common issues like over-smoothing in training GNNs and posterior collapse in VAEs to ensure robust training and improve the generation of chemically valid and diverse molecular structures. Our results demonstrate that TGVAE outperforms existing approaches, generating a larger collection of diverse molecules and discovering structures that were previously unexplored. This advancement not only brings more possibilities for drug discovery but also sets a new level for the use of AI in molecular generation.

UI MeSH Term Description Entries

Related Publications

Trieu Nguyen, and Aleksandra Karolak
July 2018, Journal of cheminformatics,
Trieu Nguyen, and Aleksandra Karolak
June 2022, Journal of chemical information and modeling,
Trieu Nguyen, and Aleksandra Karolak
October 2025, Proceedings of the National Academy of Sciences of the United States of America,
Trieu Nguyen, and Aleksandra Karolak
November 2023, Briefings in bioinformatics,
Trieu Nguyen, and Aleksandra Karolak
December 2023, Digital discovery,
Trieu Nguyen, and Aleksandra Karolak
January 2023, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society,
Trieu Nguyen, and Aleksandra Karolak
November 2023, International journal of molecular sciences,
Trieu Nguyen, and Aleksandra Karolak
January 2025, IEEE transactions on computational biology and bioinformatics,
Copied contents to your clipboard!