computational drug discovery

peptidy: A light-weight Python library for peptide representation in machine learning

We present a python library to encode peptides for machine learning applications. Non-canonical amino acids and post-translational modifications are supported.

The Jungle of Generative Drug Discovery: Traps, Treasures, and Ways Out

Surprising pitfalls in common evaluation approaches for molecule libraries generated by deep learning models. Simple solutions are proposed.

A Hitchhiker's Guide to Deep Chemical Language Processing for Bioactivity Prediction

Practical guidelines for training deep learning models on molecular string representations for bioactivity prediction.

Deep Supramolecular Language Processing for Co-crystal Prediction

DeepCocrystal is a convolutional neural network to predict co-crystal formation. SMILES augmentation is key to its development.

Chemical Language Modeling with Structured State Space Sequence Models

A novel approach to chemical language modeling. First application of structured state space sequence models (S4) to *de novo* design.

The surprising ineffectiveness of molecular dynamics coordinates for predicting bioactivity with machine learning

Measuring the impact of integrating molecular dynamics simulations to machine learning pipelines for bioactivity prediction.

Deep learning for low-data drug discovery: hurdles and opportunities

A review of the deep learning approaches in low-data drug discovery. Future research directions are outlined.

Exploring Data‐Driven Chemical SMILES Tokenization Approaches to Identify Key Protein‐Ligand Binding Moieties

We pharmacologically study chemical words and find that they can designate functional groups.

A Computational Software for Training Robust Drug-Target Affinity Prediction Models: pydebiaseddta

We present a python library to train more generalizable drug-target affinity prediction models.

Structure-based Drug Discovery with Deep Learning

A review of the deep learning approaches for structure-based drug discovery. Future research directions are outlined.