Poster
A Variational Perspective on Generative Protein Fitness Optimization
Lea Bogensperger · Dominik Narnhofer · Ahmed Allam · Konrad Schindler · Michael Krauthammer
West Exhibition Hall B2-B3 #W-313
The goal of protein fitness optimization is to discover new protein variants with enhanced fitness for a given use. The vast search space and the sparsely populated fitness landscape, along with the discrete nature of protein sequences, pose significant challenges when trying to determine the gradient towards configurations with higher fitness. We introduce Variational Latent Generative Protein Optimization (VLGPO), a variational perspective on fitness optimization. Our method embeds protein sequences in a continuous latent space to enable efficient sampling from the fitness distribution and combines a (learned) flow matching prior over sequence mutations with a fitness predictor to guide optimization towards sequences with high fitness. VLGPO achieves state-of-the-art results on two different protein benchmarks of varying complexity. Moreover, the variational design with explicit prior and likelihood functions offers a flexible plug-and-play framework that can be easily customized to suit various protein design tasks.
We consider the task of protein fitness optimization, which aims to enhance a specific function of a protein by modifying its amino acid sequence. Because the space of possible sequences is vast, computational approaches can help by suggesting new protein candidates. We use an approach based on generative models, where the goal is to learn the distribution of a data set of protein mutants. We then employ a second model to steer the generation process toward sequences of higher fitness. We demonstrate the effectiveness of our approach on two proteins, AAV and GFP, each with two design tasks of different difficulty (medium and hard).
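As a rough illustration of the idea of steering a generative prior with a second model, the following is a minimal toy sketch and not the VLGPO implementation. It makes several assumptions that do not come from the abstract: the latent space is two-dimensional, the learned flow matching prior is replaced by the closed-form velocity field that transports N(0, I) to a Gaussian N(mu, sigma^2 I) along linear interpolation paths, the fitness predictor is a hypothetical linear function f(z) = z · w, and guidance is applied by simply adding the scaled predictor gradient to the prior velocity. Encoding and decoding of protein sequences is omitted entirely.

```python
import numpy as np


def prior_velocity(z, t, mu, sigma):
    """Closed-form flow matching velocity for N(0, I) -> N(mu, sigma^2 I).

    Toy stand-in for a learned prior: with z_t = (1 - t) * z0 + t * z1 and
    independent Gaussian endpoints, E[z1 - z0 | z_t] is linear in z_t.
    """
    var_t = (1.0 - t) ** 2 + (t * sigma) ** 2
    coeff = (t * sigma ** 2 - (1.0 - t)) / var_t
    return mu + coeff * (z - t * mu)


def fitness_gradient(z, w):
    """Gradient of the hypothetical linear fitness predictor f(z) = z @ w."""
    return np.broadcast_to(w, z.shape)


def sample(z0, mu, sigma, w, guidance_scale, n_steps=50):
    """Euler integration of the prior flow plus a fitness guidance term."""
    z = z0.copy()
    dt = 1.0 / n_steps
    for i in range(n_steps):
        t = i * dt
        v = prior_velocity(z, t, mu, sigma)
        v = v + guidance_scale * fitness_gradient(z, w)
        z = z + dt * v
    return z


rng = np.random.default_rng(0)
z0 = rng.standard_normal((256, 2))   # shared starting noise
mu = np.array([1.0, -1.0])           # toy prior mean in latent space
sigma = 0.5                          # toy prior standard deviation
w = np.array([0.0, 1.0])             # toy fitness direction

z_prior = sample(z0, mu, sigma, w, guidance_scale=0.0)
z_guided = sample(z0, mu, sigma, w, guidance_scale=1.0)

mean_fitness_prior = float((z_prior @ w).mean())
mean_fitness_guided = float((z_guided @ w).mean())
print(mean_fitness_prior, mean_fitness_guided)
```

Starting both runs from the same noise makes the comparison direct: the guided samples differ from the unguided ones only through the added gradient term, which in this toy setting shifts them along the fitness direction `w`. In a real pipeline the velocity field and the predictor would be trained networks, and the guidance scale would trade off fitness against staying close to the prior.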