Protein design and directed evolution have separately contributed enormously to protein engineering. Without being mutually exclusive, the former relies on computation from first principles, while the latter is a combinatorial approach based on chance. Advances in ultrahigh throughput (uHT) screening, next generation sequencing and machine learning may create alternative routes to engineered proteins, where functional information linked to specific sequences is interpreted and extrapolated in silico. In particular, the miniaturisation of functional tests in water-in-oil emulsion droplets with picoliter volumes and their rapid generation and analysis (>1 kHz) allows screening of >107-membered libraries in a day. Subsequently, decoding the selected clones by short or long-read sequencing methods leads to large sequence-function datasets that may allow extrapolation from experimental directed evolution to further improved mutants beyond the observed hits. In this work, we explore experimental strategies for how to draw up ‘fitness landscapes’ in sequence space with uHT droplet microfluidics, review the current state of AI/ML in enzyme engineering and discuss how uHT datasets may be combined with AI/ML to make meaningful predictions and accelerate biocatalyst engineering.

On synergy between ultrahigh throughput screening and machine learning in biocatalyst engineering / Gantz, M.; Mathis, S. V.; Nintzel, F. E. H.; Lio, P.; Hollfelder, F.. - In: FARADAY DISCUSSIONS. - ISSN 1359-6640. - 252:(2024), pp. 89-114. [10.1039/d4fd00065j]

On synergy between ultrahigh throughput screening and machine learning in biocatalyst engineering

Lio P.;
2024

Abstract

Protein design and directed evolution have separately contributed enormously to protein engineering. Without being mutually exclusive, the former relies on computation from first principles, while the latter is a combinatorial approach based on chance. Advances in ultrahigh throughput (uHT) screening, next generation sequencing and machine learning may create alternative routes to engineered proteins, where functional information linked to specific sequences is interpreted and extrapolated in silico. In particular, the miniaturisation of functional tests in water-in-oil emulsion droplets with picoliter volumes and their rapid generation and analysis (>1 kHz) allows screening of >107-membered libraries in a day. Subsequently, decoding the selected clones by short or long-read sequencing methods leads to large sequence-function datasets that may allow extrapolation from experimental directed evolution to further improved mutants beyond the observed hits. In this work, we explore experimental strategies for how to draw up ‘fitness landscapes’ in sequence space with uHT droplet microfluidics, review the current state of AI/ML in enzyme engineering and discuss how uHT datasets may be combined with AI/ML to make meaningful predictions and accelerate biocatalyst engineering.
2024
Biocatalysis; High-Throughput Screening Assays; Machine Learning; Protein Engineering
01 Pubblicazione su rivista::01a Articolo in rivista
On synergy between ultrahigh throughput screening and machine learning in biocatalyst engineering / Gantz, M.; Mathis, S. V.; Nintzel, F. E. H.; Lio, P.; Hollfelder, F.. - In: FARADAY DISCUSSIONS. - ISSN 1359-6640. - 252:(2024), pp. 89-114. [10.1039/d4fd00065j]
File allegati a questo prodotto
File Dimensione Formato  
Gantz_On-synergy_2024.pdf

accesso aperto

Note: https://doi.org/10.1039/D4FD00065J
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 2.26 MB
Formato Adobe PDF
2.26 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1723972
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 2
social impact