Prediction of Chromatography Conditions for Purification in Organic Synthesis Using Deep Learning

In this research, a process for developing normal-phase liquid chromatography solvent systems has been proposed. In contrast to the development of conditions via thin-layer chromatography (TLC), this process is based on the architecture of two hierarchically connected neural network-based components...

Full description

Bibliographic Details
Main Authors:	Mantas Vaškevičius, Jurgita Kapočiūtė-Dzikienė, Liudas Šlepikas
Format:	Article
Language:	English
Published:	MDPI AG 2021-04-01
Series:	Molecules
Subjects:	deep learning chromatography neural networks machine learning solvent prediction organic synthesis
Online Access:	https://www.mdpi.com/1420-3049/26/9/2474

Description
Summary:	In this research, a process for developing normal-phase liquid chromatography solvent systems has been proposed. In contrast to the development of conditions via thin-layer chromatography (TLC), this process is based on the architecture of two hierarchically connected neural network-based components. Using a large database of reaction procedures allows those two components to perform an essential role in the machine-learning-based prediction of chromatographic purification conditions, i.e., solvents and the ratio between solvents. In our paper, we build two datasets and test various molecular vectorization approaches, such as extended-connectivity fingerprints, learned embedding, and auto-encoders along with different types of deep neural networks to demonstrate a novel method for modeling chromatographic solvent systems employing two neural networks in sequence. Afterward, we present our findings and provide insights on the most effective methods for solving prediction tasks. Our approach results in a system of two neural networks with long short-term memory (LSTM)-based auto-encoders, where the first predicts solvent labels (by reaching the classification accuracy of 0.950 ± 0.001) and in the case of two solvents, the second one predicts the ratio between two solvents (R<sup>2</sup> metric equal to 0.982 ± 0.001). Our approach can be used as a guidance instrument in laboratories to accelerate scouting for suitable chromatography conditions.
ISSN:	1420-3049

Prediction of Chromatography Conditions for Purification in Organic Synthesis Using Deep Learning

Similar Items