Liquid chromatography (LC) coupled to mass spectrometry (MS) is a powerful and versatile technique with several applications in analytical chemistry such as the untargeted analysis of complex mixtures of plant food bioactive compounds. In this framework, identification of compounds mainly relies on mass and fragmentation patterns derived from MS spectra of available databases. However, to develop automated tools for a rapid compound identification, prior knowledge of retention times (RTs) has been demonstrated to be a valid support to MS data to reduce the number of possible candidate structures. Unlike experimental methods, which are time-consuming and limited to a few classes of compounds, data-driven computational approaches can predict retention times for a wide range of compounds using only their molecular structures and physicochemical properties. In this research, Genetic Algorithms (GAs) coupled to Multiple Linear Regression (MLR) were applied to select the most relevant molecular descriptors to establish quantitative structure-retention relationships (QSRRs) aimed at predicting the retention times of plant food bioactive compounds across three different LC chromatographic systems. The statistical parameters showed model robustness and satisfactory predictive ability. Particular attention was paid to measuring the uncertainty of predictions and assessing their reliability based on the model applicability domain. Interpretation of the selected molecular descriptors provided valuable insight into the separation mechanism. Finally, the developed models were applied to predict the unknown retention times, for the three studied LC chromatographic systems, of a large library of plant food bioactive compounds, which were made freely available to further assist the research in the field of natural products.

Sepehri, B., Consonni, V., Ballabio, D., Muñoz, E., Abbasi, E., Todeschini, R. (2025). Application of QSRR models for predicting the retention times of plant food bioactive compounds. JOURNAL OF CHROMATOGRAPHY A, 1758(13 September 2025) [10.1016/j.chroma.2025.466194].

Application of QSRR models for predicting the retention times of plant food bioactive compounds

Consonni, Viviana
;
Ballabio, Davide;Muñoz, Enmanuel Cruz;Todeschini, Roberto
2025

Abstract

Liquid chromatography (LC) coupled to mass spectrometry (MS) is a powerful and versatile technique with several applications in analytical chemistry such as the untargeted analysis of complex mixtures of plant food bioactive compounds. In this framework, identification of compounds mainly relies on mass and fragmentation patterns derived from MS spectra of available databases. However, to develop automated tools for a rapid compound identification, prior knowledge of retention times (RTs) has been demonstrated to be a valid support to MS data to reduce the number of possible candidate structures. Unlike experimental methods, which are time-consuming and limited to a few classes of compounds, data-driven computational approaches can predict retention times for a wide range of compounds using only their molecular structures and physicochemical properties. In this research, Genetic Algorithms (GAs) coupled to Multiple Linear Regression (MLR) were applied to select the most relevant molecular descriptors to establish quantitative structure-retention relationships (QSRRs) aimed at predicting the retention times of plant food bioactive compounds across three different LC chromatographic systems. The statistical parameters showed model robustness and satisfactory predictive ability. Particular attention was paid to measuring the uncertainty of predictions and assessing their reliability based on the model applicability domain. Interpretation of the selected molecular descriptors provided valuable insight into the separation mechanism. Finally, the developed models were applied to predict the unknown retention times, for the three studied LC chromatographic systems, of a large library of plant food bioactive compounds, which were made freely available to further assist the research in the field of natural products.
Articolo in rivista - Articolo scientifico
Data sharing; FooDB database; HPLC; Molecular descriptors; Plant food bioactive compounds; PredRet database; QSRR; Retention time;
English
5-lug-2025
2025
1758
13 September 2025
466194
open
Sepehri, B., Consonni, V., Ballabio, D., Muñoz, E., Abbasi, E., Todeschini, R. (2025). Application of QSRR models for predicting the retention times of plant food bioactive compounds. JOURNAL OF CHROMATOGRAPHY A, 1758(13 September 2025) [10.1016/j.chroma.2025.466194].
File in questo prodotto:
File Dimensione Formato  
Sepehri et al-2025-Journal of Chromatography A-VoR.pdf

accesso aperto

Descrizione: This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Creative Commons
Dimensione 1.45 MB
Formato Adobe PDF
1.45 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/561021
Citazioni
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
Social impact