Plant biodiversity represents a rich and largely unexplored source of bioactive compounds with potential applications in human health. Despite advances in synthetic pharmaceuticals, many wild and cultivated plant species remain poorly characterized, limiting the discovery of molecular compounds. Phylogenetic and molecular approaches can provide a powerful framework to study this diversity, enabling the identification of evolutionarily distinct lineages and the prediction of species likely to produce bioactive compounds. By integrating evolutionary history, genetic variation, and functional traits, these methods can improve the efficiency of bioprospecting offering a systematic strategy for the targeted exploration of medicinal plants. Within this framework, this thesis aims to valorize plant biodiversity by using and implementing phylogenetic methods. It provides novel approaches for researchers to explore botanical diversity in the search for new molecules and to identify variants with enhanced efficacy for human health. In particular this thesis explores advanced computational and phylogenetic methodologies for identifying bioactive compounds and characterizing genetic diversity in plants. The first section introduces a novel workflow, implemented in the pm4mp R package, designed to predict medicinal potential using three distinct methods: nodesigl_harvesteR for identifying "stable hot nodes" in phylogenetic trees and the Hot Ancestry Score (HAS) and the Hidden Medicinal Plant Prediction (HMPP) for ranking species. The workflow was applied to search putative medicinal plants using data obtained from a public database of known medicinal plants related to ten diseases, in combination with a phylogeny of more than 30,000 land plant species. The analysis made it possible to identify new plant species showing a high probability of being associated with important diseases (allergy, Alzheimer’s disease, arteriosclerosis, colon cancer, depression, hyperglycemia, hypertension, insomnia, malaria and prostate hyperplasia). This pipeline was validated by screening sixty Mediterranean species for neuroprotective properties against Parkinson’s disease, ultimately leading to the identification of acteoside as a potent inhibitor of alpha-synuclein aggregation through a combination of phylogenetic screening, metabolomics, and in vivo assays. The second part of the research shifts focus to the Bowman–Birk inhibitors (BBIs) within the genus Vigna. BBIs exhibit chemopreventive and antitumor activity against colon cancer by inhibiting serine proteases involved in tumor growth and progression. By analyzing the genetic variability of trypsin and chymotrypsin inhibitor genes across wild and domesticated accessions, the study assesses the impact of natural selection and geographic variation on protein evolution. Utilizing Ancestral Sequence Reconstruction (ASR) based on phylogenetic analysis and expression analysis, the research has identified key ancestral isoforms. These genetic and phylogenetic findings were integrated with computational modeling of binding energies to pinpoint the most efficient inhibitors for human biological targets, demonstrating how evolutionary insights can influence the discovery of high-performing natural proteins. Overall, this thesis confirms that computational bioprospecting methods are a powerful tool to explore and valorise plant biodiversity, providing a foundation for the discovery of novel bioactive compounds with potential human health applications.

La biodiversità vegetale rappresenta una risorsa ricca e in gran parte inesplorata di composti bioattivi con potenziali applicazioni per la salute umana. Nonostante i progressi nel campo dei farmaci sintetici, molte specie vegetali spontanee e coltivate rimangono scarsamente caratterizzate, limitando la scoperta di nuove molecole. Gli approcci filogenetici e molecolari offrono un quadro metodologico efficace per lo studio di questa diversità, consentendo l’identificazione di linee evolutivamente distinte e la previsione delle specie più probabilmente in grado di produrre composti bioattivi. Integrando storia evolutiva, variabilità genetica e tratti funzionali, tali metodi migliorano l’efficienza del bioprospecting e favoriscono un’esplorazione mirata delle piante medicinali. In questo contesto, la presente tesi mira a valorizzare la biodiversità vegetale attraverso l’uso e lo sviluppo di metodi filogenetici per l’identificazione di nuove risorse bioattive. La tesi esplora metodologie computazionali e filogenetiche avanzate per l’identificazione di composti bioattivi e la caratterizzazione della diversità genetica nelle piante. La prima parte introduce un workflow innovativo, implementato nel pacchetto R pm4mp, progettato per predire il potenziale medicinale delle specie mediante tre approcci: nodesigl_harvesteR per l’identificazione dei “nodi caldi stabili” negli alberi filogenetici, e i metodi Hot Ancestry Score (HAS) e Hidden Medicinal Plant Prediction (HMPP) per il ranking delle specie. Il workflow è stato applicato a dati provenienti da un database pubblico di piante medicinali associate a dieci patologie, in combinazione con una filogenesi comprendente oltre 30.000 specie di piante terrestri. L’analisi ha permesso di identificare nuove specie con un’elevata probabilità di associazione a patologie di rilevanza sanitaria, tra cui allergia, malattia di Alzheimer, carcinoma del colon, iperglicemia, ipertensione e malaria. Il pipeline è stato validato mediante lo screening di sessanta specie mediterranee per attività neuroprotettiva contro la malattia di Parkinson, portando all’identificazione dell’acteoside come potente inibitore dell’aggregazione dell’alfa-sinucleina attraverso un approccio integrato di screening filogenetico, metabolomica e saggi in vivo. La seconda parte della ricerca si concentra sugli inibitori di Bowman–Birk (BBI) nel genere Vigna, noti per la loro attività chemiopreventiva e antitumorale nel carcinoma del colon. Attraverso l’analisi della variabilità genetica dei geni codificanti per inibitori della tripsina e della chimotripsina in accessioni selvatiche e domesticate, lo studio valuta l’effetto della selezione naturale e della variazione geografica sull’evoluzione proteica. Mediante Ricostruzione delle Sequenze Ancestrali (ASR), analisi filogenetiche ed espressione genica, sono state identificate isoforme ancestrali chiave, successivamente integrate con modellazione computazionale delle energie di legame per individuare gli inibitori più efficienti verso target biologici umani. Nel complesso, questa tesi dimostra come il bioprospecting computazionale rappresenti uno strumento efficace per esplorare e valorizzare la biodiversità vegetale, favorendo la scoperta di nuovi composti bioattivi di interesse per la salute umana.

Toini, E (2026). Phylogenetic Prediction of Medicinal Plants and Evolutionary Analysis of Bowman–Birk Inhibitors. (Tesi di dottorato, , 2026).

Phylogenetic Prediction of Medicinal Plants and Evolutionary Analysis of Bowman–Birk Inhibitors

TOINI, ELISA
2026

Abstract

Plant biodiversity represents a rich and largely unexplored source of bioactive compounds with potential applications in human health. Despite advances in synthetic pharmaceuticals, many wild and cultivated plant species remain poorly characterized, limiting the discovery of molecular compounds. Phylogenetic and molecular approaches can provide a powerful framework to study this diversity, enabling the identification of evolutionarily distinct lineages and the prediction of species likely to produce bioactive compounds. By integrating evolutionary history, genetic variation, and functional traits, these methods can improve the efficiency of bioprospecting offering a systematic strategy for the targeted exploration of medicinal plants. Within this framework, this thesis aims to valorize plant biodiversity by using and implementing phylogenetic methods. It provides novel approaches for researchers to explore botanical diversity in the search for new molecules and to identify variants with enhanced efficacy for human health. In particular this thesis explores advanced computational and phylogenetic methodologies for identifying bioactive compounds and characterizing genetic diversity in plants. The first section introduces a novel workflow, implemented in the pm4mp R package, designed to predict medicinal potential using three distinct methods: nodesigl_harvesteR for identifying "stable hot nodes" in phylogenetic trees and the Hot Ancestry Score (HAS) and the Hidden Medicinal Plant Prediction (HMPP) for ranking species. The workflow was applied to search putative medicinal plants using data obtained from a public database of known medicinal plants related to ten diseases, in combination with a phylogeny of more than 30,000 land plant species. The analysis made it possible to identify new plant species showing a high probability of being associated with important diseases (allergy, Alzheimer’s disease, arteriosclerosis, colon cancer, depression, hyperglycemia, hypertension, insomnia, malaria and prostate hyperplasia). This pipeline was validated by screening sixty Mediterranean species for neuroprotective properties against Parkinson’s disease, ultimately leading to the identification of acteoside as a potent inhibitor of alpha-synuclein aggregation through a combination of phylogenetic screening, metabolomics, and in vivo assays. The second part of the research shifts focus to the Bowman–Birk inhibitors (BBIs) within the genus Vigna. BBIs exhibit chemopreventive and antitumor activity against colon cancer by inhibiting serine proteases involved in tumor growth and progression. By analyzing the genetic variability of trypsin and chymotrypsin inhibitor genes across wild and domesticated accessions, the study assesses the impact of natural selection and geographic variation on protein evolution. Utilizing Ancestral Sequence Reconstruction (ASR) based on phylogenetic analysis and expression analysis, the research has identified key ancestral isoforms. These genetic and phylogenetic findings were integrated with computational modeling of binding energies to pinpoint the most efficient inhibitors for human biological targets, demonstrating how evolutionary insights can influence the discovery of high-performing natural proteins. Overall, this thesis confirms that computational bioprospecting methods are a powerful tool to explore and valorise plant biodiversity, providing a foundation for the discovery of novel bioactive compounds with potential human health applications.
LABRA, MASSIMO
GRASSI, FABRIZIO
Pianta medicinale; Filogenesi; Vigna Savi; Bowman-Birk inhibito; Bioprospezione
Medicinal plant; Phylogenetic; Vigna Savi; Bowman-Birk inhibito; Bioprospecting
Italian
30-mar-2026
38
2024/2025
embargoed_20290330
Toini, E (2026). Phylogenetic Prediction of Medicinal Plants and Evolutionary Analysis of Bowman–Birk Inhibitors. (Tesi di dottorato, , 2026).
File in questo prodotto:
File Dimensione Formato  
phd_unimib_803387.pdf

embargo fino al 30/03/2029

Descrizione: Tesi di Toini Elisa - 803387
Tipologia di allegato: Doctoral thesis
Dimensione 7.53 MB
Formato Adobe PDF
7.53 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/610812
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
Social impact