Archetti, F., Ponti, A., Candelieri, A., & Sabbatella, A. (2025). Bayesian optimization, machine learning, and probabilistic numerics. In Proceedings of the International Conference on Numerical Analysis and Applied Mathematics 2023 (ICNAAM-2023). American Institute of Physics. https://doi.org/10.1063/5.0287470
Bayesian optimization, machine learning, and probabilistic numerics
Archetti F.; Ponti A.; Candelieri A.
2025
Abstract
Prompt optimization is a crucial task for improving the performance of Large Language Models on downstream tasks. In this paper, a prompt is a sequence of n-grams selected from a vocabulary; the aim is therefore to select the optimal prompt with respect to a given performance metric. Prompt optimization can be cast as a combinatorial optimization problem, with the number of possible prompts (i.e., the combinatorial search space) given by the size of the vocabulary (i.e., all possible n-grams) raised to the power of the prompt length. Exhaustive search is impractical; thus, an efficient search strategy is needed. We propose a Bayesian Optimization method performed over a continuous relaxation of the combinatorial search space. Bayesian Optimization is the dominant approach in black-box optimization thanks to its sample efficiency, modular structure, and versatility. We use BoTorch, a library for Bayesian Optimization research built on top of PyTorch. Specifically, we focus on Hard Prompt Tuning, which directly searches for an optimal prompt to be added to the text input without requiring access to the internals of the Large Language Model, treating it instead as a black box (as in the case of GPT-4, which is available as a Model-as-a-Service). Albeit preliminary and based on "vanilla" Bayesian Optimization algorithms, our experiments with RoBERTa as the Large Language Model, on six benchmark datasets, show good performance compared with other state-of-the-art black-box prompt optimization methods and enable an analysis of the trade-off between the size of the search space, accuracy, and wall-clock time.
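To give a sense of the scale involved: a vocabulary of 10,000 n-grams and a prompt of length 5 already yields 10,000^5 = 10^20 candidate prompts, which is why exhaustive search is ruled out. The snippet below is a minimal, hypothetical sketch of the kind of loop the abstract describes: Bayesian Optimization with BoTorch over a continuous relaxation of a discrete prompt space. The toy vocabulary embedding, the decode() mapping back to discrete n-grams, and the score() placeholder standing in for the black-box LLM evaluation are all illustrative assumptions, not the authors' actual setup.

```python
# Hypothetical sketch: BoTorch Bayesian Optimization over a continuous
# relaxation of a discrete prompt space. Sizes, embedding, and objective
# are toy assumptions for illustration only.
import torch
from botorch.models import SingleTaskGP
from botorch.fit import fit_gpytorch_mll
from botorch.acquisition import ExpectedImprovement
from botorch.optim import optimize_acqf
from gpytorch.mlls import ExactMarginalLogLikelihood

VOCAB_SIZE, PROMPT_LEN, EMB_DIM = 50, 5, 4            # toy sizes (assumptions)
torch.manual_seed(0)
vocab_emb = torch.rand(VOCAB_SIZE, EMB_DIM, dtype=torch.double)  # continuous embedding of each n-gram

def decode(x):
    """Map a continuous point back to the nearest discrete prompt (token ids)."""
    tokens = x.view(PROMPT_LEN, EMB_DIM)
    return torch.cdist(tokens, vocab_emb).argmin(dim=-1)

def score(x):
    """Placeholder for the black-box metric the LLM would return on a dataset."""
    ids = decode(x)
    return -((ids.double() - VOCAB_SIZE / 2) ** 2).mean().unsqueeze(0)  # toy objective

dim = PROMPT_LEN * EMB_DIM
bounds = torch.stack([torch.zeros(dim, dtype=torch.double),
                      torch.ones(dim, dtype=torch.double)])

# Initial design: random points in the relaxed (continuous) search space
train_X = torch.rand(8, dim, dtype=torch.double)
train_Y = torch.stack([score(x) for x in train_X])

for _ in range(20):                                    # BO iterations
    gp = SingleTaskGP(train_X, train_Y)                # Gaussian Process surrogate
    fit_gpytorch_mll(ExactMarginalLogLikelihood(gp.likelihood, gp))
    acqf = ExpectedImprovement(gp, best_f=train_Y.max())
    cand, _ = optimize_acqf(acqf, bounds=bounds, q=1, num_restarts=5, raw_samples=64)
    train_X = torch.cat([train_X, cand])
    train_Y = torch.cat([train_Y, score(cand.squeeze(0)).unsqueeze(0)])

print("best prompt (token ids):", decode(train_X[train_Y.argmax()]).tolist())
```

In an actual Hard Prompt Tuning setting, score() would prepend the decoded n-grams to the text input, query the black-box LLM (e.g., RoBERTa or a Model-as-a-Service such as GPT-4), and return the chosen performance metric on a validation set.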


