ReFactX: Scalable Reasoning with Reliable Facts via Constrained Generation

Pozzi, R; Palmonari, M; Coletta, A; Bellomarini, L; Lehmann, J; Vahdati, S

doi:10.1007/978-3-032-09527-5_16

Knowledge gaps and hallucinations are persistent challenges for Large Language Models (LLMs), which generate unreliable responses when they lack the information needed to fulfill user instructions. Existing approaches, such as Retrieval-Augmented Generation (RAG) and tool use, aim to address these issues by incorporating external knowledge. Yet they rely on additional models or services, which results in complex pipelines and potential error propagation, and they often require the model to process a large number of tokens. In this paper, we present ReFactX, a scalable method that enables LLMs to access external knowledge without depending on retrievers or auxiliary models. Our approach uses constrained generation with a pre-built prefix-tree index. Triples from Wikidata are verbalized into textual facts, tokenized, and indexed in a prefix tree for efficient access. During inference, to acquire external knowledge, the LLM generates facts through constrained generation, which allows only sequences of tokens that form an existing fact. We evaluate our proposal on Question Answering and show that it scales to large knowledge bases (800 million facts), adapts to domain-specific data, and achieves effective results. These gains come with minimal generation-time overhead. ReFactX code is available at https://github.com/rpo19/ReFactX.
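
The mechanism described in the abstract (tokenized facts indexed in a prefix tree, with generation restricted to token sequences that form an existing fact) can be illustrated with a minimal sketch. This is not the ReFactX implementation: the whitespace tokenizer, the `END` marker, the three example facts, and the toy `score` function standing in for LLM token scores are all assumptions made for illustration.

```python
from typing import Callable, Dict, List

END = "<end-of-fact>"  # hypothetical marker for a complete fact


def tokenize(text: str) -> List[str]:
    # Assumption: whitespace tokens stand in for the LLM's subword tokens.
    return text.split()


def build_prefix_tree(facts: List[str]) -> Dict:
    """Index tokenized facts in a nested-dict prefix tree."""
    root: Dict = {}
    for fact in facts:
        node = root
        for token in tokenize(fact) + [END]:
            node = node.setdefault(token, {})
    return root


def allowed_next_tokens(tree: Dict, prefix: List[str]) -> List[str]:
    """Return the tokens that keep `prefix` on a path to an indexed fact."""
    node = tree
    for token in prefix:
        if token not in node:
            return []
        node = node[token]
    return list(node.keys())


def constrained_generate(
    tree: Dict, score: Callable[[List[str], str], float]
) -> str:
    """Greedy decoding in which only tree-permitted tokens can be chosen."""
    prefix: List[str] = []
    while True:
        candidates = allowed_next_tokens(tree, prefix)
        if not candidates:
            raise ValueError("the prefix tree contains no facts")
        best = max(candidates, key=lambda tok: score(prefix, tok))
        if best == END:
            return " ".join(prefix)
        prefix.append(best)


# Illustrative facts only; real verbalized Wikidata facts may look different.
facts = [
    "Rome capital of Italy",
    "Rome country Italy",
    "Paris capital of France",
]
tree = build_prefix_tree(facts)

# Toy stand-in for model scores: favor tokens that appear in the question.
wanted = {"paris", "capital", "of"}


def score(prefix: List[str], token: str) -> float:
    return 1.0 if token.lower() in wanted else 0.0


print(constrained_generate(tree, score))
```

Because every step chooses among the children of the current tree node, the output is always one of the indexed facts, whatever the scoring function prefers. In this sketch the generated fact is "Paris capital of France"; a scorer that favored "Italy" after "Paris capital of" would still be limited to "France".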

Pozzi, R., Palmonari, M., Coletta, A., Bellomarini, L., Lehmann, J., Vahdati, S. (2026). ReFactX: Scalable Reasoning with Reliable Facts via Constrained Generation. In The Semantic Web – ISWC 2025 24th International Semantic Web Conference, Nara, Japan, November 2–6, 2025, Proceedings, Part I (pp.290-308). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-032-09527-5_16].



Contribution type: paper
Keywords: Question Answering, Constrained Generation
Content language: English
Conference name: ISWC 2025, 24th International Semantic Web Conference, November 2–6, 2025
Conference year: 2025
Volume editors: Garijo, D; Kirrane, S; Salatino, A; Shimizu, C; Acosta, M; Nuzzolese, AG; Ferrada, S; Soulard, T; Kozaki, K; Takeda, H; Gentile, AL
Proceedings title: The Semantic Web – ISWC 2025: 24th International Semantic Web Conference, Nara, Japan, November 2–6, 2025, Proceedings, Part I
Proceedings volume ISBN: 9783032095268
Series: Lecture Notes in Computer Science
Publication date: 2026
Volume number: 16140 LNCS
First page: 290
Last page: 308
Contribution DOI: https://dx.doi.org/10.1007/978-3-032-09527-5_16
Full text: none
Appears in categories: 02 - Conference contribution

Files in this record:

There are no files associated with this record.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/579661


Bicocca Open Archive
