Knowledge gaps and hallucinations are persistent challenges for Large Language Models (LLMs), which generate unreliable responses when lacking the necessary information to fulfill user instructions. Existing approaches, such as Retrieval-Augmented Generation (RAG) and tool use, aim to address these issues by incorporating external knowledge. Yet, they rely on additional models or services, resulting in complex pipelines, potential error propagation, and often requiring the model to process a large number of tokens. In this paper, we present ReFactX, a scalable method that enables LLMs to access external knowledge without depending on retrievers or auxiliary models. Our approach uses constrained generation with a pre-built prefix-tree index. Triples from Wikidata are verbalized in textual facts, tokenized, and indexed in a prefix tree for efficient access. During inference, to acquire external knowledge, the LLM generates facts with constrained generation which allows only sequences of tokens that form an existing fact. We evaluate our proposal on Question Answering and show that it scales to large knowledge bases (800 million facts), adapts to domain-specific data, and achieves effective results. These gains come with minimal generation-time overhead. ReFactX code is available at https://github.com/rpo19/ReFactX.

Pozzi, R., Palmonari, M., Coletta, A., Bellomarini, L., Lehmann, J., Vahdati, S. (2026). ReFactX: Scalable Reasoning with Reliable Facts via Constrained Generation. In The Semantic Web – ISWC 2025 24th International Semantic Web Conference, Nara, Japan, November 2–6, 2025, Proceedings, Part I (pp.290-308). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-032-09527-5_16].

ReFactX: Scalable Reasoning with Reliable Facts via Constrained Generation

Pozzi R.;Palmonari M.;
2026

Abstract

Knowledge gaps and hallucinations are persistent challenges for Large Language Models (LLMs), which generate unreliable responses when lacking the necessary information to fulfill user instructions. Existing approaches, such as Retrieval-Augmented Generation (RAG) and tool use, aim to address these issues by incorporating external knowledge. Yet, they rely on additional models or services, resulting in complex pipelines, potential error propagation, and often requiring the model to process a large number of tokens. In this paper, we present ReFactX, a scalable method that enables LLMs to access external knowledge without depending on retrievers or auxiliary models. Our approach uses constrained generation with a pre-built prefix-tree index. Triples from Wikidata are verbalized in textual facts, tokenized, and indexed in a prefix tree for efficient access. During inference, to acquire external knowledge, the LLM generates facts with constrained generation which allows only sequences of tokens that form an existing fact. We evaluate our proposal on Question Answering and show that it scales to large knowledge bases (800 million facts), adapts to domain-specific data, and achieves effective results. These gains come with minimal generation-time overhead. ReFactX code is available at https://github.com/rpo19/ReFactX.
paper
Question Answering, Constrained Generation
English
ISWC 2025 24th International Semantic Web Conference November 2–6, 2025
2025
Garijo, D; Kirrane, S; Salatino, A; Shimizu, C; Acosta, M; Nuzzolese, AG; Ferrada, S; Soulard, T; Kozaki, K; Takeda, H; Gentile, AL
The Semantic Web – ISWC 2025 24th International Semantic Web Conference, Nara, Japan, November 2–6, 2025, Proceedings, Part I
9783032095268
2026
16140 LNCS
290
308
none
Pozzi, R., Palmonari, M., Coletta, A., Bellomarini, L., Lehmann, J., Vahdati, S. (2026). ReFactX: Scalable Reasoning with Reliable Facts via Constrained Generation. In The Semantic Web – ISWC 2025 24th International Semantic Web Conference, Nara, Japan, November 2–6, 2025, Proceedings, Part I (pp.290-308). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-032-09527-5_16].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/579661
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
Social impact