Bicocca Open Archive

Scientific question answering remains a significant challenge for the current generation of large language models (LLMs) due to the requirement of engaging with highly specialised concepts. A promising solution is to integrate LLMs with knowledge graphs of research concepts, ensuring that responses are grounded in structured, verifiable information. One effective approach involves using LLMs to translate questions posed in natural language into SPARQL queries, enabling the retrieval of relevant data. In this paper, we analyse the performance of several LLMs on this task using two scientific question-answering benchmarks: SciQA and DBLP-QuAD. We explore both few-shot learning and fine-tuning strategies, investigate error patterns across different models, and propose directions for future research.

Meloni, A., Recupero, D., Osborne, F., Salatino, A., Motta, E., Vahadati, S., et al. (2025). Assessing Large Language Models for SPARQL Query Generation in Scientific Question Answering. In Proceedings of the Special Session on Harmonising Generative AI and Semantic Web Technologies (HGAIS 2024) co-located with the 23rd International Semantic Web Conference (ISWC 2024) (pp.1-7). CEUR-WS.

Assessing Large Language Models for SPARQL Query Generation in Scientific Question Answering

Meloni A.;Recupero D. R.;Osborne F.;Salatino A.;Motta E.;Vahadati S.;Lehmann J.

2025

Abstract

Scientific question answering remains a significant challenge for the current generation of large language models (LLMs) due to the requirement of engaging with highly specialised concepts. A promising solution is to integrate LLMs with knowledge graphs of research concepts, ensuring that responses are grounded in structured, verifiable information. One effective approach involves using LLMs to translate questions posed in natural language into SPARQL queries, enabling the retrieval of relevant data. In this paper, we analyse the performance of several LLMs on this task using two scientific question-answering benchmarks: SciQA and DBLP-QuAD. We explore both few-shot learning and fine-tuning strategies, investigate error patterns across different models, and propose directions for future research.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di intervento
	
				paper
			
	Parole chiave
	
				Knowledge Graphs; Large Language Models; Machine Translation; SPARQL;
			
	Lingua del contenuto
	
				English
			
	Nome del convegno
	
				2024 Harmonising Generative AI and Semantic Web Technologies, HGAIS 2024 - November 13, 2024
			
	Anno del convegno
	
				2024
			
	Curatori della monografia
	
				Alharbi, R; de Berardinis, J; Groth, P; Meroño Peñuela, A; Simperl , E; Tamma, V
			
	Titolo degli atti
	
				Proceedings of the Special Session on Harmonising Generative AI and Semantic Web Technologies (HGAIS 2024)
co-located with the 23rd International Semantic Web Conference (ISWC 2024)
			
	Collana o serie
	
				CEUR WORKSHOP PROCEEDINGS
			
	Data di pubblicazione
	
				2025
			
	Numero del volume
	
				3953
			
	Pagina iniziale
	
				1
			
	Pagina finale
	
				7
			
	URL alternativo
	
				https://ceur-ws.org/Vol-3953/
			
	Fulltext
	
				open
			
	Citazione
	
				Meloni, A., Recupero, D., Osborne, F., Salatino, A., Motta, E., Vahadati, S., et al. (2025). Assessing Large Language Models for SPARQL Query Generation in Scientific Question Answering. In Proceedings of the Special Session on Harmonising Generative AI and Semantic Web Technologies (HGAIS 2024)
co-located with the 23rd International Semantic Web Conference (ISWC 2024) (pp.1-7). CEUR-WS.
			
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

File	Dimensione	Formato
Meloni-2025-HGAIS-VoR.pdf accesso aperto Descrizione: This volume and its papers are published under the Creative Commons License Attribution 4.0 International (CC BY 4.0). Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Licenza: Creative Commons Dimensione 209.26 kB Formato Adobe PDF Visualizza/Apri	209.26 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/553722

Citazioni

1

ND

Social impact