In scientific writing, references are crucial in supporting claims, spotlighting evidence, and highlighting research gaps. However, where to add a reference and which reference to cite are subjectively chosen by the papers’ authors; thus the automation of the task is challenging and requires proper investigations. This paper focuses on the automatic placement of references, considering its diverse approaches depending on writing style and community norms, and investigates the use of transformers and Natural Language Processing heuristics to predict i) if a reference is needed in a scientific statement, and ii) where the reference should be placed within the statement. For this investigation, this paper investigates two techniques, namely Mask-filling (MF) and Named Entity Recognition (NER), and provides insights on how to solve this task.

Buscaldi, D., Dessi, D., Motta, E., Murgia, M., Osborne, F., Recupero, D. (2024). Automating Citation Placement with Natural Language Processing and Transformers. In Second International Workshop on Semantic Technologies and Deep Learning Models for Scientific, Technical and Legal Data (SemTech4STLD) co-located with the Extended Semantic Web Conference 2024 (ESWC 2024) (pp.68-75). CEUR-WS.

Automating Citation Placement with Natural Language Processing and Transformers

Osborne F.;
2024

Abstract

In scientific writing, references are crucial in supporting claims, spotlighting evidence, and highlighting research gaps. However, where to add a reference and which reference to cite are subjectively chosen by the papers’ authors; thus the automation of the task is challenging and requires proper investigations. This paper focuses on the automatic placement of references, considering its diverse approaches depending on writing style and community norms, and investigates the use of transformers and Natural Language Processing heuristics to predict i) if a reference is needed in a scientific statement, and ii) where the reference should be placed within the statement. For this investigation, this paper investigates two techniques, namely Mask-filling (MF) and Named Entity Recognition (NER), and provides insights on how to solve this task.
paper
Citation Prediction; Generative Approach; Named Entity Recognition; Natural Language Processing;
English
2nd International Workshop on Semantic Technologies and Deep Learning Models for Scientific, Technical and Legal Data, SemTech4STLD 2024 - 26 May 2024
2024
Dessi, R; Dessi, D; Osborne, F; Aras, H
Second International Workshop on Semantic Technologies and Deep Learning Models for Scientific, Technical and Legal Data (SemTech4STLD) co-located with the Extended Semantic Web Conference 2024 (ESWC 2024)
2024
3697
68
75
https://ceur-ws.org/Vol-3697/
open
Buscaldi, D., Dessi, D., Motta, E., Murgia, M., Osborne, F., Recupero, D. (2024). Automating Citation Placement with Natural Language Processing and Transformers. In Second International Workshop on Semantic Technologies and Deep Learning Models for Scientific, Technical and Legal Data (SemTech4STLD) co-located with the Extended Semantic Web Conference 2024 (ESWC 2024) (pp.68-75). CEUR-WS.
File in questo prodotto:
File Dimensione Formato  
Buscaldi-2024-SemTech4STLD-VoR.pdf

accesso aperto

Descrizione: This volume and its papers are published under the Creative Commons License Attribution 4.0 International (CC BY 4.0).
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Creative Commons
Dimensione 233.6 kB
Formato Adobe PDF
233.6 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/521190
Citazioni
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
Social impact