Towards Multi-Platform Mutation Testing of Task-Based Chatbots

Clerissi, Diego; Masserini, Elena; Micucci, Daniela; Mariani, Leonardo
2025

Abstract

Chatbots, also known as conversational agents, have become ubiquitous, offering services across a multitude of domains. Unlike general-purpose chatbots, task-based chatbots are software systems designed to prioritize the completion of tasks in the domain they handle (e.g., flight booking). Given the growing popularity of chatbots, testing techniques that can generate full conversations as test cases have emerged. Still, thoroughly testing all the conversational scenarios implemented by a task-based chatbot is challenging, so incorrect behaviors may remain unnoticed. To address this challenge, we proposed Mutabot, a mutation testing approach that injects faults into conversations and produces faulty chatbots emulating defects that may affect conversational aspects. In this paper, we present our extension of Mutabot to multiple platforms (Dialogflow and Rasa) and report experiments showing how mutation testing can be used to reveal weaknesses in test suites generated by Botium, a state-of-the-art test generator.
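The general idea of mutation testing for conversations can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the dictionary-based bot, the two mutation operators (`remove_intent`, `swap_responses`), and the test-suite format are hypothetical and are not Mutabot's operators, nor the Dialogflow, Rasa, or Botium formats. A mutant counts as killed when at least one test conversation fails on it.

```python
from typing import Dict, List, Tuple

# Toy task-based chatbot: keyword -> response (hypothetical structure,
# not the Dialogflow or Rasa agent format).
Bot = Dict[str, str]
# A test conversation is a list of (user utterance, expected bot reply) turns.
Conversation = List[Tuple[str, str]]

FALLBACK = "Sorry, I did not understand."

ORIGINAL_BOT: Bot = {
    "book": "Where would you like to fly?",
    "rome": "Flight to Rome booked.",
    "cancel": "Your booking was cancelled.",
}


def reply(bot: Bot, utterance: str) -> str:
    """Return the response of the first keyword found in the utterance."""
    text = utterance.lower()
    for keyword, response in bot.items():
        if keyword in text:
            return response
    return FALLBACK


def remove_intent(bot: Bot, keyword: str) -> Bot:
    """Hypothetical mutation operator: drop one intent from the bot."""
    return {k: v for k, v in bot.items() if k != keyword}


def swap_responses(bot: Bot, first: str, second: str) -> Bot:
    """Hypothetical mutation operator: exchange the responses of two intents."""
    mutant = dict(bot)
    mutant[first], mutant[second] = bot[second], bot[first]
    return mutant


def passes(bot: Bot, suite: List[Conversation]) -> bool:
    """A bot passes when every turn of every conversation matches."""
    return all(
        reply(bot, user) == expected
        for conversation in suite
        for user, expected in conversation
    )


def mutation_score(mutants: List[Bot], suite: List[Conversation]) -> float:
    """Fraction of mutants killed (i.e., failing at least one test)."""
    killed = sum(1 for mutant in mutants if not passes(mutant, suite))
    return killed / len(mutants)


# A test suite that exercises booking but never cancellation.
SUITE: List[Conversation] = [
    [
        ("I want to book a flight", "Where would you like to fly?"),
        ("To Rome", "Flight to Rome booked."),
    ]
]

MUTANTS: List[Bot] = [
    remove_intent(ORIGINAL_BOT, "rome"),
    remove_intent(ORIGINAL_BOT, "cancel"),
    swap_responses(ORIGINAL_BOT, "book", "rome"),
]

SCORE = mutation_score(MUTANTS, SUITE)
print(f"Mutation score: {SCORE:.2f}")
```

In this toy setting the mutant that removes the cancellation intent survives, because no test conversation ever asks to cancel. A surviving mutant of this kind is the sort of test-suite weakness that mutation testing is meant to expose.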
Type: paper
Keywords: Chatbot; Mutation; Rasa; Testing
Language: English
Conference: International Symposium on Software Reliability Engineering (ISSRE), 21 October 2025
Published in: 2025 IEEE 36th International Symposium on Software Reliability Engineering Workshops (ISSREW)
ISBN: 9798331553258
Pages: 205-208
Access: reserved
Clerissi, D., Masserini, E., Micucci, D., Mariani, L. (2025). Towards Multi-Platform Mutation Testing of Task-Based Chatbots. In 2025 IEEE 36th International Symposium on Software Reliability Engineering Workshops (ISSREW) (pp.205-208). Institute of Electrical and Electronics Engineers Inc. [10.1109/issrew67781.2025.00073].
Files in this item:
File: Clerissi-2025-ISSREW-VoR.pdf
Access: archive managers only
Attachment type: Publisher's Version (Version of Record, VoR)
License: All rights reserved
Size: 573.68 kB
Format: Adobe PDF

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/588563
Citations
  • Scopus: 1
  • Web of Science (ISI): 1