Chatbots, also known as conversational agents, have become ubiquitous, offering services for a multitude of domains. Unlike general-purpose chatbots, task-based chatbots are software designed to prioritize the completion of tasks of the domain they handle (e.g., flight booking). Given the growing popularity of chatbots, testing techniques that can generate full conversations as test cases have emerged. Still, thoroughly testing all the possible conversational scenarios implemented by a taskbased chatbot is challenging, resulting in incorrect behaviors that may remain unnoticed. To address this challenge, we proposed Mutabot, a mutation testing approach for injecting faults in conversations and producing faulty chatbots that emulate defects that may affect the conversational aspects. In this paper, we present our extension of Mutabot to multiple platforms (Dialogflow and Rasa), and present experiments that show how mutation testing can be used to reveal weaknesses in test suites generated by the Botium state-of-The-Art test generator.
Clerissi, D., Masserini, E., Micucci, D., Mariani, L. (2025). Towards Multi-Platform Mutation Testing of Task-Based Chatbots. In 2025 IEEE 36th International Symposium on Software Reliability Engineering Workshops (ISSREW) (pp.205-208). Institute of Electrical and Electronics Engineers Inc. [10.1109/issrew67781.2025.00073].
Towards Multi-Platform Mutation Testing of Task-Based Chatbots
Clerissi, Diego;Masserini, Elena;Micucci, Daniela;Mariani, Leonardo
2025
Abstract
Chatbots, also known as conversational agents, have become ubiquitous, offering services for a multitude of domains. Unlike general-purpose chatbots, task-based chatbots are software designed to prioritize the completion of tasks of the domain they handle (e.g., flight booking). Given the growing popularity of chatbots, testing techniques that can generate full conversations as test cases have emerged. Still, thoroughly testing all the possible conversational scenarios implemented by a taskbased chatbot is challenging, resulting in incorrect behaviors that may remain unnoticed. To address this challenge, we proposed Mutabot, a mutation testing approach for injecting faults in conversations and producing faulty chatbots that emulate defects that may affect the conversational aspects. In this paper, we present our extension of Mutabot to multiple platforms (Dialogflow and Rasa), and present experiments that show how mutation testing can be used to reveal weaknesses in test suites generated by the Botium state-of-The-Art test generator.| File | Dimensione | Formato | |
|---|---|---|---|
|
Clerissi-2025-ISSREW-VoR.pdf
Solo gestori archivio
Tipologia di allegato:
Publisher’s Version (Version of Record, VoR)
Licenza:
Tutti i diritti riservati
Dimensione
573.68 kB
Formato
Adobe PDF
|
573.68 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


