Abstract
Large Language Models (LLMs) are fast becoming indispensable tools for software developers, assisting or even partnering with them in crafting complex programs. The advantages are evident — LLMs can significantly reduce development time, generate well-organized and comprehensible code, and occasionally suggest innovative ideas that developers might not conceive on their own. However, despite their strengths, LLMs will often introduce significant errors and present incorrect code with persuasive confidence, potentially misleading developers into accepting flawed solutions. In order to bring LLMs into the software development cycle in a more reliable manner, we propose a methodology for combining them with “traditional” software engineering techniques in a structured way, with the goal of streamlining the development process, reducing errors, and enabling users to verify crucial program properties with increased confidence. Specifically, we focus on the Scenario-Based Modeling (SBM) paradigm — an event-driven, scenario-based approach for software engineering — to allow human developers to pour their expert knowledge into the LLM, as well as to inspect and verify its outputs. To evaluate our methodology, we conducted a significant case study, and used it to design and implement the Connect4 game. By combining LLMs and SBM we were able to create a highly-capable agent, which could defeat various strong existing agents. Further, in some cases, we were able to formally verify the correctness of our agent. Finally, our experience reveals interesting insights regarding the ease-of-use of our proposed approach. The full code of our case-study will be made publicly available with the final version of this paper.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 14th International Conference on Model-Based Software and Systems Engineering |
| Editors | Federico Ciccozzi, Luís Ferreira Pires, Francis Bordeleau |
| Publisher | Science and Technology Publications, Lda |
| Pages | 21-32 |
| Number of pages | 12 |
| ISBN (Print) | 9789897587986 |
| DOIs | |
| State | Published - 2026 |
| Event | 14th International Conference on Model-Based Software and Systems Engineering, MODELSWARD 2026 - Marbella, Spain Duration: 7 Mar 2026 → 9 Mar 2026 |
Publication series
| Name | International Conference on Model-Driven Engineering and Software Development |
|---|---|
| Volume | 1 |
| ISSN (Electronic) | 2184-4348 |
Conference
| Conference | 14th International Conference on Model-Based Software and Systems Engineering, MODELSWARD 2026 |
|---|---|
| Country/Territory | Spain |
| City | Marbella |
| Period | 7/03/26 → 9/03/26 |
Bibliographical note
Publisher Copyright:© 2026, Science and Technology Publications, Lda. All rights reserved.
Keywords
- Formal Verification
- Large Language Models
- Rule-Based Specifications
- Scenario-Based Modeling
Fingerprint
Dive into the research topics of 'On Integrating Large Language Models and Scenario-Based Modeling for Improving Software Reliability'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver