Integrating Domain Knowledge into Process Discovery Using Large Language Models
Norouzifar, Ali, Kourani, Humam, Dees, Marcus, van der Aalst, Wil
arXiv.org Artificial Intelligence
Process discovery aims to derive process models from event logs, providing insights into operational behavior and forming a foundation for conformance checking and process improvement. However, models derived solely from event data may not accurately reflect the real process, as event logs are often incomplete or affected by noise, and domain knowledge, an important complementary resource, is typically disregarded. As a result, the discovered models may lack reliability for downstream tasks. We propose an interactive framework that incorporates domain knowledge, expressed in natural language, into the process discovery pipeline using Large Language Models (LLMs). Our approach leverages LLMs to extract declarative rules from textual descriptions provided by domain experts. These rules are used to guide the IMr discovery algorithm, which recursively constructs process models by combining insights from both the event log and the extracted rules, helping to avoid problematic process structures that contradict domain knowledge. The framework coordinates interactions among the LLM, domain experts, and a set of backend services. We present a fully implemented tool that supports this workflow and conduct an extensive evaluation of multiple LLMs and prompt engineering strategies. Our empirical study includes a case study based on a real-life event log with the involvement of domain experts, who assessed the usability and effectiveness of the framework.
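To make the idea of declarative rules concrete, the following minimal sketch checks a few such rules against the traces of a toy event log. It is an illustration only: the abstract names neither the rule templates used nor how IMr consumes them, so the Declare-style templates (`response`, `precedence`, `not_coexistence`), the helper `count_violations`, and the activity names are all hypothetical.

```python
from typing import Callable, Dict, List

Trace = List[str]
Rule = Callable[[Trace], bool]


def response(a: str, b: str) -> Rule:
    """Assumed template: every occurrence of `a` is eventually followed by `b`."""
    def check(trace: Trace) -> bool:
        waiting = False  # True while some `a` has not yet been followed by `b`
        for activity in trace:
            if activity == a:
                waiting = True
            elif activity == b:
                waiting = False
        return not waiting
    return check


def precedence(a: str, b: str) -> Rule:
    """Assumed template: `b` may occur only after `a` has occurred."""
    def check(trace: Trace) -> bool:
        seen_a = False
        for activity in trace:
            if activity == a:
                seen_a = True
            elif activity == b and not seen_a:
                return False
        return True
    return check


def not_coexistence(a: str, b: str) -> Rule:
    """Assumed template: `a` and `b` never occur in the same trace."""
    def check(trace: Trace) -> bool:
        return not (a in trace and b in trace)
    return check


def count_violations(log: List[Trace], rules: Dict[str, Rule]) -> Dict[str, int]:
    """Count, per rule, how many traces in the log violate it."""
    return {
        name: sum(1 for trace in log if not rule(trace))
        for name, rule in rules.items()
    }


# Toy event log with hypothetical activity names.
log = [
    ["register", "check", "approve", "pay"],
    ["register", "approve", "check", "pay"],     # approve happens before check
    ["register", "check", "reject"],
    ["register", "check", "approve", "reject"],  # approve without pay, plus reject
]

# Rules as they might look after extraction from an expert's description.
rules = {
    "precedence(check, approve)": precedence("check", "approve"),
    "response(approve, pay)": response("approve", "pay"),
    "not_coexistence(approve, reject)": not_coexistence("approve", "reject"),
}

violations = count_violations(log, rules)
for name, count in violations.items():
    print(f"{name}: violated by {count} trace(s)")
```

A discovery algorithm guided by such rules could use violation counts of this kind to penalize candidate structures that contradict the stated domain knowledge. How IMr actually weighs rules against the event log is not described in this text.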
Oct-9-2025
- Country:
  - Africa
    - Middle East > Tunisia
      - Tunis Governorate > Tunis (0.04)
    - Rwanda > Kigali
      - Kigali (0.04)
  - Asia
    - China > Shaanxi Province
      - Xi'an (0.04)
    - South Korea (0.04)
  - Europe
    - Austria > Vienna (0.14)
    - Germany > North Rhine-Westphalia
      - Cologne Region > Aachen (0.04)
    - Italy > Lazio
      - Rome (0.04)
    - Netherlands > North Holland
      - Amsterdam (0.04)
    - Poland
      - Lesser Poland Province > Kraków (0.04)
      - Pomerania Province > Gdańsk (0.04)
      - Łódź Province > Łódź (0.04)
    - Portugal (0.04)
    - Spain
      - Andalusia > Seville Province
        - Seville (0.04)
      - Aragón > Zaragoza Province
        - Zaragoza (0.04)
    - Sweden > Stockholm
      - Stockholm (0.04)
  - North America
    - Canada (0.04)
    - United States
      - Illinois > Cook County
        - Chicago (0.04)
      - New Mexico > Santa Fe County
        - Santa Fe (0.04)
- Genre:
  - Overview (1.00)
  - Research Report (1.00)
  - Workflow (1.00)
- Industry:
  - Materials > Metals & Mining (0.46)
- Technology: