Slice, mine and dice: Complexity-aware automated discovery of business process models

, Dumas-Menijvar, Marlon, Garcia-Banuelos, Luciano, & (2013) Slice, mine and dice: Complexity-aware automated discovery of business process models. In Wang, J, Weber, B, & Daniel, F (Eds.) Business Process Management: 11th International Conference, BPM 2013, Proceedings [Lecture Notes in Computer Science, Vol 8094]. Springer, Germany, pp. 49-64.

Accepted Version (PDF 2MB)
process_mining_with_clone_detection.pdf.

Description

Automated process discovery techniques aim to extract models from information system logs in order to shed light on the business processes supported by these systems. Existing techniques in this space are effective when applied to relatively small or regular logs, but otherwise generate large and spaghetti-like models. In previous work, trace clustering has been applied in an attempt to reduce the size and complexity of automatically discovered process models. The idea is to split the log into clusters and to discover one model per cluster. The result is a collection of process models, each one representing a variant of the business process, as opposed to an all-encompassing model. Still, models produced in this way may exhibit unacceptably high complexity. In this setting, this paper presents a two-way divide-and-conquer process discovery technique, wherein the discovered process models are split on the one hand by variants and on the other hand hierarchically by means of subprocess extraction. The proposed technique allows users to set a desired bound for the complexity of the produced models. Experiments on real-life logs show that the technique produces collections of models that are up to 64% smaller than those extracted under the same complexity bounds by applying existing trace clustering techniques.
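
The general idea of splitting a log until every discovered model fits within a user-set complexity bound can be illustrated with a minimal sketch. This is not the paper's algorithm: the "model" here is just a set of directly-follows edges, the complexity measure is the number of distinct activities, and the splitting rule (`split_log`, a hypothetical helper) is a deliberately naive stand-in for trace clustering. Subprocess extraction is not covered at all.

```python
from collections import Counter


def discover_dfg(traces):
    """Toy 'model': the set of directly-follows edges seen in the traces."""
    edges = set()
    for trace in traces:
        edges.update(zip(trace, trace[1:]))
    return edges


def model_size(traces):
    """Assumed complexity measure: number of distinct activities."""
    return len({activity for trace in traces for activity in trace})


def split_log(traces):
    """Hypothetical splitting rule (not from the paper): pick the activity
    whose presence divides the traces most evenly and split on it."""
    counts = Counter(a for trace in traces for a in set(trace))
    # Only activities present in some, but not all, traces can split the log.
    candidates = [a for a, c in counts.items() if c < len(traces)]
    if not candidates:
        return None
    half = len(traces) / 2
    pivot = min(candidates, key=lambda a: (abs(counts[a] - half), a))
    with_pivot = [t for t in traces if pivot in t]
    without_pivot = [t for t in traces if pivot not in t]
    return with_pivot, without_pivot


def slice_and_mine(traces, max_size):
    """Recursively split the log until each sub-log yields a model whose
    size is within max_size, or until no further split is possible."""
    if model_size(traces) <= max_size:
        return [discover_dfg(traces)]
    parts = split_log(traces)
    if parts is None:
        # Cannot split further; the bound is not met for this sub-log.
        return [discover_dfg(traces)]
    left, right = parts
    return slice_and_mine(left, max_size) + slice_and_mine(right, max_size)


# Usage: four traces over seven activities, bound of four activities per model.
log = [
    ["a", "b", "c"],
    ["a", "b", "d"],
    ["a", "e", "f"],
    ["a", "e", "g"],
]
models = slice_and_mine(log, max_size=4)
```

In this toy log the whole-log model has seven activities, so the log is split once on activity `b`, giving two models of four activities each. A real implementation would replace `split_log` with a proper trace clustering method and `discover_dfg` with an actual discovery algorithm.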

Impact and interest:

34 citations in Scopus

Full-text downloads:

619 since deposited on 08 Apr 2013
24 in the past twelve months

ID Code: 58949
Item Type: Chapter in Book, Report or Conference volume (Conference contribution)
ORCID iD: La Rosa, Marcello (orcid.org/0000-0001-9568-4035)
Measurements or Duration: 16 pages
DOI: 10.1007/978-3-642-40176-3_6
ISBN: 978-3-642-40175-6
Pure ID: 32472558
Divisions: Past > QUT Faculties & Divisions > Science & Engineering Faculty
Copyright Owner: Copyright 2013 (please consult the authors).
Copyright Statement: This work is covered by copyright. Unless the document is being made available under a Creative Commons Licence, you must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a Creative Commons Licence (or other specified licence), then refer to the Licence for details of permitted re-use. It is a condition of access that users recognise and abide by the legal requirements associated with these rights. If you believe that this work infringes copyright, please provide details by email to [email protected]
Deposited On: 08 Apr 2013 01:11
Last Modified: 08 Feb 2025 08:14