skip to main content
10.1145/3341105.3373861acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
research-article

Event-log abstraction using batch session identification and clustering

Published: 30 March 2020 Publication History

Abstract

Process-Mining techniques aim to use event data about past executions to gain insight into how processes are executed. While these techniques are proven to be very valuable, they are less successful to reach their goal if the process is flexible and, hence, it exhibits an extremely large number of variants. Furthermore, information systems can record events at very low level, which do not match the high-level concepts known at business level. Without abstracting sequences of events to high-level concepts, the results of applying process mining (to, e.g., discover a model) easily become very complex and difficult to interpret, which ultimately means that they are of little use. A large body of research exists on event abstraction but typically a large amount of domain knowledge is required, which is often not readily available. Other abstraction techniques are unsupervised, which ultimately return less accurate results and/or rely on stronger assumptions. This paper puts forward a technique that requires limited domain knowledge that can be easily provided. Traces are divided in batch sessions, and each session is abstracted as one single high-level activity execution. The abstraction is based on a combination of automatic clustering and visualization methods. The technique was assessed on two case studies about processes characterized by high variability. The results clearly illustrate the benefits of the abstraction to convey accurate knowledge to stakeholders.

References

[1]
Thomas Baier. 2015. Matching Events and Activities. PhD dissertation. University of Potsdam.
[2]
Massimiliano de Leoni and Safa Dündar. 2019. From Low-Level Events to Activities - A Session-Based Approach (Extended Version). arXiv.org abs/1903.03993 (2019). http://arxiv.org/abs/1903.03993
[3]
Jochen De Weerdt. 2018. Trace Clustering. Springer International Publishing, Cham.
[4]
Bettina Fazzinga, Sergio Flesca, Filippo Furfaro, Elio Masciari, and Luigi Pontieri. 2015. A probabilistic unified framework for event abstraction and process detection from log data. In Proceedings of the 23th OTM Confederated International Conference on Cooperative Information Systems (LNCS), Vol. 9415. Springer, 320--328.
[5]
Diogo R Ferreira, Fernando Szimanski, and Célia Ghedini Ralha. 2013. Mining the low-level behaviour of agents in high-level business processes. International Journal of Business Process Integration and Management 8 6, 2 (2013), 146--166.
[6]
Christian W Günther, Anne Rozinat, and Wil M. P. van der Aalst. 2009. Activity mining by global trace segmentation. In Proceeding of the 7th International Conference on Business Process Management. Springer, 128--139.
[7]
D.J. Ketchen and C. L Shook. 1996. The application of cluster analysis in strategic management research: An analysis and critique. Strategic Management Journal 17, 6 (1996), 441--458.
[8]
Maikel Leemans and Wil M. P. van der Aalst. 2015. Discovery of Frequent Episodes in Event Logs. In The 4th International Symposium on Data-Driven Process Discovery and Analysis, (SIMPDA 2014) (LNBIP), Vol. 237. Springer, 1--31.
[9]
Sander J. J. Leemans, Dirk Fahland, and Wil M. P. van der Aalst. 2013. Discovering Block-structured Process Models From Event Logs - A Constructive Approach. In Proceedings of the 34th International Conference on Application and Theory of Petri Nets and Concurrency (Petri Net 2013) (LNCS), Vol. 7927. Springer, 311--329.
[10]
Felix Mannhardt, Massimiliano de Leoni, and Hajo A. Reijers. 2017. Heuristic Mining Revamped: An Interactive Data-aware and Conformance-aware Miner. In Proceedings of the BPM Demo Track and BPM Dissertation Award at 15th International Conference on Business Process Management, Vol. 1920. CEUR-WS.org.
[11]
Felix Mannhardt, Massimiliano de Leoni, Hajo A. Reijers, Wil M. P. van der Aalst, and Pieter J. Toussaint. 2016. From Low-level Events to Activities - A Pattern-based Approach. In Proceedings of the 14th International Conference on Business Process Management (LNCS), Vol. 9850. Springer, 125--141.
[12]
Stefania Montani, Giorgio Leonardi, Manuel Striani, Silvana Quaglini, and Anna Cavallini. 2017. Multi-level abstraction for trace comparison and process discovery. Expert Systems with Applications 81 (2017), 398 -- 409.
[13]
Erich Schubert and Arthur Zimek. 2019. ELKI: A large open-source library for data analysis - ELKI Release 0.7.5 "Heidelberg". CoRR abs/1902.03616 (2019). arXiv:1902.03616 http://arxiv.org/abs/1902.03616
[14]
Niek Tax, Natalia Sidorova, Reinder Haakma, and Wil M.P. van der Aalst. 2016. Mining Local Process Models. Journal of Innovation in Digital Ecosystems 3, 2 (2016), 183 -- 196.
[15]
Niek Tax, Natalia Sidorova, Reinder Haakma, and Wil M. P. van der Aalst. 2018. Event Abstraction for Process Mining Using Supervised Learning Techniques. In Proceedings of SAI Intelligent Systems Conference (IntelliSys) 2016. Springer, 251--269.
[16]
Wil M. P. van der Aalst. 2016. Process Mining - Data Science in Action. Springer.
[17]
Boudewijn F van Dongen and Arya Adriansyah. 2009. Process mining: fuzzy clustering and performance visualization. In Proceedings of the 7th International Conference on Business Process Management. Springer, 158--169.
[18]
M. L. van Eck, N. Sidorova, and W. M. P. van der Aalst. 2016. Enabling process mining on sensor data from smart products. In Proceedings of the Tenth IEEE International Conference on Research Challenges in Information Science (RCIS).
[19]
Romain Vuillemot, Jeremy Boy, Aurélien Tabard, Charles Perin, and Jean-Daniel Fekete (Eds.). 2016. Proceedings of the workshop LIVVIL: Logging Interactive Visualizations and Visualizing Interaction Logs. Baltimore, United States. https://hal.inria.fr/hal-01535913
[20]
Ian H. Witten, Eibe Frank, and Mark A. Hall. 2011. Data Mining: Practical Machine Learning Tools and Techniques (3 ed.). Morgan Kaufmann, Amsterdam. http://www.sciencedirect.com/science/book/9780123748560

Cited By

View all

Index Terms

  1. Event-log abstraction using batch session identification and clustering
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        SAC '20: Proceedings of the 35th Annual ACM Symposium on Applied Computing
        March 2020
        2348 pages
        ISBN:9781450368667
        DOI:10.1145/3341105
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 30 March 2020

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. clustering
        2. event log abstraction
        3. flexible processes
        4. process discovery
        5. visual analytics

        Qualifiers

        • Research-article

        Conference

        SAC '20
        Sponsor:
        SAC '20: The 35th ACM/SIGAPP Symposium on Applied Computing
        March 30 - April 3, 2020
        Brno, Czech Republic

        Acceptance Rates

        Overall Acceptance Rate 1,650 of 6,669 submissions, 25%

        Upcoming Conference

        SAC '25
        The 40th ACM/SIGAPP Symposium on Applied Computing
        March 31 - April 4, 2025
        Catania , Italy

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)57
        • Downloads (Last 6 weeks)5
        Reflects downloads up to 22 Jan 2025

        Other Metrics

        Citations

        Cited By

        View all

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media