• search hit 4 of 5
Back to Result List

Using Hidden Markov Models for the accurate linguistic analysis of process model activity labels

  • Many process model analysis techniques rely on the accurate analysis of the natural language contents captured in the models’ activity labels. Since these labels are typically short and diverse in terms of their grammatical style, standard natural language processing tools are not suitable to analyze them. While a dedicated technique for the analysis of process model activity labels was proposed in the past, it suffers from considerable limitations. First of all, its performance varies greatly among data sets with different characteristics and it cannot handle uncommon grammatical styles. What is more, adapting the technique requires in-depth domain knowledge. We use this paper to propose a machine learning-based technique for activity label analysis that overcomes the issues associated with this rule-based state of the art. Our technique conceptualizes activity label analysis as a tagging task based on a Hidden Markov Model. By doing so, the analysis of activity labels no longer requires the manual specification of rules. AnMany process model analysis techniques rely on the accurate analysis of the natural language contents captured in the models’ activity labels. Since these labels are typically short and diverse in terms of their grammatical style, standard natural language processing tools are not suitable to analyze them. While a dedicated technique for the analysis of process model activity labels was proposed in the past, it suffers from considerable limitations. First of all, its performance varies greatly among data sets with different characteristics and it cannot handle uncommon grammatical styles. What is more, adapting the technique requires in-depth domain knowledge. We use this paper to propose a machine learning-based technique for activity label analysis that overcomes the issues associated with this rule-based state of the art. Our technique conceptualizes activity label analysis as a tagging task based on a Hidden Markov Model. By doing so, the analysis of activity labels no longer requires the manual specification of rules. An evaluation using a collection of 15,000 activity labels demonstrates that our machine learning-based technique outperforms the state of the art in all aspects.show moreshow less

Export metadata

Additional Services

Search Google Scholar Statistics
Metadaten
Author details:Henrik LeopoldORCiDGND, Han van der Aa, Jelmer Offenberg, Hajo A. ReijersORCiDGND
DOI:https://doi.org/10.1016/j.is.2019.02.005
ISSN:0306-4379
ISSN:1873-6076
Title of parent work (English):Information systems
Publisher:Elsevier
Place of publishing:Oxford
Publication type:Article
Language:English
Year of first publication:2019
Publication year:2019
Release date:2021/01/18
Tag:Hidden Markov models; Label analysis; Natural language; Process model
Volume:83
Number of pages:10
First page:30
Last Page:39
Funding institution:Alexander von Humboldt FoundationAlexander von Humboldt Foundation
Organizational units:An-Institute / Hasso-Plattner-Institut für Digital Engineering gGmbH
DDC classification:0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 000 Informatik, Informationswissenschaft, allgemeine Werke
Peer review:Referiert
Accept ✔
This website uses technically necessary session cookies. By continuing to use the website, you agree to this. You can find our privacy policy here.