OATAO - Open Archive Toulouse Archive Ouverte

Using k-means for redundancy and inconsistency detection: application to industrial requirements

Mezghani, Manel and Kang Choi, Juyeon and Sèdes, Florence Using k-means for redundancy and inconsistency detection: application to industrial requirements. (2018) In: 23rd International conference on Applications of Natural Language Processing to Information Systems (NLDB 2018), 13 June 2018 - 15 June 2018 (Paris, France).

(Document in English)

PDF (Author's version), 219kB

Official URL: https://doi.org/10.1007/978-3-319-91947-8_52

Abstract

Requirements are usually “hand-written” and suffer from several problems, such as redundancy and inconsistency. These problems, whether between individual requirements or between sets of requirements, negatively impact the success of final products. Handling them manually is time-consuming and costly. In this paper, we propose to handle redundancy and inconsistency automatically using a classification approach. The main contribution is the use of the k-means algorithm for redundancy and inconsistency detection in a new context, namely Requirements Engineering. We also introduce a preprocessing step based on Natural Language Processing techniques in order to assess its impact on the k-means results. We use Part-Of-Speech (POS) tagging and noun chunking to detect technical business terms in the requirements documents that we analyze. We evaluate this approach on real industrial datasets. The results show the efficiency of the k-means clustering algorithm, especially when combined with the preprocessing.
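
As a rough illustration of the idea in the abstract, the sketch below groups requirement sentences with k-means so that requirements landing in the same cluster can be reviewed as redundancy or inconsistency candidates. It is not the authors' implementation: the abstract does not state the vectorization, the distance measure, the initialization, or the number of clusters, so the TF-IDF weighting, Euclidean distance, farthest-first initialization, and k = 2 are all assumptions. The small stopword list is only a crude stand-in for the POS tagging and noun chunking step, and the sample requirements and the helper names (`tokenize`, `tfidf_vectors`, `kmeans`, `candidate_groups`) are hypothetical.

```python
import math
import re
from collections import Counter

# Assumption: a tiny stopword list stands in for the paper's POS/noun-chunk preprocessing.
STOPWORDS = {"the", "a", "an", "shall", "when", "if", "in", "of", "to", "be", "is"}

def tokenize(text):
    """Lowercase, split on non-alphanumerics, drop stopwords."""
    return [t for t in re.findall(r"[a-z0-9]+", text.lower()) if t not in STOPWORDS]

def tfidf_vectors(docs):
    """Return L2-normalized TF-IDF vectors (smoothed IDF) over a shared vocabulary."""
    tokenized = [tokenize(d) for d in docs]
    vocab = sorted({t for toks in tokenized for t in toks})
    n = len(docs)
    df = Counter(t for toks in tokenized for t in set(toks))
    idf = {t: math.log((1 + n) / (1 + df[t])) + 1 for t in vocab}
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        vec = [tf[t] * idf[t] for t in vocab]
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        vectors.append([x / norm for x in vec])
    return vectors

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(vectors, k, max_iter=100):
    """Plain k-means with a deterministic farthest-first initialization (an assumption)."""
    centroids = [list(vectors[0])]
    while len(centroids) < k:
        far = max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        centroids.append(list(far))
    labels = None
    for _ in range(max_iter):
        # Assignment step: each vector goes to its nearest centroid.
        new_labels = [min(range(k), key=lambda j: sq_dist(v, centroids[j])) for v in vectors]
        if new_labels == labels:
            break
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def candidate_groups(labels):
    """Indices of requirements sharing a cluster; only clusters with 2+ members."""
    groups = {}
    for idx, lab in enumerate(labels):
        groups.setdefault(lab, []).append(idx)
    return [members for _, members in sorted(groups.items()) if len(members) > 1]

# Hypothetical requirements, invented for this sketch.
requirements = [
    "The pump shall stop when the pressure exceeds 10 bar.",
    "The pump shall stop if the measured pressure exceeds 10 bar.",
    "The display shall show the battery level.",
    "The display shall show battery level in percent.",
]

labels = kmeans(tfidf_vectors(requirements), k=2)
for group in candidate_groups(labels):
    print("Review together:", [requirements[i] for i in group])
```

Clustering only narrows down which requirements to compare; deciding whether a group is truly redundant or inconsistent would still need a further check, and the choice of k strongly affects how coarse the groups are.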

Item Type:Conference or Workshop Item (Paper)
Additional Information:Thanks to Springer editor. This paper appears in Volume 10859 of Lecture Notes in Computer Science, ISSN: 0302-9743, ISBN 978-3-319-91946-1. The original PDF is available at: https://link.springer.com/chapter/10.1007/978-3-319-91947-8_52
HAL Id:hal-02305354
Audience (conference):International conference proceedings
Institution:Université de Toulouse > Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE)
French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UT3 (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Deposited On:24 Sep 2019 09:37
