OATAO - Open Archive Toulouse Archive Ouverte

A clustering approach for detecting defects in technical documents

Mezghani, Manel and Kang Choi, Juyeon and Sèdes, Florence A clustering approach for detecting defects in technical documents. (2018) In: 13th International Workshop on Natural Language Processing and Cognitive Science (NLPCS 2018), 11 September 2018 - 12 September 2018 (Krakow, Poland).

(Document in English)

PDF (Author's version)


Requirements are usually “hand-written” and suffer from several problems, such as redundancy and inconsistency. Redundancy and inconsistency between requirements or sets of requirements negatively impact the success of final products. Processing these issues manually is time-consuming and very costly. The main contribution of this paper is the use of the k-means algorithm for redundancy and inconsistency detection in a new context, namely Requirements Engineering. We also introduce a pre-processing step based on Natural Language Processing (NLP) techniques in order to assess its impact on the k-means results. We use Part-Of-Speech (POS) tagging and noun chunking to detect technical business terms associated with the requirements documents that we analyze. We evaluate this approach on real industrial datasets. The results show the efficiency of the k-means clustering algorithm, especially with the pre-processing.
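The general idea of grouping similar requirements with k-means can be illustrated with a minimal sketch. It is not the authors' implementation: the sample requirements, the `STOPWORDS` list, and the helper names (`vectorize`, `sq_dist`, `mean_vector`, `kmeans`) are all assumptions made for illustration, and a small stopword filter stands in for the POS tagging and noun chunking step, which would need an NLP library. Requirements that land in the same cluster are only candidates for manual review as possible redundancies or inconsistencies.

```python
from collections import Counter

# Hypothetical stopword list: a simple stand-in for the NLP pre-processing
# step (POS tagging and noun chunking) described in the abstract.
STOPWORDS = {"the", "shall", "when", "is", "be", "on"}

def vectorize(text):
    """Turn one requirement into a sparse bag-of-words vector."""
    tokens = (t.strip(".,;:") for t in text.lower().split())
    return dict(Counter(t for t in tokens if t and t not in STOPWORDS))

def sq_dist(u, v):
    """Squared Euclidean distance between two sparse vectors."""
    return sum((u.get(t, 0) - v.get(t, 0)) ** 2 for t in set(u) | set(v))

def mean_vector(vectors):
    """Component-wise mean of a non-empty list of sparse vectors."""
    total = Counter()
    for v in vectors:
        total.update(v)  # adds the term counts of v to the running total
    return {t: c / len(vectors) for t, c in total.items()}

def kmeans(vectors, k, max_iter=20):
    """Lloyd's k-means with a deterministic farthest-first initialisation."""
    centroids = [vectors[0]]
    while len(centroids) < k:
        # Pick the vector farthest from its nearest existing centroid.
        centroids.append(
            max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        )
    labels = None
    for _ in range(max_iter):
        # Assignment step: each vector goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: sq_dist(v, centroids[c]))
            for v in vectors
        ]
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, l in zip(vectors, labels) if l == c]
            if members:
                centroids[c] = mean_vector(members)
    return labels

# Invented sample requirements, not taken from the paper's datasets.
requirements = [
    "The pump shall start when the tank level is low.",
    "When the tank level is low, the pump shall start.",
    "The pump shall not start when the tank level is low.",
    "The display shall show the battery voltage.",
    "The battery voltage shall be shown on the display.",
]

labels = kmeans([vectorize(r) for r in requirements], k=2)
for label, text in sorted(zip(labels, requirements)):
    print(label, text)
```

In this sketch the reworded duplicate and the negated variant of the pump requirement share a cluster with the original, which is the kind of grouping a reviewer would then inspect by hand. A production setup would more likely use TF-IDF weighting and a library implementation of k-means.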

Item Type:Conference or Workshop Item (Paper)
Additional Information:ISBN 978-83-949716-7-0
HAL Id:hal-02191796
Audience (conference):International conference proceedings
Uncontrolled Keywords:
Institution:French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Université de Toulouse > Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UT3 (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Other partners > Prometil (FRANCE)
Laboratory name:
Occitanie region of France in the framework of )-ELENAA (des Exigences en LanguEs Naturelles à leurs Analyses Automatiques) project - Région Occitanie (France)
Deposited On:22 Jul 2019 12:26
