OATAO - Open Archive Toulouse Archive Ouverte Open Access Week

A Supervised Machine Learning Approach for Taxonomic Relation Recognition through Non-linear Enumerative Structures

Fauconnier, Jean-Philippe and Kamel, Mouna and Rothenburger, Bernard A Supervised Machine Learning Approach for Taxonomic Relation Recognition through Non-linear Enumerative Structures. (2015) In: 30th ACM Symposium on Applied Computing (SAC 2015), 13 April 2015 - 17 April 2015 (Salamanque, Spain).

[img] (Document in English)

PDF (Author's version) - Depositor and staff only - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
152kB

Official URL: http://dx.doi.org/10.1145/2695664.2695988

Abstract

Improving relation extraction process requires to have a better insight of the proper text or to use external resources. Our work lies in the first term of this alternative, and aim at extending works about semantic relation identification in texts for building taxonomies which constitute the backbone of ontologies on which Semantic Web applications are built. We consider a specific discursive structure, the enumerative structure, as it bears explicit hierarchical knowledge. This structure is expressed with the help of lexical or typo-dispositional markers whose role is to introduce hierarchical levels between its components. Typo-dispositional markers are unfortunately not integrated into most parsing systems used for information extraction tasks. In order to extend the taxonomic relation identification process, we thus propose a method for recognizing this relation through enumerative structures which benefit from typo-dispositional markers (we called them non-linear enumerative structures). Our method is based on supervised machine learning. Two strategies have been applied: a linear classification with a MaxEnt and a non-linear one with a SVM. The results obtained in each of these approaches are close, with respectively an F1 of 81.25% and of 81.77%.

Item Type:Conference or Workshop Item (Paper)
Additional Information:Thanks to ACM editor. The definitive version is available at http://dl.acm.org This papers appears in SAC '15 ISBN: 978-1-4503-3196-8 The original PDF is available at: http://dl.acm.org/citation.cfm?id=2695988
Audience (conference):International conference proceedings
Uncontrolled Keywords:
Institution:Université de Toulouse > Institut National Polytechnique de Toulouse - INPT (FRANCE)
French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UPS (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Laboratory name:
Statistics:download
Deposited By: IRIT IRIT
Deposited On:11 Oct 2016 14:09

Repository Staff Only: item control page