OATAO - Open Archive Toulouse Archive Ouverte Open Access Week

Towards Enriching DBpedia from Vertical Enumerative Structures Using a Distant Learning Approach

Kamel, Mouna and Trojahn, Cassia Towards Enriching DBpedia from Vertical Enumerative Structures Using a Distant Learning Approach. (2018) In: International Conference on Knowledge Engineering and Knowledge Management (EKAW 2018), 12 November 2018 - 16 November 2018 (Nancy, France).

[img]
Preview
(Document in English)

PDF (Author's version) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
855kB

Official URL: https://doi.org/10.1007/978-3-030-03667-6_12

Abstract

Automatic construction of semantic resources at large scale usually relies on general purpose corpora as Wikipedia. This resource, by nature rich in encyclopedic knowledge, exposes part of this knowledge with strongly structured elements (infoboxes, categories, etc.). Several extractors have targeted these structures in order to enrich or to populate semantic resources as DBpedia, YAGO or BabelNet. The remain semi-structured textual structures, such as vertical enumerative structures (those using typographic and dispositional layout) have been however under-exploited. However, frequent in corpora, they are rich sources of specific semantic relations, such as hypernyms. This paper presents a distant learning approach for extracting hypernym relations from vertical enumerative structures of Wikipedia, with the aim of enriching DBpedia. Our relation extraction approach achieves an overall precision of 62%, and 99% of the extracted relations can enrich DBpedia, with respect to a reference corpus.

Item Type:Conference or Workshop Item (Paper)
Additional Information:Thanks to Springer editor. This papers appears in Volume 11313 of Lecture Notes in Computer Science ISSN : 0302-9743 ISBN 978-3-030-03666-9 The original PDF is available at: https://link.springer.com/chapter/10.1007/978-3-030-03667-6_12
HAL Id:hal-02089278
Audience (conference):International conference proceedings
Uncontrolled Keywords:
Institution:Université de Toulouse > Institut National Polytechnique de Toulouse - INPT (FRANCE)
French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UPS (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Laboratory name:
Statistics:download
Deposited By: IRIT IRIT
Deposited On:26 Mar 2019 10:06

Repository Staff Only: item control page