OATAO - Open Archive Toulouse Archive Ouverte

A semi-automatic approach for building ontologies from a collection of structured web documents

Kamel, Mouna and Aussenac-Gilles, Nathalie and Buscaldi, Davide and Comparot, Catherine. A semi-automatic approach for building ontologies from a collection of structured web documents. (2013) In: 7th International Conference on Knowledge Capture (K-CAP 2013), 23 June 2013 - 26 June 2013 (Banff, Canada).

(Document in English)

PDF (Author's version), 252kB

Abstract

Many collections of structured documents are available on the web. Such a collection generally describes the characteristics of entities of a single type, with each page describing one entity. These documents are adequate knowledge sources for building ontologies. Because they rely on a strong, shared layout, they contain less well-written text than plain-text files, but their structure is highly meaningful. Classical linguistics-based methods for identifying concepts and relations are therefore no longer appropriate for analyzing them. The approach we propose in this paper exploits various properties of such documents, combining layout and formatting analysis with linguistic analysis, and using semantic annotation.

Item Type: Conference or Workshop Item (Paper)
Additional Information: Thanks to ACM editor. This paper appears in K-CAP'13, ISBN: 978-1-4503-2102-0. The definitive version is available at: http://dl.acm.org/ft_gateway.cfm?id=2479856
HAL Id: hal-01264565
Audience (conference): International conference proceedings
Uncontrolled Keywords:
Institution: French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Université de Toulouse > Institut National Polytechnique de Toulouse - INPT (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UPS (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Other partners > Université Paris 13 (FRANCE)
Laboratory name:
Deposited By: IRIT IRIT
Deposited On: 07 Dec 2015 09:40