OATAO - Open Archive Toulouse Archive Ouverte

Semi-automatic enrichment of crowdsourced synonymy networks: the WISIGOTH system applied to Wiktionary

Sajous, Franck and Navarro, Emmanuel and Gaume, Bruno and Prévot, Laurent and Chudy, Yannick. Semi-automatic enrichment of crowdsourced synonymy networks: the WISIGOTH system applied to Wiktionary. (2013) Language Resources and Evaluation, 47 (1). 63-96. ISSN 1574-020X

(Document in English)

PDF (Author's version), 771kB

Official URL: http://dx.doi.org/10.1007/s10579-011-9168-6

Abstract

Semantic lexical resources are a mainstay of various Natural Language Processing applications. However, comprehensive and reliable resources are rare and often not freely available. Handcrafted resources are too costly to be a general solution, while automatically built resources need to be validated by experts or at least thoroughly evaluated. In this paper, we give an overview of the current situation with regard to lexical resources, their construction and their evaluation. We give an in-depth description of Wiktionary, a freely available and collaboratively built multilingual dictionary. Wiktionary is presented here as a promising raw resource for NLP. We propose a semi-automatic approach based on random walks for enriching the Wiktionary synonymy network, using both endogenous and exogenous data. We take advantage of the wiki infrastructure to propose a validation "by crowds". Finally, we present an implementation called WISIGOTH, which supports our approach.
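
To make the random-walk idea in the abstract more concrete, here is a minimal Python sketch. It is not the WISIGOTH implementation and does not reproduce the similarity measures of the paper. It assumes a toy undirected synonymy graph and uses only existing synonymy links (endogenous data). It ranks words that are not yet linked to a target word by the probability that a short uniform random walk starting from the target reaches them. The function names (build_graph, walk_distribution, suggest_synonyms) and the example words are hypothetical and chosen for illustration.

```python
from collections import defaultdict


def build_graph(edges):
    """Build an undirected adjacency map from (word, word) synonym pairs."""
    graph = defaultdict(set)
    for a, b in edges:
        graph[a].add(b)
        graph[b].add(a)
    return graph


def walk_distribution(graph, start, steps):
    """Exact probability of being at each node after `steps` uniform
    random-walk steps from `start` (no sampling involved)."""
    dist = {start: 1.0}
    for _ in range(steps):
        nxt = defaultdict(float)
        for node, p in dist.items():
            neighbours = graph.get(node, set())
            if not neighbours:
                # Isolated node: the walker stays where it is.
                nxt[node] += p
                continue
            share = p / len(neighbours)
            for n in neighbours:
                nxt[n] += share
        dist = dict(nxt)
    return dist


def suggest_synonyms(graph, word, steps=2, top=3):
    """Rank words not yet linked to `word` by walk probability.
    The returned candidates are suggestions for human validation,
    not confirmed synonyms."""
    dist = walk_distribution(graph, word, steps)
    known = set(graph.get(word, set())) | {word}
    candidates = [(w, p) for w, p in dist.items() if w not in known]
    # Highest probability first; ties broken alphabetically.
    candidates.sort(key=lambda wp: (-wp[1], wp[0]))
    return candidates[:top]


# Toy data, invented for this example.
toy_edges = [("big", "large"), ("large", "huge"),
             ("big", "great"), ("huge", "enormous")]
toy_graph = build_graph(toy_edges)
print(suggest_synonyms(toy_graph, "big"))
```

In a semi-automatic setting such as the one the abstract describes, a ranked list like this would only be a set of proposals; a contributor would accept or reject each candidate before it is added to the network.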

Item Type: Article
Additional Information: Thanks to the Springer editor. The definitive version is available at http://link.springer.com. The original PDF of the article can be found on the Language Resources and Evaluation website: http://link.springer.com/article/10.1007%2Fs10579-011-9168-6
HAL Id: hal-01154247
Audience (journal): International peer-reviewed journal
Institution: Other partners > Aix-Marseille Université - AMU (FRANCE)
French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Université de Toulouse > Institut National Polytechnique de Toulouse - INPT (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UPS (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Deposited By: IRIT IRIT
Deposited On: 10 Mar 2015 08:53