OATAO - Open Archive Toulouse Archive Ouverte Open Access Week

Corpus Annotation within the French FrameNet: a Domain-by-domain Methodology

Djemaa, Marianne and Candito, Marie and Muller, Philippe and Vieu, Laure Corpus Annotation within the French FrameNet: a Domain-by-domain Methodology. (2016) In: 10th edition of the Language Resources and Evaluation Conference (LREC 2016), 23 May 2016 - 28 May 2016 (Portoroz, Slovenia).

[img]
Preview
(Document in English)

PDF (Author's version) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
99kB

Abstract

This paper reports on the development of a French FrameNet, within the ASFALDA project. While the first phase of the project focused on the development of a French set of frames and corresponding lexicon (Candito et al., 2014), this paper concentrates on the subsequent corpus annotation phase, which focused on four notional domains (commercial transactions, cognitive stances, causality and verbal communication). Given full coverage is not reachable for a relatively " new " FrameNet project, we advocate that focusing on specific notional domains allowed us to obtain full lexical coverage for the frames of these domains, while partially reflecting word sense ambiguities. Furthermore, as frames and roles were annotated on two French Treebanks (the French Treebank (Abeillé and Barrier, 2004) and the Sequoia Treebank (Candito and Seddah, 2012), we were able to extract a syntactico-semantic lexicon from the annotated frames. In the resource's current status, there are 98 frames, 662 frame-evoking words, 872 senses, and about 13000 annotated frames, with their semantic roles assigned to portions of text. The French FrameNet is freely available at alpage.inria.fr/asfalda.

Item Type:Conference or Workshop Item (Paper)
Additional Information:Thanks to European Language Resources Association (ELRA). This papers appears in Proceedings of LREC 2016 ISBN: 978-2-9517408-9-1 The definitive version is available at : http://www.lrec-conf.org/proceedings/lrec2016/index.html The original PDF of the article can be found at: http://www.lrec-conf.org/proceedings/lrec2016/summaries/292.html
Audience (conference):International conference proceedings
Uncontrolled Keywords:
Institution:French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Université de Toulouse > Institut National Polytechnique de Toulouse - INPT (FRANCE)
French research institutions > Institut National de la Recherche en Informatique et en Automatique - INRIA (FRANCE)
Other partners > Université de Paris Diderot - Paris 7 (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UPS (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Laboratory name:
Statistics:download
Deposited By: IRIT IRIT
Deposited On:24 Jan 2018 14:49

Repository Staff Only: item control page