OATAO - Open Archive Toulouse Archive Ouverte Open Access Week

RFreeStem: A multilanguage rule-free stemmer

Baril, Xavier and Coustié, Oihana and Mothe, Josiane and Teste, Olivier RFreeStem: A multilanguage rule-free stemmer. (2019) In: 37e Congres Informatique des Organisations et Systemes d'Information et de Decision (INFORSID 2019), 11 June 2019 - 14 June 2019 (Paris, France).

[img]
Preview
(Document in English)

PDF (Author's version) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
395kB

Official URL: http://inforsid.fr/actes/2019/Actes_INFORSID2019.pdf

Abstract

With the large expansion of available textual data, text mining has become of specialinterest. Due to their unstructured nature, such data require important preprocessing steps.Among them, stemming is a popularly used preprocessing method that extracts the root of thewords. However, the most popular algorithms are based on the application of rules, and there-fore highly language-related. We propose a new approach, the RFreeStem, that is rather basedon corpus and can therefore be applied on many languages.

Item Type:Conference or Workshop Item (Paper)
Audience (conference):National conference proceedings
Uncontrolled Keywords:
Institution:Other partners > Airbus (FRANCE)
French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Université de Toulouse > Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UT3 (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Laboratory name:
Statistics:download
Deposited On:23 Jun 2020 13:27

Repository Staff Only: item control page