OATAO - Open Archive Toulouse Archive Ouverte

Word sense discrimination in information retrieval: a spectral clustering-based approach

Chifu, Adrian-Gabriel and Hristea, Florentina and Mothe, Josiane and Popescu, Marius. Word sense discrimination in information retrieval: a spectral clustering-based approach. (2014) Information Processing & Management, 51 (2). 16-31. ISSN 0306-4573

(Document in English)

Official URL: http://dx.doi.org/10.1016/j.ipm.2014.10.007

Abstract

Word sense ambiguity has been identified as a cause of poor precision in information retrieval (IR) systems. Word sense disambiguation and discrimination methods have been defined to help systems choose which documents should be retrieved in relation to an ambiguous query. However, the only approaches that show a genuine benefit for word sense discrimination or disambiguation in IR are generally supervised ones. In this paper we propose a new unsupervised method that uses word sense discrimination in IR. The method we develop is based on spectral clustering and reorders an initially retrieved document list by boosting documents that are semantically similar to the target query. For several TREC ad hoc collections we show that our method is useful in the case of queries which contain ambiguous terms. We are interested in improving the level of precision after 5, 10 and 30 retrieved documents (P@5, P@10 and P@30). We show that precision can be improved by 8% over current state-of-the-art baselines. We also focus on poorly performing queries.
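
The P@5, P@10 and P@30 measures named in the abstract are instances of precision at rank k: the fraction of the top k retrieved documents that are relevant. As a minimal sketch of that standard definition (not the authors' evaluation code), the following Python function computes it. The function name `precision_at_k`, the document identifiers and the relevance set are all hypothetical, chosen only for illustration.

```python
from typing import Sequence, Set


def precision_at_k(ranked_doc_ids: Sequence[str], relevant_ids: Set[str], k: int) -> float:
    """Return the fraction of the top-k ranked documents that are relevant.

    Assumption: the denominator is always k, even if fewer than k
    documents were retrieved (a common convention in IR evaluation).
    """
    if k <= 0:
        raise ValueError("k must be a positive integer")
    top_k = ranked_doc_ids[:k]
    hits = sum(1 for doc_id in top_k if doc_id in relevant_ids)
    return hits / k


# Hypothetical ranked list and relevance judgments, for illustration only.
ranking = ["d3", "d7", "d1", "d9", "d4", "d2", "d8", "d5", "d6", "d0"]
relevant = {"d1", "d3", "d4", "d6"}

p_at_5 = precision_at_k(ranking, relevant, 5)    # 3 relevant in top 5
p_at_10 = precision_at_k(ranking, relevant, 10)  # 4 relevant in top 10
print(p_at_5, p_at_10)
```

A re-ranking method such as the one described above improves P@k when it moves relevant documents from lower ranks into the top k positions of the list.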

Item Type: Article
Additional Information: Thanks to the Elsevier editor. The definitive version is available at http://www.sciencedirect.com. The original PDF of the article can be found on the Information Processing & Management website: http://www.sciencedirect.com/science/article/pii/S0306457314001046
HAL Id: hal-01153775
Audience (journal): International peer-reviewed journal
Institution: French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Université de Toulouse > Institut National Polytechnique de Toulouse - INPT (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UPS (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Other partners > University of Bucharest (ROMANIA)
Deposited By: IRIT IRIT
Deposited On: 04 Feb 2015 16:19