OATAO - Open Archive Toulouse Archive Ouverte Open Access Week

TournaRank: when retrieval becomes document competition

Hubert, Gilles and Pitarch, Yoann and Pinel-Sauvagnat, Karen and Tournier, Ronan and Laporte, Léa TournaRank: when retrieval becomes document competition. (2018) Information Processing and Management, 54 (2). 252-272. ISSN 0306-4573

[img]
Preview
(Document in English)

PDF (Author's version) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
1MB

Official URL: https://doi.org/10.1016/j.ipm.2017.11.006

Abstract

Numerous feature-based models have been recently proposed by the information retrieval community. The capability of features to express different relevance facets (query- or document-dependent) can explain such a success story. Such models are most of the time supervised, thus requiring a learning phase. To leverage the advantages of feature-based representations of documents, we propose TournaRank, an unsupervised approach inspired by real-life game and sport competition principles. Documents compete against each other in tournaments using features as evidences of relevance. Tournaments are modeled as a sequence of matches, which involve pairs of documents playing in turn their features. Once a tournament is ended, documents are ranked according to their number of won matches during the tournament. This principle is generic since it can be applied to any collection type. It also provides great flexibility since different alternatives can be considered by changing the tournament type, the match rules, the feature set, or the strategies adopted by documents during matches. TournaRank was experimented on several collections to evaluate our model in different contexts and to compare it with related approaches such as Learning To Rank and fusion ones: the TREC Robust2004 collection for homogeneous documents, the TREC Web2014 (ClueWeb12) collection for heterogeneous web documents, and the LETOR3.0 collection for comparison with supervised feature-based models.

Item Type:Article
HAL Id:hal-02640769
Audience (journal):International peer-reviewed journal
Uncontrolled Keywords:
Institution:French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Université de Toulouse > Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE)
Other partners > Institut National des Sciences Appliquées de Lyon - INSA (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UT3 (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Other partners > Université Claude Bernard-Lyon I - UCBL (FRANCE)
Other partners > Ecole Centrale de Lyon (FRANCE)
Other partners > Université Lumière-Lyon 2 (FRANCE)
Other partners > Université Jean Moulin Lyon 3 (FRANCE)
Laboratory name:
Statistics:download
Deposited On:29 Apr 2020 10:06

Repository Staff Only: item control page