OATAO - Open Archive Toulouse Archive Ouverte Open Access Week

DeShaTo: Describing the Shape of Cumulative Topic Distributions to Rank Retrieval Systems without Relevance Judgments

Ionescu, Radu Tudor and Chifu, Adrian-Gabriel and Mothe, Josiane DeShaTo: Describing the Shape of Cumulative Topic Distributions to Rank Retrieval Systems without Relevance Judgments. (2015) In: 22nd International Symposium on String Processing and Information Retrieval (SPIRE 2015), 1 September 2015 - 4 September 2015 (London, United Kingdom).

[img]
Preview
(Document in English)

PDF (Author's version) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
163kB

Official URL: http://dx.doi.org/10.1007/978-3-319-23826-5_8

Abstract

This paper investigates an approach for estimating the effectiveness of any IR system. The approach is based on the idea that a set of documents retrieved for a specific query is highly relevant if there are only a small number of predominant topics in the retrieved documents. The proposed approach is to determine the topic probability distribution of each document offline, using Latent Dirichlet Allocation. Then, for a retrieved set of documents, a set of probability distribution shape descriptors, namely the skewness and the kurtosis, are used to compute a score based on the shape of the cumulative topic distribution of the respective set of documents. The proposed model is termed DeShaTo, which is short for Describing the Shape of cumulative Topic distributions. In this work, DeShaTo is used to rank retrieval systems without relevance judgments. In most cases, the empirical results are better than the state of the art approach. Compared to other approaches, DeShaTo works independently for each system. Therefore, it remains reliable even when there are less systems to be ranked by relevance.

Item Type:Conference or Workshop Item (Paper)
Additional Information:Thanks to Springer editor. This papers appears in Volume 9309 Lecture Notes in Computer Science ISSN : 0302-9743. ISBN: 978-3-319-23825-8. The original PDF is available at: http://link.springer.com/chapter/10.1007%2F978-3-319-23826-5_8
Audience (conference):International conference proceedings
Uncontrolled Keywords:
Institution:French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Université de Toulouse > Institut National Polytechnique de Toulouse - INPT (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UPS (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Other partners > University of Bucharest (ROMANIA)
Laboratory name:
Statistics:download
Deposited By: IRIT IRIT
Deposited On:24 Jun 2016 13:11

Repository Staff Only: item control page