OATAO - Open Archive Toulouse Archive Ouverte Open Access Week

A probabilistic model to exploit user expectations in XML information retrieval

Dahak, Fouad and Boughanem, Mohand and Balla, Amar A probabilistic model to exploit user expectations in XML information retrieval. (2017) Information Processing & Management, 53 (1). 87-105. ISSN 0306-4573

[img]
Preview
(Document in English)

PDF (Author's version) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
2MB

Official URL: https://doi.org/10.1016/j.ipm.2016.06.008

Abstract

The main objective of this paper is to exploit a new source of evidence derived from the document hierarchical structure for XML information retrieval. We consider that the structure of XML document is an important source of prior knowledge, and the structural features of an element may influence the user to consider that element as relevant. We build a probabilistic model to estimate the probability that the structural characteristics of an element attract user to explore the content of this element and consider it as relevant. This probability reflects the context importance. We propose a simple, well-motivated probabilistic model to estimate the context importance. Finally, we demonstrate the effectiveness of the context importance through comprehensive experimental studies carried out on IEEE XML document collection. Experimental results show that the proposed approach outperforms models exploiting other sources of evidence.

Item Type:Article
Additional Information:Thanks to Elsevier editor. This papers appears in volume 53 Information Processing & Management ISSN: 0306-4573 The definitive version is available at: http://www.sciencedirect.com The original PDF of the article can be found at : http://www.sciencedirect.com/science/article/pii/S0306457316302278?via%3Dihub
HAL Id:hal-01682968
Audience (journal):International peer-reviewed journal
Uncontrolled Keywords:
Institution:French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Other partners > Ecole nationale supérieure d'informatique - ESI (ALGERIA)
Université de Toulouse > Institut National Polytechnique de Toulouse - INPT (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UPS (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Laboratory name:
Statistics:download
Deposited By: IRIT IRIT
Deposited On:19 Dec 2017 10:15

Repository Staff Only: item control page