OATAO - Open Archive Toulouse Archive Ouverte Open Access Week

On Ranking Relevant Entities in Heterogeneous Networks Using a Language-Based Model

Soulier, Laure and Ben Jabeur, Lamjed and Tamine, Lynda and Bahsoun, Wahiba On Ranking Relevant Entities in Heterogeneous Networks Using a Language-Based Model. (2013) Journal of the Association for Information Science and Technology, vol. 64 (n° 3). pp. 500-515. ISSN 2330-1635

[img]
Preview
(Document in English)

PDF (Author's version) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
507kB

Official URL: http://dx.doi.org/DOI:10.1002/asi.22762

Abstract

A new challenge, accessing multiple relevant entities, arises from the availability of linked heterogeneous data. In this article, we address more specifically the problem of accessing relevant entities, such as publications and authors within a bibliographic network, given an information need. We propose a novel algorithm, called BibRank, that estimates a joint relevance of documents and authors within a bibliographic network. This model ranks each type of entity using a score propagation algorithm with respect to the query topic and the structure of the underlying bi-type information entity network. Evidence sources, namely content-based and network-based scores, are both used to estimate the topical similarity between connected entities. For this purpose, authorship relationships are analyzed through a language model-based score on the one hand and on the other hand, non topically related entities of the same type are detected through marginal citations. The article reports the results of experiments using the Bibrank algorithm for an information retrieval task. The CiteSeerX bibliographic data set forms the basis for the topical query automatic generation and evaluation. We show that a statistically significant improvement over closely related ranking models is achieved.

Item Type:Article
Additional Information:Thanks to Wiley editor. The definitive version is available at http://onlinelibrary.wiley.com/doi/10.1002/asi.22762/abstract
HAL Id:hal-01118673
Audience (journal):International peer-reviewed journal
Uncontrolled Keywords:
Institution:Université de Toulouse > Institut National Polytechnique de Toulouse - INPT (FRANCE)
French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UPS (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Laboratory name:
Statistics:download
Deposited By: IRIT IRIT
Deposited On:19 Feb 2015 15:19

Repository Staff Only: item control page