Using Wikipedia categories and links in entity ranking

Vercoustre, A, Pehcevski, J and Thom, J 2008, 'Using Wikipedia categories and links in entity ranking', in N. Fuhr, J. Kamps, M. Lalmas, A. Trotman (eds.) Focused Access to XML Documents, Germany, 17-19 December, 2007, pp. 321-335.


Document type: Conference Paper
Collection: Conference Papers

Title: Using Wikipedia categories and links in entity ranking
Author(s): Vercoustre, A; Pehcevski, J; Thom, J
Year: 2008
Conference name: 6th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2007
Conference location: Germany
Conference dates: 17-19 December, 2007
Proceedings title: Focused Access to XML Documents
Editor(s): N. Fuhr, J. Kamps, M. Lalmas, A. Trotman
Publisher: Springer
Place of publication: Germany
Start page: 321
End page: 335
Total pages: 15
Abstract: This paper describes the participation of the INRIA group in the INEX 2007 XML entity ranking and ad hoc tracks. We developed a system for ranking Wikipedia entities in answer to a query. Our approach uses the known categories, the link structure of Wikipedia, and the link co-occurrences with the examples (when provided) to improve the effectiveness of entity ranking. Our experiments on both the training and the testing data sets demonstrate that the use of categories and the link structure of Wikipedia can significantly improve entity retrieval effectiveness. We also used our system for the ad hoc tasks by inferring target categories from the title of the query. The results were worse than when using a full-text search engine, which confirms our hypothesis that ad hoc retrieval and entity retrieval are two different tasks.
Subjects: Information Retrieval and Web Search
Keyword(s): XML; entity ranking; Wikipedia
Copyright notice: © Springer-Verlag Berlin Heidelberg 2008
ISBN: 9783540859017
Citation counts: Thomson Reuters Web of Science: cited 11 times; Scopus: cited 16 times