A testbed for Indonesian text retrieval

Asian, J, Williams, H and Tahaghoghi, S 2004, 'A testbed for Indonesian text retrieval', in Proceedings of the Ninth Australasian Document Computing Symposium, Melbourne, Australia, 13 December 2004.


Document type: Conference Paper
Collection: Conference Papers

Title A testbed for Indonesian text retrieval
Author(s) Asian, J; Williams, H; Tahaghoghi, S
Year 2004
Conference name Australasian Document Computing Symposium
Conference location Melbourne, Australia
Conference dates 13 December 2004
Proceedings title Proceedings of the Ninth Australasian Document Computing Symposium
Publisher University of Melbourne
Place of publication Melbourne, Australia
Abstract Indonesia is the fourth most populous country and a close neighbour of Australia. However, despite media and intelligence interest in Indonesia, little work has been done on evaluating Information Retrieval techniques for Indonesian, and no standard testbed exists for such a purpose. An effective testbed should include a collection of documents, realistic queries, and relevance judgements. The TREC and TDT testbeds have provided such an environment for the evaluation of English, Mandarin, and Arabic text retrieval techniques. The NTCIR testbed provides a similar environment for Chinese, Korean, Japanese, and English. This paper describes an Indonesian TREC-like testbed we have constructed and made available for the evaluation of ad hoc retrieval techniques. To illustrate how the test collection is used, we briefly report the effect of stemming for Indonesian text retrieval, showing that, as with English, it has little effect on accuracy.
Subjects Business Information Management (incl. Records, Knowledge and Information Management, and Intelligence)
Keyword(s) Indonesian; stemming; relevance judgements; collection; queries
Copyright notice ©2004 University of Melbourne, Department of Computer Science and Software Engineering